Deep Learning in Speech Synthesis (Google Research Talk)

Source

Evernote/IFTTT Feedly/Deep Learning in Speech Synthesis.md

Summary

2013 년 구글 연구 발표로, 통계적 파라메트릭 음성 합성 (Statistical Parametric Speech Synthesis) 에 딥러닝을 적용한 최근 사례를 소개합니다. 기존 은닉 마르코프 모델 (HMM) 기반 접근법과 딥러닝 기반 접근법의 차이를 비교 분석합니다.

Key Points

통계적 파라메트릭 음성 합성 분야에 딥러닝 적용 사례 제시
기존 HMM 기반 방법과 딥러닝 기반 방법의 비교
2013 년 9 월 기준 구글 연구진 발표 자료

Speech and Natural Language: Where Are We Now And Where Are We Headed
심층 신경망을 이용한 통계적 파라미터 음성 합성
Accurate and Compact Large Vocabulary Speech Recognition on Mobile Devices
Speaker Adaptation of Context Dependent Deep Neural Networks
Facebook, 일부 딥러닝 도구 오픈소스화
Language Model Verbalization for Automatic Speech Recognition
Language Modeling Capitalization
구글의 대화형 검색 (2013)
iVector-based Acoustic Data Selection
Google SyntaxNet 오픈소스 공개 및 원리
Life of Pi 의 털 렌더링 기술 (Rendering Fur in Life of Pi)
Deep Learning via Semi-Supervised Embedding
오프라인 아랍어 손글씨 인식 기술 동향 (A Survey)
Target Language Adaptation of Discriminative Transfer Parsers
Wireless Networks Design in the Era of Deep Learning Model-Based, AI-Based, or Both
AGC 및 다중 스타일 학습을 통한 소형 키워드 스포팅
Coordinated Multi-Device Presentations: Ambient-Audio Identification
Defunctionalized Interpreters for Call-by-Need Evaluation
대규모 분산 음향 모델링 및 백오프 N-그램
KamitaniLab DeepImageReconstruction 데이터 및 데모 코드
DurIAN_4S: 말하기 데이터로부터 노래 합성 학습
Affinity Weighted Embedding
내가 찾은 Deep Learning 공부 최단경로(?)
쉽게 풀어쓴 딥러닝(Deep Learning)의 거의 모든 것
Recurrent Neural Networks for Voice Activity Detection
Behavioural reconfigurable and adaptive data reduction in body sensor networks
Improved Domain Adaptation for Statistical Machine Translation
Efficient Estimation of Word Representations in Vector Space
모바일 음성 검색을 위한 Google 쿼리 스트림의 언어 모델링 경험적 탐색
음성 품질 지표의 배경 소음 및 네트워크 열화에 대한 강건성 비교 (VISQOL, PESQ, POLQA)
영어 책 코퍼스 기반 시계열 구문 N-그램 데이터셋
3음만으로 음악을 식별하는 알고리즘 개발
Universal Dependency Annotation for Multilingual Parsing
Scalable Decipherment for Machine Translation via Hash Sampling
심층 피처 합성 (Deep Feature Synthesis) 개요
Token and Type Constraints for Cross-Lingual Part-of-Speech Tagging
SOINN (Self-Organizing Incremental Neural Network)
The Intervalgram: 대규모 커버송 인식을 위한 오디오 특징
Transfer Learning In MIR: Sharing Learned Latent Representations For Music Audio Classification And Similarity
From mixed-mode to multiple devices. Web surveys, smartphone surveys and apps
신경망을 이용한 자기 학습 헬리콥터
WLAN-셀룰러 음성 핸드오버 평가를 위한 분석적 프레임워크
Eureka: Edge-Based Discovery of Training Data for Machine Learning
기계학습 고급 컨셉 및 연구자 조언 (노영균 교수 특강)
Summarization Through Submodularity and Dispersion
Latent Mixture of Discriminative Experts (LMDE)
오디오 처리를 위한 동기식 프로그래밍: 룩업 테이블 오실레이터 사례 연구
A Semantic Matching Energy Function for Learning with Multi-relational Data
예측 모델링에 대한 실용서
Continuous Birdsong Recognition Using Gaussian Mixture Modeling of Image Shape Features
Cross-Domain Feature Learning in Multimedia
Deep Learning 입문자를 위한 학습 로드맵 및 조언
자동화에 대한 신뢰 (Trust in Automation)
Video Snippets
Breaking Out of Local Optima with Count Transforms and Model Recombination: A Study in Grammar Induction
Enlisting the Ghost: Modeling Empty Categories for Machine Translation
Google Research Archive Paper 40700
신경망과 딥러닝 1. 퍼셉트론
모바일 네트워크 데이터를 활용한 도시 감지 연구 조사
State-based model slicing: A survey
Image Annotation in Presence of Noisy Labels
인공신경망 학습 레시피 (Andrej Karpathy)
Scalability vs. Fault Tolerance in Aspen Trees

AncomWiki

탐색기

Deep Learning in Speech Synthesis (Google Research Talk)

Deep Learning in Speech Synthesis (Google Research Talk)

Source

Summary

Key Points

그래프 뷰

목차

백링크

AncomWiki

탐색기

Deep Learning in Speech Synthesis (Google Research Talk)

Deep Learning in Speech Synthesis (Google Research Talk)

Source

Summary

Key Points

Related

그래프 뷰

목차

백링크