|
河井恒 研究業績一覧 (21件)
論文
-
Peng Shen,
Xugang Lu,
Sheng Li,
Hisashi Kawai.
Knowledge Distillation-based Representation Learning for Short-Utterance Spoken Language Identification.,
IEEE/ACM Trans. Audio, Speech \& Language Process. (TASLP),
Vol. 28,
pp. 2674--2683,
July 2020.
-
Junichi Yamagishi,
Hisashi Kawai,
Takao Kobayashi.
Phone duration modeling using gradient tree boosting,
Speech Communication,
Elsevier,
vol. 50,
no. 5,
pp. 405-415,
May 2008.
国際会議発表 (査読有り)
-
Sheng Li,
Chen Chen,
Chin Yuen Kwok,
Chenhui Chu,
Eng Siong Chng,
Hisashi Kawai.
Investigating ASR Error Correction with Large Language Model and Multilingual 1-best Hypotheses,
INTERSPEECH,
Dec. 2024.
-
Zhuo Gong,
Saito Daisuke,
Sheng Li,
Hisashi Kawai,
Minematsu Nobuaki.
Can We Train a Language Model Inside an End-to-End ASR Model? - Investigating Effective Implicit Language Modeling,
the Second Workshop on When Creative AI Meets Conversational AI,
Dec. 2022.
-
Zhuo Gong,
Daisuke Saito,
Longfei Yang,
Takahiro Shinozaki,
Sheng Li,
Hisashi Kawai,
Nobuaki Minematsu.
Self-Adaptive Multilingual ASR Rescoring with Language Identification and Unified Language Model.,
ISCA-Odyssey (The Speaker and Language Recognition Workshop),
pp. 415--420,
Dec. 2022.
-
Peng Shen,
Xugang Lu,
Komei Sugiura,
Sheng Li,
Hisashi Kawai.
Compensation on x-vector for short utterance spoken language identification.,
ISCA-Odyssey (The Speaker and Language Recognition Workshop),
pp. 47-52,
Dec. 2020.
-
Sheng Li,
Xugang Lu,
Raj Dabre,
Peng Shen,
Hisashi Kawai.
Joint Training End-to-End Speech Recognition Systems with Speaker Attributes.,
ISCA-Odyssey (The Speaker and Language Recognition Workshop),
pp. 385--390,
Dec. 2020.
-
Sheng Li,
Chenchen Ding,
Xugang Lu,
Peng Shen,
Tatsuya Kawahara,
Hisashi Kawai..
End-to-End Articulatory Attribute Modeling for Low-resource Multilingual Speech Recognition.,
INTERSPEECH,
pp. 2145--2149,
Sept. 2019.
-
Sheng Li,
Xugang Lu,
Chenchen Ding,
Peng Shen,
Tatsuya Kawahara,
Hisashi Kawai..
Investigating Radical-based End-to-End Speech Recognition Systems for Chinese Dialects and Japanese.,
INTERSPEECH,
pp. 2200--2204,
Sept. 2019.
-
Sheng Li,
Raj Dabre,
Xugang Lu,
Peng Shen,
Tatsuya Kawahara,
Hisashi Kawai.
Improving Transformer-based Speech Recognition Systems with Compressed Structure and Speech Attributes Augmentation.,
INTERSPEECH,
pp. 4400--4404,
Sept. 2019.
-
Peng Shen,
Xugang Lu,
Sheng Li,
Hisashi Kawai..
Interactive learning of teacher-student model for short utterance spoken language identification.,
IEEE-ICASSP,
pp. 5981--5985,
May 2019.
-
Ryoichi Takashima,
Sheng Li,
Hisashi Kawai..
Investigation of Sequence-level Knowledge Distillation Methods for CTC Acoustic Models.,
IEEE-ICASSP,
pp. 6156--6160,
May 2019.
-
Sheng Li,
Xugang Lu,
Ryoichi Takashima,
Phen Shen,
Tatsuya Kawahara,
Hisashi Kawai.
Improving very deep time-delay neural network with vertical-attention for effectively training CTC-based ASR systems.,
IEEE Spoken Language Technology Workshop (IEEE-SLT),
pp. 77--83,
Dec. 2018.
-
Sheng Li,
Xugang Lu,
Ryoichi Takashima,
Peng Shen,
Tatsuya Kawahara,
Hisashi Kawai..
Improving CTC-based Acoustic Model with Very Deep Residual Time-delay Neural Networks.,
INTERSPEECH,
pp. 3708--3712,
Sept. 2018.
-
Sheng Li,
Xugang Lu,
Ryoichi Takashima,
Peng Shen,
Tatsuya Kawahara,
Hisashi Kawai..
Improving CTC-based Acoustic Model with Very Deep Residual Time-delay Neural Networks.,
INTERSPEECH,
pp. 3708--3712,
Sept. 2018.
-
Peng Shen,
Xugang Lu,
Sheng Li,
Hisashi Kawai..
Feature Representation of Short Utterances based on Knowledge Distillation for Spoken Language Identification.,
INTERSPEECH,
pp. 1813--1817,
Sept. 2018.
-
Xugang Lu,
Peng Shen,
Sheng Li,
Yu Tsao,
Hisashi Kawai..
Temporal Attentive Pooling for Acoustic Event Detection.,
INTERSPEECH,
pp. 1354--1357,
Sept. 2018.
-
Ryoichi Takashima,
Sheng Li,
Hisashi Kawai.
CTC Loss Function with a Unit-level Ambiguity Penalty.,
IEEE-ICASSP,
pp. 5909--5913,
May 2018.
-
Sheng Li,
Xugang Lu,
Peng Shen,
Ryoichi Takashima,
Tatsuya Kawahara,
Hisashi Kawai.
Incremental training and constructing the very deep convolutional residual network acoustic models.,
IEEE Workshop Automatic Speech Recognition \& Understanding (IEEE-ASRU),
pp. 222--227,
Dec. 2017.
-
Peng Shen,
Xugang Lu,
Sheng Li,
Hisashi Kawai.
Conditional Generative Adversarial Nets Classifier for Spoken Language Identification.,
INTERSPEECH,
pp. 2814--2818,
Sept. 2017.
-
Ryoichi Takashima,
Sheng Li,
Hisashi Kawai.
An Investigation of a Knowledge Distillation Method for CTC Acoustic Models.,
pp. 5809--5813,
May 2017.
[ BibTeX 形式で保存 ]
[ 論文・著書をCSV形式で保存
]
[ 特許をCSV形式で保存
]
|