-  Biographical Sketch

Berlin Chen is a Professor in the Computer Science and Information Engineering Department at National Taiwan Normal University (NTNU), Taipei, Taiwan. He received his Ph.D. degree in computer science and information engineering from National Taiwan University (NTU) in June 2001, and then joined NTNU as an Assistant Professor in August 2002. He became an Associate Professor in August 2006 and was promoted to the rank of Professor in February 2010. Prof. Chen's research interests generally lie in the areas of speech recognition and natural language processing, multimedia information retrieval, and artificial intelligence; he is the author/coauthor of over 200 academic publications. Prof. Chen is a member of IEEE, ISCA and ACLCLP.

 

-  Research Interests

- Speech Recognition

        -  DNN-HMM and End-to-End Neural Modeling Structures
        -  Discriminative and Robust Feature Representation Learning
        -  Acoustic and Language Modeling
        -  Search Algorithms for Large-Vocabulary ASR Systems

- Natural Language Processing and Information Retrieval

        -  Text and Speech Summarization
        -  Retrieval Modeling and Ranking Algorithms
        -  Text and Speech Indexing and Retrieval
        -  QA and FAQ Systems

-  Computer-Assisted Language Learning

        - Computer-Assisted Pronunciation Training (CAPT)
        - Text Readability Assessment
        - Reading Comprehension

-  Machine Learning and Data Analytics
        -   Representation Learning: Surface and Deep Learning
        -   Generative and Discriminative Modeling Techniques
        -   Unsupervised and Lightly Supervised Training Techniques

-  Human-Machine Interaction

 

-  Research Articles

Some recent publications are listed below. A complete list of research articles can be found here.

Wen-Ting Tseng, Yung-Chang Hsu, Berlin Chen, "Effective FAQ retrieval and question matching tasks with unsupervised knowledge injection," The 24th International Conference on Text, Speech and Dialogue (TSD 2021), Olomouc, Czech Republic, September 6-9, 2021.

Bi-Cheng Yan, Berlin Chen, "End-to-end mispronunciation detection and diagnosis from raw waveforms," the 29th European Signal Processing Conference (EUSIPCO 2021), Virtual Conference Event, August 23-27, 2021. (Oral)

Shih-Hsuan Chiu, Tien-Hong Lo and Berlin Chen, "Cross-sentence neural language models for conversational speech recognition," The Annual International Joint Conference on Neural Networks (IJCNN 2021), Virtual Conference Event, July 18-22, 2021.

Fu-An Chao, Jeih-weih Hung, Berlin Chen, "Cross-domain single-channel speech enhancement model with bi-projection fusion module for noise-robust ASR," IEEE International Conference on Multimedia & Expo (ICME 2021), Virtual Conference Event, July 5-9, 2021. (Oral)

Shih-Hsuan Chiu, Berlin Chen, "Innovative BERT-based reranking language models for speech recognition," IEEE Workshop on Spoken Language Technology (SLT 2021), Virtual Conference Event, January 19-22, 2021.

Shi-Yan Weng, Tien-Hong Lo, Berlin Chen, "An effective contextual language modeling framework for speech summarization with augmented features," the 28th European Signal Processing Conference (EUSIPCO 2020), Amsterdam, Netherlands, January 18-22, 2021.

Yu-Te Wu, Berlin Chen, Li Su, "Multi-instrument automatic music transcription with self-attention-based instance segmentation," IEEE/ACM Transactions on Audio, Speech, and Language Processing, Vol. 28, pp. 2796-2809, October 2020.

Bi-Cheng Yan, Meng-Che Wu, Berlin Chen, "Exploring feature enhancement in the modulation spectrum domain via ideal ratio mask for robust speech recognition," 2020 APSIPA Annual Summit and Conference (APSIPA ASC 2020), Auckland, New Zealand, December 7-10, 2020.

Tien-Hong Lo, Fu-An Chao, Shi-Yan Weng, Berlin Chen, "The NTNU System at the Interspeech 2020 Non-Native Children's Speech ASR Challenge," the 21st Annual Conference of the International Speech Communication Association (Interspeech 2020), Shanghai, China, October 25-29, 2020.

Bi-Cheng Yan, Meng-Che Wu, Hsiao-Tsung Hung, Berlin Chen, "An end-to-end mispronunciation detection system for L2 English speech leveraging novel anti-phone modeling," the 21st Annual Conference of the International Speech Communication Association (Interspeech 2020), Shanghai, China, October 25-29, 2020.

Tien-Hong Lo, Shi-Yan Weng, Hsiu-Jui Chang, Berlin Chen, "An effective end-to-end modeling approach for mispronunciation detection," the 21st Annual Conference of the International Speech Communication Association (Interspeech 2020), Shanghai, China, October 25-29, 2020.

Shao-Wei Fan-Jiang, Tien-Hong Lo, Berlin Chen, "Spoken document retrieval leveraging BERT-based modeling and query reformulation," the 45th IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2020), Barcelona, Spain, May 4-8, 2020.

Shih-Hung Liu, Kuan-Yu Chen, Berlin Chen*, "Enhanced language modeling with proximity and sentence relatedness information for extractive broadcast news summarization," ACM Transactions on Asian and Low-Resource Language Information Processing, Vol. 19, No. 3, Article 46: 1-19, February 2020.

Xiaofei Lu, Berlin Chen (Eds.), Computational and Corpus Approaches to Chinese Language Learning (ISBN: 978-981-13-3570-9),  Singapore: Springer, April 2019. (Book Review)

Xiaofei Lu, Berlin Chen, "Computational and corpus approaches to Chinese language learning: An introduction," Chapter 1 of the book "Computational and Corpus Approaches to Chinese Language Learning (ISBN: 978-981-13-3570-9)," pp. 3-11, Singapore: Springer, April 2019.

Berlin Chen*, Yao-Chi Hsu, "Mandarin Chinese mispronunciation detection and diagnosis leveraging deep neural network based acoustic modeling and training techniques," Chapter 11 of the book "Computational and Corpus Approaches to Chinese Language Learning (ISBN: 978-981-13-3570-9)," pp. 217-234, Singapore: Springer, April 2019.

Hou-Chiang Tseng, Berlin Chen, Tao-Hsing Chang, Yao-Ting Sung, "Integrating LSA-based hierarchical conceptual space and machine learning methods for leveling the readability of domain specific texts," Natural Language Engineering, Vol. 25, No. 3, pp. 331-361, May 2019.

Hsiao-Yun Lin, Tien-Hong Lo, Berlin Chen, "Enhanced BERT-based ranking models for spoken document retrieval," IEEE workshop on Automatic Speech Recognition and Understanding (ASRU 2019), Sentosa, Singapore, December 14-18, 2019.

Hou-Chiang Tseng, Hsueh-Chih Chen, Kuo-En Chang, Yao-Ting Sung and Berlin Chen, "An innovative BERT-based readability model," International Conference of Innovative Technologies and Learning (ICITL 2019), Tromso, Norway, December 2-5, 2019.

Tien-Hong Lo, Berlin Chen, "Semi-supervised training of acoustic models leveraging knowledge transferred from out-of-domain data," 2019 APSIPA Annual Summit and Conference (APSIPA ASC 2019), Lanzhou, China, November 18-21, 2019.

Yu-Te Wu, Berlin Chen, Li Su, "Polyphonic music transcription with semantic segmentation," the 44th IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2019), Brighton, United Kingdom, May 12-17, 2019. (Oral)

Tzu-En Liu, Shih-Hung Liu, Berlin Chen, "A hierarchical neural summarization framework for spoken documents," the 44th IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2019), Brighton, United Kingdom, May 12-17, 2019. (Oral)

Kuan-Yu Chen, Shih-Hung Liu, Berlin Chen, Hsin-Min Wang, "An information distillation framework for extractive summarization," IEEE/ACM Transactions on Audio, Speech, and Language Processing, Vol. 26, No. 1, pp. 161-170, January 2018.

Yu-Te Wu, Berlin Chen, Li Su, "Automatic music transcription leveraging generalized cepstral features and deep learning," the 43rd IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2018), Calgary, Alberta, Canada, April 15-20, 2018.

Kuan-Yu Chen, Shih-Hung Liu, Berlin Chen, Hsin-Min Wang, "Essence vector-based query modeling for spoken document retrieval," the 43rd IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2018), Calgary, Alberta, Canada, April 15-20, 2018.

Shih-Hung Liu, Kuan-Yu Chen, Yu-Lun Hsieh, Berlin Chen*, Hsin-Min Wang, Hsu-Chun Yen, Wen-Lian Hsu, "A position-aware language modeling framework for extractive broadcast news speech summarization," ACM Transactions on Asian and Low-Resource Language Information Processing, Vol. 16, No. 4, Article 27:1-13, August 2017.

Tien-Hong Lo, Ying-Wen Chen, Kuan-Yu Chen, Hsin-Min Wang, Berlin Chen, "Neural relevance-aware query modeling for spoken document retrieval," IEEE workshop on Automatic Speech Recognition and Understanding (ASRU 2017), Okinawa, Japan, December 16-20, 2017.

Chin-Hong Shih, Bi-Cheng Yan, Shih-Hung Liu, Berlin Chen, "Investigating Siamese LSTM networks for text categorization," 2017 APSIPA Annual Summit and Conference (APSIPA ASC 2017), Kuala Lumpur, Malaysia, December 12-15, 2017.

Bi-Cheng Yan, Chin-Hong Shih, Shih-Hung Liu, Berlin Chen, "Exploring low-dimensional structures of modulation spectra for robust speech recognition," the 18th Annual Conference of the International Speech Communication Association (Interspeech 2017), Stockholm, Sweden, August 20-24, 2017. (Oral) 

Ying-Wen Chen, Kuan-Yu Chen, Hsin-Min Wang, Berlin Chen, "Exploring the use of significant words language modeling for spoken document retrieval," the 18th Annual Conference of the International Speech Communication Association (Interspeech 2017), Stockholm, Sweden, August 20-24, 2017.

Ming-Han Yang, Hung-Shin Lee, Yu-Ding Lu, Kuan-Yu Chen, Yu Tsao, Berlin Chen, Hsin-Min Wang, "Discriminative autoencoders for acoustic modeling," the 18th Annual Conference of the International Speech Communication Association (Interspeech 2017), Stockholm, Sweden, August 20-24, 2017. (Oral)

Bi-Cheng Yan, Chin-Hong Shih, Shih-Hung Liu, Berlin Chen, "Enhancing feature modulation spectra with dictionary learning approaches for robust speech recognition," the IEEE International Conference on Multimedia & Expo (ICME 2017), Hong Kong, July 10-14, 2017. (Oral) 

Kuan-Yu Chen, Shih-Hung Liu, Berlin Chen, Hsin-Min Wang, "A locality-preserving essence vector modeling framework for spoken document retrieval," the 42nd IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2017), New Orleans, USA, March 5-9, 2017.

Shih-Hung Liu, Kuan-Yu Chen, Berlin Chen, Hsin-Min Wang, Wen-Lian Hsu, "Leveraging manifold learning for extractive broadcast news summarization," the 42nd IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2017), New Orleans, USA, March 5-9, 2017.

Jeih-weih Hung, Hsin-Ju Hsieh, Berlin Chen, "Robust speech recognition via enhancing the complex-valued acoustic spectrum in modulation domain,"  IEEE/ACM Transactions on Audio, Speech, and Language Processing, Vol. 24, No. 2, pp. 236-251, February 2016.

Kuan-Yu Chen, Shih-Hung Liu, Berlin Chen*, Hsin-Min Wang, Hsin-Hsi Chen, "Exploring the use of unsupervised query modeling techniques for speech recognition and summarization," Speech Communication, Vol. 80, pp. 49-59, June 2016.

Hsien-sheng Hsiao, Cheng-Sian Chang, Chiou-Yan Lin, Berlin Chen, Chia-Hou Wu, "The development and evaluation of listening and speaking diagnosis and remedial teaching system," British Journal of Educational Technology, Vol. 47, No. 2, pp. 372-389, March 2016.

Chun-I Tsai, Hsiao-Tsung Hung, Kuan-Yu Chen, Berlin Chen, "Extractive speech summarization leveraging convolutional neural network techniques," IEEE Workshop on Spoken Language Technology (SLT 2016), San Diego, California, USA, December 13-16, 2016.

Kuan-Yu Chen, Shih-Hung Liu, Berlin Chen and Hsin-Min Wang, "Learning to distill: the essence vector modeling framework," the 26th International Conference on Computational Linguistics (COLING 2016), Osaka, Japan, December 13-16, 2016.

Shih-Hung Liu, Kuan-Yu Chen, Yu-Lun Hsieh, Berlin Chen, Hsin-Min Wang, Hsu-Chun Yen, Wen-Lian Hsu, "Graph regularized nonnegative matrix factorization for extractive speech summarization," 2016 APSIPA Annual Summit and Conference (APSIPA ASC 2016), Jeju Island, Korea, December 13-16, 2016.

Kuan-Yu Chen, Shih-Hung Liu, Berlin Chen and Hsin-Min Wang, "A novel paragraph embedding method for spoken document summarization," 2016 APSIPA Annual Summit and Conference (APSIPA ASC 2016), Jeju Island, Korea, December 13-16, 2016.

Hsin-Ju Hsieh, Berlin Chen and Jeih-weih Hung, "Employing median filtering to enhance the complex-valued acoustic spectrograms in modulation domain for noise-robust speech recognition," the 10th International Symposium on Chinese Spoken Language Processing (ISCSLP 2016), Tianjin, China, October 17-20, 2016.

Kuan-Yu Chen, Shih-Hung Liu, Berlin Chen, Hsin-Min Wang, Hsin-Hsi Chen, "Novel word embedding and translation-based language modeling for extractive speech summarization," 2016 ACM International Conference on Multimedia, Amsterdam, The Netherlands, October 15-19, 2016. (Short Paper)

Yao-Chi Hsu, Ming-Han Yang, Hsiao-Tsung Hung, Berlin Chen, "Mispronunciation detection leveraging maximum performance criterion training of acoustic models and decision functions," the 17th Annual Conference of the International Speech Communication Association (Interspeech 2016), San Francisco, USA, September 8-12, 2016.

Shih-Hung Liu, Kuan-Yu Chen, Yu-Lun Hsieh, Berlin Chen, Hsin-Min Wang, Hsu-Chun Yen and Wen-Lian Hsu, "Exploring word mover’s distance and semantic-aware embedding techniques for extractive broadcast news summarization," the 17th Annual Conference of the International Speech Communication Association (Interspeech 2016), San Francisco, USA, September 8-12, 2016.

Kuan-Yu Chen, Shih-Hung Liu, Berlin Chen, Hsin-Min Wang, "Improved spoken document summarization with coverage modeling techniques," the 41st IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2016), Shanghai, China, March 20-25, 2016.

Kuan-Yu Chen, Shih-Hung Liu, Berlin Chen*, Hsin-Min Wang, Ea-Ee Jan, Wen-Lian Hsu and Hsin-Hsi Chen, "Extractive broadcast news summarization leveraging recurrent neural network language modeling techniques," IEEE/ACM Transactions on Audio, Speech, and Language Processing, Vol. 23, No. 8, pp. 1322-1334, August 2015.

Shih-Hung Liu, Kuan-Yu Chen, Berlin Chen*, Hsin-Min Wang, Hsu-Chun Yen, Wen-Lian Hsu, "Combining relevance language modeling and clarity measure for extractive speech summarization," IEEE/ACM Transactions on Audio, Speech, and Language Processing, Vol. 23, No. 6, pp. 957-969, June 2015.

Hsin-Ju Hsieh, Berlin Chen*, Jeih-weih Hung, "Histogram equalization of contextual statistics of speech features for robust speech recognition," Multimedia Tools and Applications,  Vol. 74, No. 17, pp. 6769-6795, 2015.

Shih-Hung Liu, Hung-Shin Lee, Hsiao-Tsung Hung, Kuan-Yu Chen, Berlin Chen, Hsin-Min Wang, Hsu-Chun Yen, Wen-Lian Hsu, "Incorporating proximity information in relevance language modeling for extractive speech summarization," 2015 APSIPA Annual Summit and Conference (APSIPA ASC 2015), Hong Kong, December 16-19, 2015.

Hsin-Ju Hsieh, Berlin Chen and Jeih-weih Hung, "Enhancing the complex-valued acoustic spectrograms in modulation domain for creating noise-robust features in speech recognition," 2015 APSIPA Annual Summit and Conference (APSIPA ASC 2015), Hong Kong, December 16-19, 2015.

Kuan-Yu Chen, Kai-Wun Shih, Shih-Hung Liu, Berlin Chen, Hsin-Min Wang, "Incorporating paragraph embeddings and density peaks clustering for spoken document summarization," IEEE workshop on Automatic Speech Recognition and Understanding (ASRU 2015), Scottsdale, Arizona, USA, December 13-17, 2015.

Shih-Hung Liu, Kuan-Yu Chen, Berlin Chen, Hsin-Min Wang, Hsu-Chun Yen, Wen-Lian Hsu, "Positional language modeling for extractive broadcast news speech summarization," the 16th Annual Conference of the International Speech Communication Association (Interspeech 2015), Dresden, Germany, September 6-10, 2015.

Kuan-Yu Chen, Shih-Hung Liu, Hsin-Min Wang, Berlin Chen, Hsin-Hsi Chen, "Leveraging word embeddings for spoken document summarization," the 16th Annual Conference of the International Speech Communication Association (Interspeech 2015), Dresden, Germany, September 6-10, 2015.

Kuan-Yu Chen, Hsin-Min Wang, Berlin Chen, Hsin-Hsi Chen, "I-vector based language modeling for query representation," the 40th IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2015), Brisbane, Australia, April 19-24, 2015.

Hsuan-Sheng Chiu,  Kuan-Yu Chen, Berlin Chen*, "Leveraging topical and positional cues for language modeling in speech recognition,"  Multimedia Tools and Applications, Vol. 72, No. 2, pp. 1465-1481, September 2014.

Berlin Chen*, Yi-Wen Chen, Kuan-Yu Chen, Hsin-Min Wang, Kuen-Tyng Yu, "Enhancing query formulation for spoken document retrieval," Special Issue on Emerging Technologies and Applications of Artificial Intelligence, Journal of Information Science and Engineering, Vol. 30, No. 3, pp. 553-569, May 2014.

Hsin-Ju Hsieh, Berlin Chen and Jeih-weih Hung, "Spatial histogram equalization of complex-valued acoustic spectra in modulation domain for noise-robust speech recognition," 2014 APSIPA Annual Summit and Conference (APSIPA ASC 2014), Siem Reap, city of Angkor Wat, Cambodia, December 9-12, 2014. 

Shih-Hung Liu, Kuan-Yu Chen, Berlin Chen, Ea-Ee Jan, Hsin-Min Wang, Hsu-Chun Yen, Wen-Lian Hsu, "A margin-based discriminative modeling approach for extractive speech summarization," 2014 APSIPA Annual Summit and Conference (APSIPA ASC 2014), Siem Reap, city of Angkor Wat, Cambodia, December 9-12, 2014.

Kuan-Yu Chen, Shih-Hung Liu, Berlin Chen, Hsin-Min Wang, Wen-Lian Hsu, Hsin-Hsi Chen, Ea-Ee Jan, "Leveraging effective query modeling techniques for speech recognition and summarization," the Conference on Empirical Methods in Natural Language Processing (EMNLP 2014), Doha, Qatar, October 25-29, 2014. (Short Paper)

Yu-Chen Kao, Yi-Ting Wang, Berlin Chen, "Effective modulation spectrum factorization for robust speech recognition," the 15th Annual Conference of the International Speech Communication Association (Interspeech 2014), Singapore, September 14-18, 2014.

Shih-Hung Liu, Kuan-Yu Chen, Yu-Lun Hsieh, Berlin Chen, Hsin-Min Wang, Hsu-Chun Yen, Wen-Lian Hsu, "Enhanced language modeling for extractive speech summarization with sentence relatedness information," the 15th Annual Conference of the International Speech Communication Association (Interspeech 2014), Singapore, September 14-18, 2014.

Kuan-Yu Chen, Shih-Hung Liu, Berlin Chen, Hsin-Min Wang, Wen-Lian Hsu and Hsin-Hsi Chen, "A recurrent neural network language modeling framework for extractive speech summarization," the IEEE International Conference on Multimedia & Expo (ICME 2014), Chengdu, China, July 14-18, 2014. (Full Paper)

Shih-Hung Liu, Kuan-Yu Chen, Yu-Lun Hsieh, Berlin Chen, Hsin-Min Wang, Hsu-Chun Yen, Wen-Lian Hsu, "Effective pseudo-relevance feedback for language modeling in extractive speech summarization," the 39th IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2014), Florence, Italy, May 4-9, 2014.

Berlin Chen*, Kuan-Yu Chen, "Leveraging relevance cues for language modeling in speech recognition," Information Processing & Management, Vol. 49, No. 4, pp. 807-816, July 2013.

Berlin Chen*, Shih-Hsiang Lin, Yu-Mei Chang, Jia-Wen Liu,  "Extractive speech summarization using evaluation metric-related training criteria," Information Processing & Management, Vol. 49, No. 1, pp. 1-12, January 2013.

Berlin Chen, Yi-Wen Chen, Kuan-Yu Chen, Ea-Ee Jan, "Effective pseudo-relevance feedback for language modeling in speech recognition," IEEE workshop on Automatic Speech Recognition and Understanding (ASRU 2013), Olomouc, Czech Republic, December 8-12, 2013.

Yi-Wen Chen, Bo-Han Hao, Kuan-Yu Chen, Berlin Chen, "Incorporating proximity information for relevance language modeling in speech recognition," the 14th Annual Conference of the International Speech Communication Association (Interspeech 2013), Lyon, France, August 25-29, 2013.

Yu-Chen Kao, Berlin Chen, "Distribution-based feature normalization for robust speech recognition leveraging context and dynamics cues," the 14th Annual Conference of the International Speech Communication Association (Interspeech 2013), Lyon, France, August 25-29, 2013.

Hsin-Ju Hsieh, Berlin Chen, Jeih-weih Hung, "Histogram equalization of real and imaginary modulation spectra for noise-robust speech recognition," the 14th Annual Conference of the International Speech Communication Association (Interspeech 2013), Lyon, France, August 25-29, 2013.

Berlin Chen, Hao-Chin Chang, Kuan-Yu Chen, "Sentence modeling for extractive speech summarization," the IEEE International Conference on Multimedia & Expo (ICME 2013), San Jose, California, USA, July 15-19, 2013. (Full Paper)

Yi-Wen Chen, Kuan-Yu Chen, Hsin-Min Wang, Berlin Chen, "Effective pseudo-relevance feedback for spoken document retrieval," the 38th IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2013), Vancouver, Canada, May 26-31, 2013.

Berlin Chen*, Kuan-Yu Chen, Pei-Ning Chen, Yi-Wen Chen, "Spoken document retrieval with unsupervised query modeling techniques," IEEE Transactions on Audio, Speech and Language Processing, Vol. 20, No. 9, pp. 2602-2612, November 2012.

Berlin Chen*, Shih-Hsiang Lin, "A risk-aware modeling framework for speech summarization," IEEE Transactions on Audio, Speech and Language Processing, Vol. 20, No. 1, pp. 199-210, January 2012.

Yueng-Tien Lo, Shih-Hsiang Lin, Berlin Chen, "Constructing effective ranking models for speech summarization," the 37th IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2012), Kyoto, Japan, March 25-30, 2012.

Kuan-Yu Chen, Hao-Chin Chang, Berlin Chen, Hsin-Min Wang, "Word relevance modeling for speech recognition," the 13th Annual Conference of the International Speech Communication Association (Interspeech 2012), Portland, Oregon, USA, September 9-13, 2012.

Hsin-Ju Hsieh, Jeih-weih Hung, Berlin Chen, "Exploring joint equalization of spatial-temporal contextual statistics of speech features for robust speech recognition," the 13th Annual Conference of the International Speech Communication Association (Interspeech 2012), Portland, Oregon, USA, September 9-13, 2012.

Shih-Hsiang Lin, Yao-Ming Yeh, Berlin Chen*, “Leveraging Kullback-Leibler divergence measures and information-rich cues for speech summarization,” IEEE Transactions on Audio, Speech and Language Processing, Vol. 19, No. 4, pp. 871-882, May 2011.

Berlin Chen*, Wei-Hau Chen, Shih-Hsiang Lin, Wen-Yi Chu, "Robust speech recognition using spatial–temporal feature distribution characteristics," Pattern Recognition Letters, Vol. 32, No. 7, pp. 919-926, 1 May 2011.

Berlin Chen*, Shih-Hsiang Lin, "Distribution-based feature compensation for robust speech recognition," Chapter 10 (pp. 155-168) of the book "Recent Advances in Robust Speech Recognition Technology," edited by J. Ramírez and J. M. Górriz, Bentham Science Publishers, 2011. (DOI:10.2174/97816080517241110101)

Berlin Chen, Pei-Ning Chen, Kuan-Yu Chen, "Query modeling for spoken document retrieval," IEEE workshop on Automatic Speech Recognition and Understanding (ASRU 2011), Hawaii, USA, December 11-15, 2011.

Wen-Yi Chu, Jeih-weih Hung, Berlin Chen, "Modulation spectrum factorization for robust speech recognition," 2011 APSIPA Annual Summit and Conference (APSIPA ASC 2011), Xi'an, China, October 18-21, 2011.

Pei-Ning Chen, Kuan-Yu Chen, Berlin Chen, "Leveraging relevance cues for improved spoken document retrieval," the 12th Annual Conference of the International Speech Communication Association (Interspeech 2011), Florence, Italy, August 28-31, 2011.

Berlin Chen, Jia-Wen Liu, "Discriminative language modeling for speech recognition with relevance information," the IEEE International Conference on Multimedia & Expo (ICME 2011), Barcelona, Spain, July 11-15, 2011. (Regular Paper)

Kuan-Yu Chen, Berlin Chen, "Relevance language modeling for speech recognition," the 36th IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2011), Prague, Czech Republic, May 22-27, 2011.

Shih-Hsiang Lin, Ea-Ee Jan, Berlin Chen, "Handling verbose queries for spoken document retrieval," the 36th IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2011), Prague, Czech Republic, May 22-27, 2011.

Shih-Hsiang Lin, Berlin Chen, "A risk minimization framework for extractive speech summarization," the 48th Annual Meeting of the Association for Computational Linguistics (ACL 2010), pp. 79-87, Uppsala, Sweden, July 11-16, 2010.  (Long Paper)

Kuan-Yu Chen, Hsuan-Sheng Chiu, Berlin Chen, "Latent topic modeling of word vicinity information for speech recognition," the 35th IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2010), pp. 5394-5397, Dallas, Texas, USA, March 14-19, 2010.

 


-  Activities

Vice President of ACLCLP, October 2019 - September 2021.

Board Member of ACLCLP, October 2007 - September 2021.

Speech, Language, and Audio (SLA) Technical Committee Member, Asia-Pacific Signal and Information Processing Association (APSIPA), 2017 to 2019.

Editor-in-Chief, International Journal of Computational Linguistics and Chinese Language Processing (IJCLCLP), January 2020 to present.

Associate Editor, The Journal of Information Science and Engineering (JISE), 2017 to 2023.

External Grant Reviewer, Research Grants Council (RGC) of Hong Kong, 2008-2009, 2011-2020.

Session Chair,  International Symposium on Chinese Spoken Language Processing (ISCSLP 2018), Taipei, Taiwan, November 26-29, 2018.

Session Chair,  International Conference on Asian Language Processing (IALP 2018), Bandung, Indonesia, November 15-18, 2018.

Session Chair, the 42nd IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2017), New Orleans, USA, March 5-9, 2017.

Program Committee Member, The 22nd ACM International Conference on Multimedia (ACM MM 2014), November 3-7, 2014.

Program Committee Member, The 21st ACM International Conference on Multimedia  (ACM MM 2013), October 21-25, 2013.

Program Committee Member, 2013 IEEE International Conference on Multimedia & Expo (ICME2013), San Jose, California, USA, July 15-19, 2013.

Chair of Academic Council of ACLCLP, September 2007 - September 2014.

Program Committee Member, The 50th Annual Meeting of the Association for Computational Linguistics, July 8-14, 2012.

Program Committee Member, 2012 IEEE International Conference on Multimedia & Expo (ICME2012), Melbourne, Australia, July 9-13, 2012.

Program Committee Member, The 5th International Joint Conference on Natural Language Processing (IJCNLP 2011),  Chiang Mai, Thailand, November 8-13, 2011.

Program Committee Member, 2011 APSIPA Annual Summit and Conference (APSIPA ASC 2011),  Xi'an, China, October 18-21, 2011.

Program Committee Member, The Oriental COCOSDA 2011 Conference, Hsinchu, Taiwan, October 26-28, 2011.

Program Committee Member, ROCLING 2011, September 8-9, 2011.

Program Committee Member, The 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, June 19-24, 2011.

Special Session Co-Chair & Session Co-Chair, International Symposium on Chinese Spoken Language Processing (ISCSLP 2010), November 29 - December 3, 2010.

Program Committee Member, The 4th workshop on Searching Spontaneous Conversational Speech (in conjunction with ACM Multimedia 2010), Florence, Italy, October 29, 2010.

Special Session Co-Organizer, "Open Vocabulary Spoken Document Retrieval," the 11th Annual Conference of the International Speech Communication Association (Interspeech 2010), Makuhari, Japan, September 26-30, 2010.

Program Committee Member, ROCLING 2010, September 1-2, 2010.

Session Chair, "Short Talks: Speech, Multimodal, and Summarization," the 48th Annual Meeting of the Association for Computational Linguistics (ACL 2010), Uppsala, Sweden, July 11–16, 2010.

Editorial Board Member, International Journal of Computational Linguistics and Chinese Language Processing (IJCLCLP), May 2008 to April 2012.

Program Committee Member, The 14th Conference on Artificial Intelligence and Applications (TAAI2009), October 30-31, 2009.

Program Committee Member, The 3rd workshop on Searching Spontaneous Conversational Speech (in conjunction with ACM Multimedia 2009), Beijing, China, October 23, 2009.

Program Co-chair, ROCLING 2009, September 1-2, 2009.

Program Committee, ROCLING 2009.

Co-Guest Editor, IJCLCLP Special Issue on "Selected Papers from ROCLING XX", 2009.

Program Committee Member, The 13th Conference on Artificial Intelligence and Applications (TAAI2008), November 21-22, 2008.

Program Co-chair, ROCLING 2008, September 4-5, 2008.

Program Committee Member, The 2nd workshop on Searching Spontaneous Conversational Speech (in conjunction with ACM SIGIR 2008), Singapore, July 24, 2008.

Co-Guest Editor, IJCLCLP Special Issue on "Text Mining", 2008.

Co-Guest Editor, IJCLCLP Special Issue on "Selected Papers from ROCLING XIX", 2008.

Program Co-chair, ROCLING 2007, September 6-7, 2007.

Program Committee Member, The 2nd Pacific-Rim Symposium on Image and Video Technology (PSIVT'07), December 17-19, 2007.

Trainee, National Institute of Information and Communications Technology (NICT), Japan, August 1 - September 8, 2006.

Program Committee Member, The 1st Pacific-Rim Symposium on Image and Video Technology (PSIVT'06), December 11-13, 2006.

Workshop General Chair, 2006 Speech Signal Processing Workshop, Taipei, Taiwan, April 21, 2006.

Session Chair, ROCLING 2005, September 15-16, 2005.

Program Committee Member, The 1st Workshop on Intelligent Web Technologies (IWT 2004), September 2, 2004.

Visiting Scholar, Institute of Information Science, Academia Sinica, Taiwan, February and July - August 2004.

Workshop Co-Chair, "Workshop on Digital Archives Technology and Innovalue", Taipei, Taiwan, December 18-19, 2003.

Workshop General Chair, "2003 Information Retrieval Workshop", Hsinchu, Taiwan, September 18, 2003.

Visiting Scholar, Institute of Information Science, Academia Sinica, Taiwan, July - September 2003.

 

-  Referee for Journals and Conferences

      - Journals

IEEE/ACM Transactions on Audio, Speech, and Language Processing
IEEE Transactions on Signal Processing
IEEE Transactions on Multimedia
IEEE Transactions on Mobile Computing
IEEE Journal of Selected Topics in Signal Processing
ACM Transactions on Information Systems
ACM Transactions on Asian and Low-Resource Language Information Processing
The Journal of the Acoustical Society of America
Information Processing & Management
Information Sciences
Speech Communication
Computer Speech & Language
Neurocomputing
International Journal of Computational Linguistics and Chinese Language Processing
International Journal of Information Technology & Decision Making
IEEE Signal Processing Letters
Pattern Recognition Letters
Information Processing Letters
EURASIP Journal on Audio, Speech, and Music Processing
Data & Knowledge Engineering
ETRI Journal
Journal of Information Science and Engineering 
Journal of the Chinese Institute of Engineers
Signal, Image and Video Processing

      - Conferences

ICASSP 2008, 2009, 2010, 2011, 2012, 2013, 2014, 2015, 2016, 2017, 2018, 2019, 2020
ASRU 2007, 2009, 2011, 2013, 2015, 2017, 2019
Interspeech 2010, 2011, 2012, 2013, 2014, 2015, 2016, 2017, 2018, 2019, 2020
ACL 2011, 2012, 2016
EMNLP 2014
NAACL 2018
COLING 2016, 2020
IJCNLP 2005, 2011, 2013
ICME 2010, 2011, 2012, 2013, 2015, 2016, 2017, 2018, 2019, 2020
ACM MM 2013, 2014
ISCAS 2009, 2010, 2012
APSIPA ASC 2009, 2010, 2011, 2014, 2015, 2016, 2017, 2018, 2019
LREC 2018, 2020
ISCSLP 2008, 2010
ROCLING 2005, 2006, 2008, 2010, 2011, 2015, 2019
SSCS 2008, 2009, 2010
TAAI 2005, 2006, 2007, 2008, 2009, 2011
NST 2008, 2009
TENCON 2007
 

-  Invited Talks

2020/06/08 Institute of Information Science, Academia Sinica, TIGP (SNHCC)
  Title: Some Novel Approaches to Speech and Language Processing 
2020/06/03 Department of Computer Science, University of Taipei
  Title: Recent Advances in Automatic Speech Recognition and its Applications 
2020/04/23 EE Department, National Taiwan Normal University (NTNU)
  Title:  Recent Developments in Automatic Speech Recognition and its Applications
2019/11/15 Graduate Institute of Linguistics, National Cheng-Chi University (NCCU)
  Title:  ASR and Its Applications to Computer-Assisted Pronunciation Training
2019/11/15 National Taipei University of Business (NTUB)
  Title:  ASR and Its Applications to Computer-Assisted Pronunciation Training
2019/11/14 CSIE Department, Chung Hua University (CHU)
  Title:  Recent Developments in Automatic Speech Recognition and its Applications
2019/10/05 Taiwan AI Academy
  Title:  Recent Developments in Automatic Speech Recognition and its Applications
2019/06/28 Computers for Chinese Language Acquisition: Summer Institute and Consortium (CLASIC 2019), NTNU
  Title:  Computer-Assisted Pronunciation Training for Mandarin Chinese
2019/02/15 ASUS Intelligent Cloud Service Center (AICS)
  Title:  Recent Developments in Automatic Speech Recognition and its Applications
2018/10/23 Institute of Information Science, Academia Sinica
  Title:  Some New Approaches to Automatic Speech Recognition and its Applications
2018/05/30 Graduate Institute of Library & Information Studies, National Taiwan Normal University (NTNU)
  Title:  Recent Developments in Deep Learning for Multimedia Processing
2018/03/29 IM Department, National Taiwan University of Science and Technology (NTUST)
  Title:  Recent Developments in Deep Learning for Multimedia Processing
2018/03/09 CSIE Department, National Defense University (NDU)
  Title:  Recent Developments in Deep Learning for Multimedia Processing
2016/06/29 Development Center for Compilation and Translation, National Academy for Educational Research (NAER)
  Title:  Recent Developments in Machine Learning-based Text Readability Assessment
2016/05/12 Department of Information Management, Yuan Ze University (YZU) 
  Title:  Several New Representation Learning Approaches to Automatic Speech Recognition and its Applications
2016/04/20 National Taiwan Normal University (NTNU)
  Title:  Mandarin Mispronunciation Detection
2015/10/14 One Day Workshop on Optimization, Department of Mathematics, National Taiwan Normal University (NTNU)
  Title:  Recent Developments in Statistical Modeling Techniques for Speech and Natural Language Processing
2015/06/26 Research Center for Psychological and Educational Testing (RCPET), National Taiwan Normal University (NTNU)
  Title:  Recent Developments in Automatic Summarization
2014/12/29 CSIE Department, National Taiwan University of Science and Technology (NTUST)
  Title:  Recent Progress in Automatic Speech Recognition and Applications  
2014/05/05 Taipei Municipal Jianguo High School
  Title:  Recent Developments in Speech Recognition Technologies and their Applications  
2013/06/08 Workshop on Technology for Language Research and Language Learning, IACL 2013
  Title:  Speech Recognition and its Applications to Computer-Assisted Language Learning
2013/05/22 Graduate Institute of Communication Engineering, National Taipei University (NTPU)
  Title:  Recent Developments in Language Modeling Techniques and their Applications
2013/01/11 Research Center for Psychological and Educational Testing (RCPET), National Taiwan Normal University (NTNU)
  Title:  Topic Language Models and their Applications
2013/01/03 Department of Chinese, National Taiwan Normal University (NTNU)
  Title:  Recent Developments in Speech Retrieval and Related Applications
2012/05/29 Graduate Institute of Library & Information Studies, National Taiwan Normal University (NTNU)
  Title:  Recent Developments in Speech Retrieval and Related Applications
2012/05/01 Department of Electronic Engineering, Ming Chi University of Technology (MCUT)
  Title:  Introduction to Automatic Speech Recognition and its Application
2011/12/28 CSIE Department, National Central University (NCU)
  Title:  Relevance Language Modeling for Speech Recognition and  Related Applications
2010/12/30 Department of Information Management, Yuan Ze University (YZU)
  Title:  Handling Verbose Queries for Speech Retrieval
2010/12/24 Department of Computer Science and Engineering, Yuan Ze University (YZU)
  Title:    Language Modeling for Speech Recognition and Related Applications
2010/12/09 CS Department, National Cheng-chi University (NCCU)
  Title:    Speech summarization - From the view of decision theory
2010/08/11 Research Center for Psychological and Educational Testing (RCPET), National Taiwan Normal University (NTNU)
  Title:    Probabilistic Latent Topic Analysis and Its Applications
2010/05/14 CSIE Department, National Taiwan University (NTU)
  Title:    A risk minimization framework for extractive speech summarization
2010/05/04 2010 Speech Signal Processing Workshop (held at National Chiayi University)
  Title:    Recent Developments in Speech Retrieval and Summarization
2009/12/28 Graduate Institute of Library, Information and Archival Studies, National Cheng-chi University (NCCU)
  Title:    Spoken Document Recognition, Retrieval and Summarization
2009/11/24 EE Department, National Sun Yat-sen University (NSYSU)
  Title:    Recent Developments in Speech Recognition Technologies and Their Applications to Multimedia Information Access
2009/10/02 EE Department, National Tsing Hua University (NTHU)
  Title:    Recent Developments in Speech Recognition Technologies and Their Applications to Multimedia Information Access
2009/05/04 CSIE Department, National Taiwan University of Science and Technology (NTUST)
  Title:    Latent topic modeling of word co-occurrence information for spoken document retrieval
2009/03/25 Research Center for Psychological and Educational Testing (RCPET), National Taiwan Normal University (NTNU)
  Title:    Speech Recognition Technologies and Their Applications to Multimedia Information Access
2009/01/21 Google Taipei
  Title: Recent Developments in Chinese Spoken Document Search and Distillation
2008/11/06 CSIE Department, National Chiao Tung University (NCTU)
  Title:    Spoken Document Retrieval and Summarization
2008/05/16 CSIE Department, National Sun Yat-sen University (NSYSU)
  Title: Exploiting Feature Distribution Characteristics for Robust Speech Recognition
2008/04/17 IM Department, National Taiwan University of Science and Technology (NTUST)
  Title: Spoken Document Retrieval and Summarization
2008/03/28 CS Department, National Defense University (NDU)
  Title: Spoken Document Retrieval and Summarization
2007/11/29 CS Department, National Cheng-chi University (NCCU)
  Title: Spoken Document Recognition, Retrieval and Summarization
2007/08/10 Compal Communications, Inc.
  Title: Speech Recognition Technology and Its Applications
2007/05/30 Telecommunication Laboratories, Chunghwa Telecom Co., Ltd.
  Title: Spoken Document Recognition, Organization and Retrieval
2007/05/07 Graduate Institute of Library & Information Studies, National Taiwan Normal University (NTNU)
  Title: Recent Progress in Spoken Document Recognition, Organization and Retrieval
2006/11/22 CSIE Department, National Central University (NCU)
  Title: Extractive Chinese Spoken Document Summarization Using Probabilistic Ranking Models
2006/08/03 National Institute of Information and Communications Technology (NiCT), ATR-SLC, Japan
  Title: Chinese Spoken Document Recognition, Organization and Retrieval
2006/06/08 EE Department, National Chi Nan University (NCNU)
  Title: Chinese Spoken Document Recognition, Organization and Retrieval
2006/03/20 Graduate Institute of Information and Computer Education, National Taiwan Normal University (NTNU)
  Title: Exploring Probabilistic Latent Semantic Information for Speech Processing
2006/01/13 Telecommunication Laboratories, Chunghwa Telecom Co., Ltd.
  Title: Data-Driven and Discriminative Approaches to Transcription, Retrieval and Organization of Mandarin Spoken Documents
2005/12/23 2005 Information Processing Workshop (held at Institute of Information Science, Academia Sinica)
  Title: Chinese Spoken Document Recognition, Organization and Retrieval
2005/11/14 CSIE Department, National Taiwan University of Science and Technology (NTUST)
  Title: Chinese Spoken Document Recognition, Retrieval and Organization
2005/08/12 Information & Communications Research Laboratories, Industrial Technology Research Institute (ITRI)
  Title: Recent Approaches to Transcription, Retrieval and Organization of Mandarin Broadcast News Speech
2005/05/04 Delta Electronics, Inc.
  Title: Recent Approaches to Transcription, Retrieval and Organization of Mandarin Broadcast News Speech
2004/11/18 EE Department, National Chi Nan University (NCNU)
  Title: Chinese Speech Recognition and Information Retrieval
2004/10/18 Department of Information and Computer Education, National Taiwan Normal University (NTNU)
  Title: Chinese Speech Recognition and Information Retrieval
2004/09/22 CSIE Department, National Taiwan Normal University (NTNU)
  Title: Chinese Speech Recognition and Information Retrieval
2004/04/26 Press Conference, National Museum of History
  Title: The development of automatic categorized indexing and multi-modal retrieval techniques for multimedia content of digital libraries
2004/04/09 CSIE Department, National Cheng Kung University (NCKU)
  Title: Lightly Supervised and Data-Driven Approaches to Mandarin Broadcast News Transcription
2003/12/18 2003 Workshop on Digital Archives Technology and Innovalue, National Taiwan Normal University (NTNU)
  Title: Voice Retrieval of Mandarin Speech Information
2003/11/13 VisionNext Co., Ltd.
  Title: Voice Retrieval of Mandarin Speech Information
2003/10/16 Graduate Institute of Biomedical Informatics, Taipei Medical University
  Title: Chinese Speech Information Retrieval
2003/06/09 National Taipei University of Education (NTUE)
  Title: Speech Retrieval of Mandarin Broadcast News
2003/06/09 CSIE Department, National Tsing Hua University (NTHU)
  Title: Speech Retrieval of Mandarin Broadcast News
2002/04/26 2002 Speech Signal Processing Workshop
  Title: Retrieval of Mandarin Chinese Broadcast News via Speech
2001/07/23 Institute of Information Science, Academia Sinica
  Title: Speech Information Retrieval for Mandarin Chinese - Syllable-Based Indexing Features, Statistical Retrieval Models, Improved Approaches

 

-  Projects & Collaborations

- Ministry of Science and Technology (MOST) / National Science Council (NSC)

Title of Project Position Project Duration Status

Conversational Speech Summarization Leveraging Novel Neural Network Techniques (MOST 109-2221-E-003-020-MY3)

Principal Investigator 2020/8/1~2023/7/31 In Execution

A Study on Novel Meeting Speech Recognition Methods and Their Applications (MOST 108-2221-E-003-005-MY3)

Principal Investigator 2019/8/1~2022/7/31 In Execution

Exploration of Innovative Speech Summarization Methods and their Applications (MOST 107-2221-E-003-013-MY2)

Principal Investigator 2018/8/1~2020/7/31 Complete

Exploration of Novel Speech Recognition and Organization Techniques (MOST 105-2221-E-003-018-MY3)

Principal Investigator 2016/8/1~2019/7/31 Complete

Investigating Novel Modeling Techniques for Speech Summarization (MOST 104-2221-E-003-018-MY3)

Principal Investigator 2015/8/1~2018/7/31 Complete

Exploring Novel Language Modeling Techniques for Large Vocabulary Continuous Speech Recognition (MOST 103-2221-E-003-016-MY2)

Principal Investigator 2014/8/1~2016/7/31 Complete

Research on Spoken Document Summarization (NSC 101-2221-E-003-024-MY3)

Principal Investigator 2012/8/1~2015/7/31 Complete

Development of a Real-Time Speech Transcription System (NSC 102-2221-E-003-014-)

Principal Investigator 2013/8/1~2014/7/31 Complete

Investigating Advanced Language Models for Large Vocabulary Continuous Speech Recognition (NSC 99-2221-E-003-017-MY3)

Principal Investigator 2010/8/1~2013/7/31 Complete

A Study on Advanced Methods for Speech Search and Summarization (NSC 98-2221-E-003-011-MY3)

Principal Investigator 2009/8/1~2012/7/31 Complete

Scientific Exploration of Multimedia Technologies (NSC 99-2515-S-003-004-)

Principal Investigator 2010/5/1~2011/4/30 Complete

A Study on Robust Speech Feature Extraction Techniques (NSC 96-2628-E-003-015-MY3)

Principal Investigator 2007/8/1~2010/7/31 Complete

Investigating Minimum Phone Error Discriminative Training Approaches for Mandarin Large Vocabulary Continuous Speech Recognition (NSC 95-2221-E-003-014-MY3)

Principal Investigator 2006/8/1~2009/7/31 Complete

Advanced Research on Acoustic Modeling, Language Modeling and Acoustic Feature Extraction for LVCSR (NSC 94-2213-E-003-007)

Principal Investigator 2005/8/1~2006/7/31 Complete

Research on the Integration of Speech Recognition, Retrieval and Summarization (NSC 93-2213-E-003-004-)

Principal Investigator 2004/8/1~2005/9/31 Complete

A Study on Mandarin Chinese Speech Information Summarization (NSC 92-2213-E-003-008-)

Principal Investigator 2003/8/1~2004/7/31 Complete

A Study on Recognition and Retrieval of Chinese Speech Information (NSC 91-2218-E-003-002-)

Principal Investigator 2002/11/1~2003/7/31 Complete

- Aim for the Top University Plan

- Delta Project

- ASUS AICS

-  Prototype Systems

  Focus, Expertise, Patience & Speed