PAPER SUBMISSION FOR 2022
Whose submitted the latest research and development
Exploration of Semantic Information of Previous Sentences for Automatic Speech Recognition
Abstract
In a recent study, semantic information of the current sentence helps improve automatic speech recognition (ASR) performance in noisy environments. This work aims to improve the ASR system in noisy conditions by exploiting semantic information from previously recognized sentences to re-evaluate the N-best hypotheses list. The semantic probability score, used to reevaluate the N-best hypotheses list, is obtained by two approaches. The first approach is to use a deep neural network (DNN) semantic model with bidirectional encoder representations from transformers (BERT), namely P-BERT, to compare sentence hypotheses pairwise and choose the hypothesis with better semantic consistency. In the second approach, we exploit Universal Sentence Encoder, a pre-trained sentence encoding model based on transformer architecture. We represent previously recognized sentence and current sentence hypotheses as high dimensional vectors and compute the semantic distance between sentence vectors of previously recognized sentence and current sentence hypotheses. We perform experiments on the publicly available TED-LIUM corpus with different noise levels. We evaluate these two approaches using different context lengths. The proposed methods show the improvement of the ASR system over the baseline method, which only uses semantic information from the current sentence. Our experiment results show that most of the best results are obtained from the P-BERT rescoring method.
2022 Papers
Local and Global Orientation Correction for Oriented Human (Pose) Detection
Preliminary Study on SSCF-derived Polar Coordinate for ASR
Text Recognition on the Khmer Identification Cards and Its Application in Electronic Know Your Customer (e-KYC)
Cambodia Distributed Ledger – CamDL
Job Trends Analysis Using Power BI
Students’ Sentiment and Feedback Analysis on Online Learning System during COVID-19
Temperature Forcasting in Pnhom Penh Using Time Series Models
Eveluation of Regularization based Contiual Learning Alogorithm in the Context of Human Activity Recognition
Implementation of Deep Learning for Smart City Application: Lessons Learned
Intelligent Control in SDN/NFV-Empowered IoT System for Smart City Application
ENI-ETSI Meets the Proactive Network Solutions for Multi-tier Networking
ADDRESS
National Road 6A, Kthor, Prek Leap Chroy Changvar, Phnom Penh, Cambodia
CONTACT US
Phone: +855 10 344 040
Email: pr@cadt.edu.kh