The FIRST Company to Develop Multimodal AI with 200+ Installations Worldwide

The FIRST Company to Develop Multimodal AI with 200+ Installations Worldwide

Top

Speech processing Engineer


Aimesoft is a pioneering multimodal Artificial Intelligence solution provider gathering insightful AI practitioners to solve real-world problems. Aimesoft’s mission is to transform business and life with the power of Multimodal AI. Our greatest asset is a team of more than 40 experts and research engineers with solid programming background and rich experience in working with challenging projects. Under the lead of Dr. Nguyen Tuan Duc (CEO) and with the consultation of scientists and experts from the University of Tokyo, Japan, Aimesoft is taking strong transformation steps to affirm the No. 1 place in providing Artificial Intelligence solutions to solve customers' business problems.

 

Job description:

-Research and develop Speech processing algorithms: noise cancellation/ noise filtering, speech recognition, speech synthesis, speaker recognition, speaker diarization, and direct translation of Japanese, English,and Vietnamese speeches.

-Build acoustic models, language models, and decoders; optimize dictionaries.

-Research and develop voice recognition models based on neural networks.

-Research and build large lexical training databases to ensure the coverage of regional accents, ages, genders, environments,..for specific scenarios.

-Research and develop machine learning models for speech synthesis, voice clones, voice converter… based on models such as HMM, DNN.

 

Requirements:

-Having the basic knowledge of digital signal processing, mathematical basis for digital signal processing(Fast Fourier Transform, spectrogram, signal filters…).

-Having the basic knowledge of building speech recognition/ synthesis models (acoustic model, language model, feature bank,...)

-Having the basic knowledge of machine learning, mastering the fundamental models used in voice processing: HMM, DNN, DTW.

-Having the basic knowledge of deep learning with neural networks and seq2seq models.

-Having experience in using frameworks and toolkits for voice recognition such as: Kaldi, Sphinx, Julius, HTK.

-Having proficiency in object-oriented programming with one of the following languages: C++/C, Python, Java.

-Having good English skill is preffered. 

 

Benefits:

-Tet Bonus and 13th salary.

-Incentive followed by the company policy.

-Insurance and annual health check programs.

-Approval of bonuses or salary increments every 3 months.

-Freedom to develop your own career and skill roadmap at the company, with development progress review every 3 months.

-Free training provided by leading lecturers and experts in the company.

-Team building and annual tourism.

-Work-hour: 8:30- 18:00, from Monday to Friday.

 

Deadline: Dec 31, 2023 

 

Contact:

-Email: jobs@aimesoft.com

-Mobile: (+84) 985 387 426 

-Address: 3F Hoang Ngoc Building, No.4, Ln.82 Dich Vong Hau, Cau Giay, Ha Noi

Copyright © 2024 Aimesoft. All Rights Reserved.