Day 1: December 4,2023 | Monday | Time in IST | ||||
---|---|---|---|---|
Local Time (IST) | Session | Session Chair | Schedule | Venue |
9:00 - 10:00 | Registration process | Auditorium | ||
10:00 - 11:00 | Opening Ceremony | Auditorium | ||
11:00 - 11:30 | High Tea | Auditorium | ||
11:30 - 12:25 | Keynote 1 | Prof. S S Agrawal | Prof. Minematsu Nobuaki Graduate School of Engineering. The University of Tokyo |
Seminar Hall (Admin Block) |
12:25 -14:00 PM | Session 1: Speech Recognition & Synthesis | Prof. Rohit Sinha | PID:56 - Designing and Developing a Marathi Speech Database for Native and Non-Native Emotional Speech in the Marathi Language PID:63 - NayanCom - A Smart Patient Communication System PID:64 - Automated Diagnosis of Parkinson’s Disease using Speech Signals with Machine Learning PID:34 - VoiceBank-2023: A MultiSpeaker Mandarin Speech Corpus for Constructing Personalized TT S Systems for the Speech Impaired PID:80 - Enhancing Efficiency and Conservation via Speech Processing in Lutyens's Delhi Residential Revitalization PID:168 - CTC-Based End-to-End Speech Recognition for Low Resource Language Sanskrit PID:137 - Leveraging Cross Lingual Speech Representations To Build ASR For Under-resourced Languages PID:143 - Investigating Activation Functions to enhance speaker Identification with LSTM Networks |
Seminar Hall (Admin Block) |
12:25 -14:00 PM | Session 2 (Online):Speech Understanding Time zone: Asia/Kolkata Google Meet joining info Video call link: https://meet.google.com/gnuqfkd-sim Or dial:(US) +1732-661-8741 PIN: 178 513 671# | Prof. Ratnadeep Deshmukh | PID:8 - Construction and analysis of Tibetan Khampa dialect corpus for speech synthesis PID: 28 - Convolutional Recurrent Neural Network with Attention Mechanism and Feature Aggregation for Voice Activity Detection PID: 57 - An Isolated Words Balanced Corpus for Native and Non-Native Urdu Speakers in Automatic Speech Recognition PID:55 - Zero-shot Mispronunciation Detection by Knowledge-based Data Augmentation PID: 58 - Enhancing Sanskrit Isolated Word Recognition: A Comparative Analysis of MFCC and SVM Feature Integration PID: 94 - Heterogeneous Network Framework with Attention Mechanism of Speech Enhancement for Car Intelligent Cockpit Speech Recognition PID: 97 - Analysis of Non-Matching Reference Approach to Predict Speech Intelligibility |
Conference Hall (Admin Block) |
14:00 - 14:45 | Lunch Break | Outside Seminar Hall (Admin Block) | ||
14:45 - 15:40 | Keynote 2 | Prof. Satoshi Nakamura | Prof. Gerard Bailly CNRS, Grenoble-Alps Univ. & INP Grenoble | Seminar Hall(Admin Block) |
15:45 -17:15 | Session 3(Offline):Speech Processing | Prof. Samudra Vijaya K | PID:169 - Analysis and Classification of Dysarthric Speech PID:26 - Speech Watermarking for Tampering Detection Using Singular Spectrum Analysis with a Psychoacoustic Mode PID:84 - Generating Speech with Prosodic Prominence based on SSL-Visually Grounded Models. PID:107 - Leveraging Speech Recognition for Smart Urban Last Mile Connectivity Enhancement PID:140 - An Ensemble Approach for Speaker Recognition using Deep Neural Networks PID:39 - E2E Refined Dataset PID:125 - Beamforming using Different Window Techniques for Near Field Speech in Anechoic and Reverberant Environment |
Seminar Hall (Admin Block) |
15:45 -17:15 | Session 4(online): Speech Science Time zone: Asia/Kolkata Google Meet joining info Video call link: https://meet.google.com/hhchdoo-aht Or dial:(US) +1484-423-4014 PIN:897 270 687# | Prof. Karunesh Arora | PID:177 - Data augmentation strategies for low resource conversational code-switching PID: 99 - Empirical Analysis of Phonological and Prosodic Features of Native and NonNative Hindi Speakers PID:6 - Second Language Accent Perception and Language Attitude by Mandarin and Cantonese Speakers in Mainland China PID:36 - Acoustic Development of Vowel Production by Prelingually Deaf Chinese Mandarin-speaking Children with Cochlear Implants. PID:43 - Acoustic Features and Patterns of Chinese sibilants and English Fricatives by Native Uyghur Speakers. PID:54 - The Effects of Aging on Electroglottographic and Acoustic Parameters of Voices and the Detection of Change Points in Vocal Aging. PID:67 - A Research on Uygur Primary Teachers’ Production Characteristics and Hierarchy of Difficulty in Acquiring Vowels of Standard Chinese |
Conference Hall (Admin Block) |
17:15 - 17:45 | Tea Break concluding day 1 | Outside Seminar Hall (Admin Block) | ||
Day 2: December 5, 2023 | Tuesday | ||||
Local Time (IST) | Session | Session Chair | Schedule | Venue |
10:00 - 11:30 | Session 5 (Offline):Speech Linguistics | Prof. S. R. M. Prasanna | PID:38 - TAIWANESE HAKKA ACROSS TAIWAN CORPUS AND FORMOSA SPEECH RECOGNITION CHALLENGE 2023 – HAKKA ASR PID:142 - Fine-tuning the Wav2Vec2 Model for Automatic Speech Emotion Recognition System PID:13 - PERCEPTION OF LONG AND SHORT VOWEL CONTRAST IN MONGOLIAN PID:16 - Real-time Hate Speech Detection in Live Streaming Platforms using Quantum Machine Learning PID:25 - Prosody in Everyday Japanese Conversation at the Clause Final PID:27 - NITK-KLESC: Kannada Language Emotional Speech Corpus for Speaker Recognition PID:167 - Speech Recognition Applications in Enhancing Safety for Women in Built Environment 1 PID:123 - M-Diarization: A Myanmar Speaker Diarization using Multi-scale dynamic weights |
Conference Hall (Admin Block) |
10:00 - 11:30 | Session 6 (Offline): Cross Lingual Speech Analysis | Prof. Arun Sharma | PID:33 - Duration Properties and Contrast Preservation in Taifeng Tone Sandhi PID:81 - Emotion Prediction Using Multisource Biosignals During Cognitive Behavior Therapy with Conversational Virtual Agents PID:154 - Exploration of Speech Rhythm in Deori L1 and L2 PID:148 - Focus and Intonation in Dimasa PID:139 - ViQG: Web Tool for Automatic Question Generation from Code for Viva Preparation PID:141 - Efficiently Transferring Pretrained Language Model RoBERTa Base English to Hindi Using WECHSEL PID:48 - Comparative Analysis of Direct Speech-to-Speech Translation and Voice Conversion Using Bi-LSTM PID:76 - Spoken Language Identification Using Convolutional Neural Network In Nepalese Context |
Seminar Hall (Admin Block) |
11:30 - 11:45 | High Tea | Outside Seminar Hall (Admin Block) | ||
11:45 - 12:35 | Keynote 3 | Prof. Gerard Bailly | Prof. S R N Prasanna IIT Dharwad | Seminar Hall (Admin Block) |
12:35 -14:00 PM | Session 7 (Offline): Cross Cultural Speech Studies | Prof. Win Pa Pa | PID:53 - Advancing Transfer Learning Paradigms for Myanmar (Burmese) to Wa (Austroasiatic Language Family) Language Translation PID:156 - Optimizing Direct Speech-toText Translation for un-orthographic lowresource tribal languages using source transliteration PID:51 - Type-Token Analysis on LDC-IL Text Corpus PID:52 - Evaluation of Assamese Speech Data Transcriptions by Levenshtein Distance PID:147 - Empirical Analysis of Machine Learning Models on Parkinson's Speech Dataset PID:149: - The Potential of Speech Technology to Enhance the Quality of Life in Historic Cities PID:157 - Automatic Syllabification of Bengali in SPPAS |
Seminar Hall (Admin Block) |
14:00 - 14:45 | Lunch | Outside Seminar Hall (Admin Block) | ||
14:45 – 15:35 | Keynote 4 | Prof. Minematsu | Prof. Rohit Sinha Professor & HoD Department of Electronics and Electrical Engineering, IIT Guwahati | Seminar Hall (Admin Block) |
15:35 - 16:00 | Felicitation Ceremony | Seminar Hall (Admin Block) | ||
16:00 - 17:00 | Steering Committee Meet | Conference Hall (Admin Block) | ||
17:00 - 17:30 | Tea Break | Seminar Hall (Admin Block) | ||
17:30 - 18:00 | Poster Presentation | Admin Block | ||
18:00 PM onwards | Gala Dinner | India International Centre (IIC). Delhi India | ||
Day 3: December 6, 2023 | Wednesday | ||||
Local Time (IST) | Session | Session Chair | Schedule | Venue |
10:00 - 11:00 | Country Report | Seminar Hall (Admin Block) | ||
11:00 - 11:30 | High Tea | Outside Seminar Hall (Admin Block) | ||
11:30 - 12:25 | Keynote 5 | Prof. S Sakti | Prof. Joseph Mariani Directeur de Recherche Émérite au CNRS LISN Rue John von Neumann University Paris-Saclay | Seminar Hall (Admin Block) |
12:25 - 14:00 | Session 8 (Online): International Variations Time zone: Asia/Kolkata Google Meet joining info Video call link: https://meet.google.com/vsdnmoj-qed Or dial: (US) +1 224-458-3363 PIN: 540 389 505# | Prof. Karunesh Arora | PID:85 - An Experimental Study on Declarative and Interrogative Sentences in Shanghai Chinese PID:37 - Prosodic Encoding of Focus and Interrogative mood in Tianjin Dialect PID:41- Investigation of Validity of Paradigmatic Diagnosis for Downstep in Japanese PID:90 - Exploring Difficulties Encountered by Professional Interpreters in Japanese-toEnglish and English-to-Japanese Simultaneous Translation PID:153 - Feature Comparison for Speech Emotion Recognition on Hindi Language PID:164 - Few-shot meta multilabel classifier for low resource accented codeswitched speech PID:172 - Yangru Tone in Southern Min: Variation across Contexts |
Conference Hall (Admin Block) |
12:25 - 14:00 | Session 9 (Online): Language processing and NLU Time zone: Asia/Kolkata Google Meet joining info Video call link: https://meet.google.com/xffcmpf-irj Or dial: (US) +1 252-787-0077 PIN: 929 367 054# | Dr. Swaran Lata | PID:73 - The Development of a Thai Telephone Conversational Speech Corpus PID:47 - Speech Dataset Development for a Low-Resource Tibeto-Burman Tonal Language PID:42 - INTO_CASS_HEFEI: A Speech Corpus for Intonation and Prosody Study of Chinese Hefei Dialect PID:2 - TELUGU VAKYALU: SPOKEN TELUGU SENTENCES FOR IOT APPLICATIONS PID:92 - IIITH MM2 Speech-Text: A preliminary data for automatic spoken data validation with matched and mismatched speech-text content PID:49 - End-to-End Cross-Lingual Voice Conversion using CycleGAN for Low Resource Indian Languages |
Seminar Hall |
14:00 - 14:45 | Lunch | Outside Seminar Hall (Admin Block) | ||
14:45 -16:15 | Session 10 (Offline): Spoken and Written Language Repositories | Prof. Arjun Li / Prof S S Agrawal | PID:146 - Transforming Shahjahanabad into a Smart Heritage City Integrating Good Governance, Speech, and IoT Technologies for Sustainable Urban Development PID:151 - ASHI: A Database of Assamese Accented Hindi PID:155 - SPIRE-SIES: A Spontaneous Indian English Speech Corpus PID:161 - Design and Validation of HindiSER: Speech Emotion Recognition Dataset for Hindi Language PID:27 - NITK-KLESC: Kannada Language Emotional Speech Corpus for Speaker Recognition PID:163 - Designing an IVR-based Speech Data collection Framework for building Realistic Speech Corpus on Forensic Automatic Speaker Recognition PID:171 - A novel approach for bootstrapping and automatic transcription of low resourced language speech corpus PID: 10 - ChatterBot – An AI Conversational Entity |
Seminar Hall (Admin Block) |
16:15 – 16:30 | Break | |||
16:30 - 17:30 | Valedictory Ceremony | Seminar Hall (Admin Block) | ||
17:30 - 18:30 | Tea Break | Outside Seminar Hall (Admin Block) |