0% Complete
صفحه اصلی
/
اولین همایش بین المللی هوش مصنوعی
Efficient DL Model for Voice Pathology Detection in Healthcare Applications using Sustained Vowels
نویسندگان :
Sahar Farazi
1
Yasser Shekofteh
2
1- Faculty of Computer Science and Engineering, Shahid Beheshti University
2- Faculty of Computer Science and Engineering, Shahid Beheshti University
کلمات کلیدی :
Voice Pathology Detection،Sustained Vowel،Feature extraction،MFCC،LPC،CNN
چکیده :
Voice Pathology Detection (VPD) aims to identify voice impairments through the analysis of speech signals, providing a foundation for developing diagnostic tools in advanced healthcare services to the public. This paper contributes to the development of an efficient and accurate model based on deep learning (DL) for automatic VPD using sustained vowels of speech data. Therefore, this study explores the comparative efficacy of Mel-Frequency Cepstral Coefficients (MFCCs) and Linear Predictive Coding (LPC) as acoustic features extracted from vowels /i/, /a/, and /u/. Using the AVFAD database, we utilized and optimized a Convolutional Neural Network (CNN) as a DL model to classify healthy and pathological voices, prioritizing both accuracy and computational efficiency for real-time applications. Our findings reveal that 20 MFCC features extracted from vowel /i/ achieve the highest accuracy, with the optimal model reaching approximately 88% on test data.
لیست مقالات
لیست مقالات بایگانی شده
Enhanced Early Diagnosis of Parkinson’s Disease via Transformer-Based Deep Learning and GAN-Augmented Handwriting Analysis
Fateme Darkhal - Seyyed Ali Zendehbad - Zahra Sedaghat
Examining Ethical Principles in the Development of AI for Environmental Protection with a Focus on Environmental Justice
Maryam Saadaat Nabavi Meybodi
Evaluating Parkinson’s Disease Severity Through Attention-Based STGCN and S2AGCN Models Utilizing Kinect Skeleton Images
Fatemeh Fadaei Ardestani - Nima Asadi
A Novel Fixed-Parameter Activation Function for Neural Networks: Enhanced Accuracy and Convergence on MNIST
Najmeh Hosseinipour-Mahani - Amirreza Jahantab
Empowering Decision-Making in Venture Investments: A Systematic Review of Machine Learning Applications for Predicting Startup Success
Seyed Mohammad Javad Toghraee - Hadi Nilforoushan - Nafiseh Sanaee
Intermediate Fine-Tuning for Robust Persian Emotion Detection in Text
Morteza Mahdavi Mortazavi - Mehrnoush Shamsfard
Reconstruction of ECoG signals in response to visual stimuli using a model based on convolutional and regression networks.
Mohammad Amin Lotfi - Kimiya ٍEghbal - Fateneh Zareayan Jahromy
A Hybrid Approach for Intrusion Detection in Computer Systems Using Optimized Deep Neural Networks
Yousef Nahi Salman - Maral Kolahkaj
Genetic algorithm-based hyperparameter optimization of convolutional neural network models for white blood cells classification
Ahmad Nasrollahpour - Mohammad Khanabadi Borchalouei - Toktam Khatibi
Potential of machine learning algorithms for predicting the properties of medium-density fiberboard (MDF): preliminary results
Rahim Mohebbi Gargari - Ali Shalbafan - Seyed Jalil Alavi - Maryam Amirmazlaghni - Seyed Hamzeh Sadatnejad - Heiko Thoemen
بیشتر
ثمین همایش، سامانه مدیریت کنفرانس ها و جشنواره ها - نگارش 42.5.4