语音识别技术在孤独症谱系障碍诊断中的研究进展与应用展望
作者:
作者单位:

1.南京医科大学第四临床医学院;2.南京医科大学附属脑科医院

基金项目:

2024年度南京市卫生科技发展专项资金项目(医学重点科技发展项目)


Research Progress and Application Prospects of Speech Recognition Technology in Autism Spectrum Disorder Diagnosis
Fund Project:

Special Fund Project for the Development of Health Science and Technology in Nanjing in 2024 (Medical Key Science and Technology Development Project)

  • 摘要
  • | |
  • 访问统计
  • |
  • 参考文献
  • |
  • 相似文献
  • |
  • 引证文献
  • | |
  • 文章评论
    摘要:

    孤独症谱系障碍(ASD)是一种起病于发育早期的神经发育障碍,其核心症状主要表现在语言和社会交往方面。传统诊断工具(如ADOS-2)应用广泛,但在低龄儿童和基层医疗中可及性与客观性不足。语音识别技术因其量化能力强、成本较低,对婴幼儿早期言语异常具高敏感度,正成为ASD辅助诊断的重要方向。本文系统梳理了语音识别技术在不同年龄段早期筛查、共病情绪问题诊断、疾病严重程度评估及多模态融合等方面的最新研究。结果显示,通过提取基频、语速、停顿等声学特征,可有效区分ASD个体与典型发育儿童,并识别焦虑、抑郁及ADHD等共病。多模态融合(如脑影像、生理信号和行为数据)可进一步提升诊断准确率。但数据多样性不足、方言与年龄适用性局限、共病干扰及隐私保护等挑战依然突出。

    Abstract:

    Autism spectrum disorder (ASD) is a neurodevelopmental disorder that typically arises in early childhood, with core symptoms predominantly manifesting in language and social interaction. Although traditional diagnostic tools, such as the Autism Diagnostic Observation Schedule-2 (ADOS-2), are widely used, they often have limited accessibility and objectivity in younger children and in primary healthcare settings. Due to its strong quantitative capabilities, relatively low cost, and high sensitivity to early speech anomalies in infants, speech recognition technology has emerged as a promising avenue for ASD diagnostic support. This paper systematically reviews the latest research on the application of speech recognition in early screening across different age groups, diagnosis of comorbid emotional issues, assessment of disease severity, and multimodal data integration. The findings show that extracting acoustic features—such as fundamental frequency, speech rate, and pauses—can effectively distinguish individuals with ASD from typically developing children, while also identifying comorbidities like anxiety, depression, and ADHD. Furthermore, multimodal fusion (e.g., neuroimaging, physiological signals, and behavioral data) can further improve diagnostic accuracy. Nevertheless, challenges persist, including inadequate data diversity, limitations related to dialect and age applicability, confounding effects of comorbid conditions, and concerns over privacy.

    参考文献
    相似文献
    引证文献
引用本文
分享
文章指标
  • 点击次数:
  • 下载次数:
  • HTML阅读次数:
  • 引用次数:
历史
  • 收稿日期:2024-11-13
  • 最后修改日期:2025-01-25
  • 录用日期:2025-03-07
关闭