skip to main content
10.1145/3462203.3475912acmconferencesArticle/Chapter ViewAbstractPublication PagesgooditConference Proceedingsconference-collections
short-paper

Deep learning and collaborative training for reducing communication barriers with deaf people

Published: 09 September 2021 Publication History

Abstract

Building bridges to facilitate communication between deaf and hearing people is a pending issue that can no longer be postponed in a technologically advanced society. In this work, we propose a multiplatform service that allows gradually building a collaborative platform to bring the interests of deaf and hearing people closer together. The proposal consists of a client-server architecture with two main parts: a multiplatform interface for video capture and playback, and a sign language recognizer system composed of a skeleton data extractor and a Graph Convolutional Neural Network. The result is a proof-of-concept of the first thematic dictionary accessible through sign language.

References

[1]
Danielle Bragg, Oscar Koller, Mary Bellard, Larwan Berke, Patrick Boudreault, Annelies Braffort, Naomi Caselli, Matt Huenerfauth, Hernisa Kacorri, Tessa Verhoef, Christian Vogler, and Meredith Ringel Morris. 2019. Sign Language Recognition, Generation, and Translation: An Interdisciplinary Perspective. In The 21st International ACM SIGACCESS Conference on Computers and Accessibility, ASSETS 2019, Pittsburgh, PA, USA, October 28-30, 2019, Jeffrey P. Bigham, Shiri Azenkot, and Shaun K. Kane (Eds.). ACM, 16--31.
[2]
Zhe Cao, Gines Hidalgo, Tomas Simon, Shih-En Wei, and Yaser Sheikh. 2021. OpenPose: Realtime Multi-Person 2D Pose Estimation Using Part Affinity Fields. IEEE Transactions on Pattern Analysis and Machine Intelligence 43, 1 (2021), 172--186. https://doi.org/10.1109/TPAMI.2019.2929257
[3]
Helen Cooper, Brian Holt, and Richard Bowden. 2011. Sign Language Recognition. Springer London, London, 539--562.
[4]
Laura Docío-Fernández, José Luis Alba-Castro, Soledad Torres-Guijarro, Eduardo Rodríguez-Banga, Manuel Rey-Area, Ania Pérez-Pérez, Sonia Rico-Alonso, and Carmen García-Mateo. 2020. LSE_UVIGO: A Multi-source Database for Spanish Sign Language Recognition. In Proceedings of the LREC2020: 9th Workshop on the Representation and Processing of Sign Languages: Sign Language Resources in the Service of the Language Community, Technological Challenges and Application Perspectives. 45--52.
[5]
Ziyu Liu, Hongwen Zhang, Zhenghao Chen, Zhiyong Wang, and Wanli Ouyang. 2020. Disentangling and Unifying Graph Convolutions for Skeleton-Based Action Recognition. In 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2020, Seattle, WA, USA, June 13-19, 2020. IEEE, 140--149. https://doi.org/10.1109/CVPR42600.2020.00022
[6]
Razieh Rastgoo, Kourosh Kiani, and Sergio Escalera. 2021. Sign Language Recognition: A Deep Survey. Expert Syst. Appl. 164 (2021), 113794. https://doi.org/10.1016/j.eswa.2020.113794
[7]
Ozge Mercanoglu Sincan, Julio C. S. Jacques Junior, Sergio Escalera, and Hacer Yalim Keles. 2021. ChaLearn LAP Large Scale Signer Independent Isolated Sign Language Recognition Challenge: Design, Results and Future Research. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops.
[8]
Ozge Mercanoglu Sincan and Hacer Yalim Keles. 2020. AUTSL: A Large Scale Multi-Modal Turkish Sign Language Dataset and Baseline Methods. IEEE Access 8 (2020), 181340--181355. https://doi.org/10.1109/ACCESS.2020.3028072
[9]
Manuel Vázquez-Enríquez, Jose L. Alba-Castro, Laura Docío-Fernández, and Eduardo Rodríguez-Banga. 2021. Isolated Sign Language Recognition with Multi-Scale Spatial-Temporal Graph Convolutional Networks. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops.
[10]
Sijie Yan, Yuanjun Xiong, and Dahua Lin. 2018. Spatial Temporal Graph Convolutional Networks for Skeleton-Based Action Recognition. CoRR abs/1801.07455 (2018). arXiv:1801.07455 http://arxiv.org/abs/1801.07455

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
GoodIT '21: Proceedings of the Conference on Information Technology for Social Good
September 2021
345 pages
ISBN:9781450384780
DOI:10.1145/3462203
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 09 September 2021

Permissions

Request permissions for this article.

Check for updates

Badges

  • Best Short Paper

Author Tags

  1. client-server architecture
  2. datasets
  3. deep neural networks
  4. sign language recognition
  5. transfer learning

Qualifiers

  • Short-paper
  • Research
  • Refereed limited

Conference

GoodIT '21
Sponsor:

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 131
    Total Downloads
  • Downloads (Last 12 months)9
  • Downloads (Last 6 weeks)0
Reflects downloads up to 10 Nov 2024

Other Metrics

Citations

View Options

Get Access

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media