CHOSUN

기계학습과 RGB 영상처리를 이용한 수어 동작 인식 방안

Metadata Downloads
Author(s)
김건우
Issued Date
2024
Abstract
Sign language motion recognition using machine learning and RGB image processing Kun-Woo Kim Advisor: Prof. Hyungjun Park, Ph.D. Department of Industrial Engineering Graduate School of Chosun University This study proposes a method to recognize sign language motions using machine learning and RGB image processing. The sign language motions for recognition are selected from 24 sign language words frequently used in the living spaces of the deaf community. For this, 180 sign language motions were acquired using the National Institute of Korean Language's Korean Sign Language Dictionary and public data services. Subsequently, skeleton data was tracked and visualized using OpenCV and MediaPipe, and key points of the human body were extracted to construct the training data. The training data consists of raw-data representing the location data (255 points) of the extracted features, and feature-data representing vector information (54 points) made based on the location data. After that, unnecessary frames were removed through data preprocessing, and the data was processed into a form suitable for the learning model. As a result of training input data on convolutional neural network models and recurrent neural network models using two types of training data, the CNN model using raw-data showed an accuracy of 92.58%, and the LSTM model showed an accuracy of 96.64%. On the other hand, the CNN model using feature-data showed an accuracy of 98.75%, and the LSTM model achieved an accuracy of 99.85%. To verify the usefulness of the developed model, an experimental environment was set up, and 24 new sign language words were filmed using a mobile phone camera and a webcam. As a result of evaluating the performance of the model using the filmed videos, the CNN model using raw-data showed an accuracy of 79.16%, and the LSTM model showed an accuracy of 87.5%, whereas the CNN and LSTM models using feature-data each recorded 91.66%. Through this, it was confirmed that feature-data using vector information is more effective in recognizing sign language motions than location data of features. However, recognition errors of the model for some sign language words were confirmed, which is due to similar types of motions. It was concluded that it is necessary to construct additional training data and tune parameters to solve this. The results of this study are expected to make a meaningful advancement in promoting communication between sign language users and non- sign language users, and it is intended to be used in creating sign language avatar animations and real-time sign language interpretation systems. In the future, it is expected that the daily life of sign language users will be further improved through additional research for model performance improvement and expanding the sign language motion recognition model to recognition models that can be used in various fields.
Alternative Title
Sign language motion recognition using machine learning and RGB image processing
Alternative Author(s)
KunWooKim
Affiliation
조선대학교 일반대학원
Department
일반대학원 산업공학과
Advisor
박형준
Awarded Date
2024-02
Table Of Contents
제 1 장 서론 1
제 1 절 연구 배경 1
제 2 절 연구 목적 3
제 3 절 논문 구성 5
제 2 장 기존연구 고찰 6
제 1 절 센서 및 카메라 기반 동작 인식 . 6
1.1 센서를 이용한 동작 인식 . 6
1.2 카메라를 이용한 동작 인식 9
제 2 절 기계학습을 이용한 동작 인식 13
2.1 심층 신경망 동작 인식 14
2.2 순환 신경망 기반 동작 인식 19
제 3 장 수어 동작 인식을 위한 기계학습 모델 . 23
제 1 절 학습용 수어 단어의 범위 23
1.1 수어 단어 선정 23
1.2 수어 동영상 수집 및 분석 24
제 2 절 기계학습을 위한 골격 데이터 추적 28
2.1 RGB 영상처리 및 시각화 . 28
2.2 MediaPipe 를 활용한 골격 데이터 추적 29
제 3 절 수어 동작 인식 모델 개발 33
3.1 골격 데이터를 이용한 학습 데이터 구축 . 33
3.2 학습 데이터 전처리 46
3.3 기계학습 모델 개발 및 평가 47
제 4 장 수어 동작 인식 모델 구현 및 검증 . 52
제 1 절 비교 및 검증 52
1.1 수어 단어 인식 모델 성능 비교 53
1.2 유용성 검증 55
제 5 장 결론 및 토의 56
참고문헌 58
Degree
Master
Publisher
조선대학교 대학원
Citation
김건우. (2024). 기계학습과 RGB 영상처리를 이용한 수어 동작 인식 방안.
Type
Dissertation
URI
https://oak.chosun.ac.kr/handle/2020.oak/17995
http://chosun.dcollection.net/common/orgView/200000719877
Appears in Collections:
General Graduate School > 3. Theses(Master)
Authorize & License
  • AuthorizeOpen
  • Embargo2024-02-23
Files in This Item:

Items in Repository are protected by copyright, with all rights reserved, unless otherwise indicated.