기계 번역 구조를 활용한 문장형 수학 문제풀이에 관한 연구
- Author(s)
- 김강민
- Issued Date
- 2024
- Keyword
- 딥러닝|자연어 처리|자연어 이해|기계 번역|문장형 수학 문제풀이
- Abstract
- Deep learning has been developed and applied to natural language processing and applied to a variety of fields. However, even when large scale language models are utilized, they lead to unstable results for questions that require mathematical reasoning. To complement these weaknesses and expand the application areas of language models, language understanding-based mathematical inference learning is needed. Research on MWP (Math Word Problem) is a task that began in the 1960s to infer mathematical expressions through language understanding. MWP refers to problems consisting of mathematical logic, numbers, and natural language. MWP solvers, which are based on language models, must accept the problem text as input and make mathematical inferences after understanding the context. However, designing an MWP solver has been rated challenging due to differences in the inherent meanings of words and prior knowledge understood by humans and machines, and deep Research on learning models is necessary. Among the research on MWP solvers is not only the presentation of model designs and methodologies, but also the disclosure of datasets for different types of learning. However, English and Chinese-based datasets are mainly published, and Korean datasets are lacking. In this study, I explores the design of an equation generation model to narrow down the semantic gap by leveraging the Conformer model proposed in the speech recognition domain, and implement a Korean-based MWP generator for collecting large amounts of Korean data. The data generator proposed in this study is based on the elementary mathematics subject level and consists of four types: arithmetic operations, ordering, finding unknown, and geometry. Each type is made up of sub types, for a total of 42 subtypes. Additionally, four data variations are applied during data generation to design a robust model. A total of 210,311 pieces of data were used in the experiment. Of this, 210,000 pieces of data are generated data. The training dataset consists of 150,000 pieces of data, and the validation and evaluation datasets each consist of 30,000 pieces. The 311 pieces of data are real datasets collected from commercially available math problem books. The real dataset was for additional experiments to evaluate the effectiveness of the proposed data generator and MWP solver with Conformer structure. Experiments confirmed that the Conformer model had an accuracy of 91.19% on the produced dataset and 42.76% on the real dataset, outperforming the Sequence-to-Sequence and Transformer models. Although the performance declined on the real dataset due to problem types and words that the model was not trained on, the performance of the Conformer model suggests that the proposed data generator partially reflects real math problems. In this study, I designed an MWP solver by applying other domain models and generated the data necessary for learning the Korean MWP solver. When applied to math education and the service industry, these models can help improve convenience and productivity. However, the performance decreased due to the difference between the proposed data generator and the actual math problem. In addition to complementing the problem types of data generators, future research will need to design MWP solvers with enhanced context understanding capabilities by leveraging large-scale pre-trained models.
- Alternative Title
- A Study on Solving Math Word Problem Using Machine Translation Structure
- Alternative Author(s)
- Kim Kangmin
- Affiliation
- 조선대학교 일반대학원
- Department
- 일반대학원 컴퓨터공학과
- Advisor
- 전찬준
- Awarded Date
- 2024-02
- Table Of Contents
- Ⅰ. 서론 1
A. 연구 배경 및 목적 1
B. 연구 방법 및 내용 5
Ⅱ. 관련 연구 7
A. 전통적 접근방식을 활용한 문장형 수학 문제풀이 7
B. 딥러닝 접근방식을 활용한 문장형 수학 문제풀이 9
C. Sequence-to-Sequence 구조의 풀이모델 10
D. Transformer 구조의 풀이모델 13
E. 단어 임베딩 16
F. 한국어 토큰화 19
Ⅲ. 한국어 문장형 수학 문제 생성기와 풀이모델 20
A. 데이터 생성기의 수학 문제 유형 21
B. 한국어 특성 반영을 위한 데이터 변환 27
C. 한국어 문장형 수학 문제의 규칙 29
D. Conformer 구조의 풀이모델 30
Ⅳ. 실험 수행 및 결과 33
A. 데이터 세트와 평가지표 33
B. 딥러닝 기반 문제 풀이모델의 성능 35
1. 생성된 데이터 세트에서의 모델별 성능 결과 36
2. Conformer 구조와 데이터 생성기의 유효성 평가 41
Ⅴ. 결론 45
참고문헌 47
- Degree
- Master
- Publisher
- 조선대학교 대학원
- Citation
- 김강민. (2024). 기계 번역 구조를 활용한 문장형 수학 문제풀이에 관한 연구.
- Type
- Dissertation
- URI
- https://oak.chosun.ac.kr/handle/2020.oak/17994
http://chosun.dcollection.net/common/orgView/200000719422
-
Appears in Collections:
- General Graduate School > 3. Theses(Master)
- Authorize & License
-
- AuthorizeOpen
- Embargo2024-02-23
- Files in This Item:
-
Items in Repository are protected by copyright, with all rights reserved, unless otherwise indicated.