CHOSUN

Image Analysis based on AI application for Manufacturing Automation

Metadata Downloads
Author(s)
키벳 던칸
Issued Date
2022
Keyword
Abalone detection and counting Machine learning Yolo-V3 TensorFlow OpenCV Darknet-53,Normal and Abnormal classification, Grad-CAM, ResNet50 model, Convolutional Neural Network (CNN)
Abstract
Abalone detection and counting systems via conveyor belts using machine learning have become a critical and most vital aspect in the fishing industry. In our paper, we present a detection and counting approach for Multi-Object Tracking (MOT) from abalone video data. Considering computational effectiveness and improved detection algorithms, Multiple Object and Tracking is one of the most sort out paradigms. You only look once (YOLOV3) with Tensor Flow algorithm built based on Deep SORT algorithm mainly implemented for MOT specification. Other methods used had low effectiveness with high operational time and wrong counting rate because some of the abalones stick to each other most of the time.
However, to elaborate the challenges posted by the video data of abalone entangling and pairing together on multiple occasions which is hard to count, we tested with the open source-based algorithm (OpenCV) which did detection but incorrect tracking and sorting. After combining Yolov3 and Tensor Flow for generating detections in each frame based on the model (Darknet-53), we use depth separable convolutions and pointwise group convolutions to reduce the parameter size of the network, the results of our experiment exhibited competitive performance in comparison with the open source-based library OpenCV. This proposed algorithm can apply in various conveyors counting controlled-based systems.
We also propose a technique for classifying normal and abnormal images from a broad class of Convolutional Neural Network (CNN) models, data classification more clear and explainable. We introduced Gradient-weighted Class Activation Mapping (Grad-CAM) that uses the gradients of any objective approach of normal or abnormal images fundamentally into the ultimate convolutional layer to yield a coarse localization map for highlighting the critical sectors in the image for predicting the approach. In contrast to previous approaches, Grad-CAM applies to a wide variety of CNN models: CNNs with fully connected layers (e.g. ResNet50 model) and CNN models used in line with various inputs (e.g. visualization) or reinforcement learning, without architectural transformation or re-training.
We incorporate Grad-CAM with current existing fine grained visualizations to generate a high-resolution class discriminative factor for visualization. Guided Grad-CAM is then applied for image classification, captioning based on pre-trained ResNet50 architectures. For image captioning, our visualizations show that even ResNet50 based models learn to localize discriminative regions of the input image. We develop identification of essential neurons via Grad-CAM and incorporate it with neuron names to provide textual explanations for model decisions. Finally, the used test data to establish appropriate trust in predictions from the model and show that Grad-CAM can give accurate predictions from deep networks and show that Grad-CAM helps to make a more accurate prediction even close to identical images either normal or abnormal.
Alternative Title
제조 자동화를 위한 AI 애플리케이션 기반 영상 분석
Alternative Author(s)
KIBET DUNCAN
Affiliation
조선대학교 일반대학원
Department
일반대학원 산업공학과
Advisor
김규태
Awarded Date
2022-02
Table Of Contents
List of figures. iii
List of tables v
Abstract vi

1. Introduction 1
1.1 State-of-the-art 2
1.2 Image processing 3
1.3 Contributions 7
1.4 Thesis Layout 7

2. Theory and Background 9
2.1 YOLOv3 Object detector and its working 9
2.1.1 Architecture (Feature Extractor Network) 9
2.1.2 Bounding box prediction 11
2.1.3 Feature Pyramid network (FPN) 12
2.1.4 Class prediction 13

3. Machine Learning. 14
3.1 Deep SORT 15
3.1.1 Estimation Model 15
3.1.2 Data assocoation 17
3.1.3 How the Hungarian algorithm and Kalman filter work 17
3.1.4 Creation and deletion of track identities 19

4. Yolov3 and Sort for object detection and tracking 21
4.1 Train and evaluation Yolov3 to detect Abalone 21
4.1.1 Tools and frameworks 22
4.2 Training and evaluaton 23
4.3 Tests on the detection and tracking model 26

5. Grad-CAM via Gradient-based Localization 29
5.1.1 Data Sets 30
5.1.2 Weakly-supervised Localization 30
5.1.3 Visualization. 31
5.1.4 CAM vs. Grad-CAM 35
5.1.5 Guided Grad-CAM . 39
5.1.6 Training and evaluation 40
5.1.7 Confusion Matrix 46

6. Results and Discussion 54

7. Conclusion. 54

References 57
Degree
Master
Publisher
조선대학교 대학원
Citation
키벳 던칸. (2022). Image Analysis based on AI application for Manufacturing Automation.
Type
Dissertation
URI
https://oak.chosun.ac.kr/handle/2020.oak/17251
http://chosun.dcollection.net/common/orgView/200000589773
Appears in Collections:
General Graduate School > 3. Theses(Master)
Authorize & License
  • AuthorizeOpen
  • Embargo2022-02-25
Files in This Item:

Items in Repository are protected by copyright, with all rights reserved, unless otherwise indicated.