0 like 0 dislike
8.8k views
in AI-ML-Data Science Projects by (279 points)

Abstract- Communication is the main channel through which people interact with one another. In recent years, there has been a rapid increase in the number of people who are deaf or mute due to birth defects, accidents and oral diseases. Since deaf and mute people cannot speak with hearing people directly, they have to depend on some form of visual communication, such as sign language or lip reading, and these messages are often misinterpreted. This project is made to help these specially challenged people hold equal par in society.






2 Answers

1 like 0 dislike
by (279 points)
selected by
 
Best answer

Hand Gesture Recognition

Purpose of the model- The main challenge these special people face is the communication gap between them and everyone else. Deaf and mute people often find it difficult to communicate with hearing people; this challenge makes them uncomfortable and leaves them feeling discriminated against in society. Because of this miscommunication, deaf and mute people may stop communicating altogether and are never able to express their feelings. The HGRVC (Hand Gesture Recognition and Voice Conversion) system localizes and tracks the hand gestures of deaf and mute people in order to maintain a communication channel with other people.

General idea about our model- Hand gestures are detected using a web camera. The captured pictures are converted to a standard size during pre-processing. The aim of this project is to develop a system that converts hand gestures into text: images are placed in a database, the captured image is matched against that database, and the matched image is converted into text. The detection involves observing the hand's movement, and the text output helps reduce the communication gap between deaf-mute and hearing people.

Architecture of our model

Here we start the implementation of our model-

First, we will train a CNN model on a large number of images of hand gestures.

Why CNN: As we saw in the CNN tutorial, a CNN can process a very large image in a simple manner. CNNs are the networks most commonly used to analyze visual imagery and frequently work behind the scenes in image classification; a small comparison is sketched below.
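To make the difference concrete, here is a minimal sketch (the 64x64 grayscale input shape is assumed to match what we use later) comparing the parameter counts of a convolutional layer and a dense layer applied to the same image:

from keras.layers import Conv2D, Dense, Input, Flatten
from keras.models import Model

inp = Input(shape=(64, 64, 1))
# Conv2D reuses one small 3x3 kernel across the whole image:
# 3*3*1*32 weights + 32 biases = 320 parameters
conv_model = Model(inp, Conv2D(32, (3, 3), padding='same')(inp))
# Dense connects every pixel to every unit:
# 64*64*1*32 weights + 32 biases = 131,104 parameters
dense_model = Model(inp, Dense(32)(Flatten()(inp)))
conv_model.summary()
dense_model.summary()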

For the official documentation, see keras.io for Keras and tensorflow.org for TensorFlow.

Import libraries-

import numpy as np
import matplotlib.pyplot as plt
import utils
import os
%matplotlib inline

from keras.preprocessing.image import ImageDataGenerator
from keras.layers import Dense, Input, Dropout, Flatten, Conv2D
from keras.layers import BatchNormalization, Activation, MaxPooling2D
from keras.models import Model, Sequential
from keras.optimizers import Adam
from keras.callbacks import ModelCheckpoint, ReduceLROnPlateau
from keras.utils import plot_model
from IPython.display import SVG, Image
import tensorflow as tf

print("Tensorflow version:", tf.__version__)

Download the dataset from here, or create a dataset by yourself.

  1. In this part of the code, we have imported Keras and its libraries/layers.
  2. The way Keras and Keras models are imported changes with the TensorFlow version, so check the official documentation (links given above). TensorFlow version 1 is used here; a TensorFlow 2.x equivalent is sketched below.
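For reference, on TensorFlow 2.x the same imports live under tensorflow.keras (a sketch based on the official module paths; note that there Adam takes learning_rate instead of lr):

from tensorflow.keras.preprocessing.image import ImageDataGenerator
from tensorflow.keras.layers import Dense, Input, Dropout, Flatten, Conv2D
from tensorflow.keras.layers import BatchNormalization, Activation, MaxPooling2D
from tensorflow.keras.models import Model, Sequential
from tensorflow.keras.optimizers import Adam
from tensorflow.keras.callbacks import ModelCheckpoint, ReduceLROnPlateau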

To install a library, run this command in the Anaconda prompt- pip install "module name"

For example- pip install utils

Let's check our dataset-

for expression in os.listdir("C:/Users/lenovo/signlang/train/"):
    print(str(len(os.listdir("C:/Users/lenovo/signlang/train/"+expression)))+" "+expression+' images')

Here we get the number of files in our dataset: we give the path of our train dataset to see how many files we have for each class. An equivalent check with a reusable path variable is sketched below.
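The same count written more compactly (a sketch; it assumes the train folder contains only class sub-folders):

train_dir = "C:/Users/lenovo/signlang/train"   # same path as above
counts = {c: len(os.listdir(os.path.join(train_dir, c))) for c in os.listdir(train_dir)}
print(counts, "| total:", sum(counts.values()))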

(Image: sample hand gesture images from the dataset)

pre-process the dataset-

img_size=64
batch_size=64

datagen_train=ImageDataGenerator(horizontal_flip=True)
train_generator=datagen_train.flow_from_directory("C:/Users/lenovo/signlang/train",
                                                 target_size=(img_size,img_size),
                                                 color_mode='grayscale',
                                                 batch_size=batch_size,
                                                 class_mode='categorical',
                                                 shuffle=True)

# Validation data needs no augmentation, so this generator has no flips;
# shuffle=False keeps the evaluation order stable.
datagen_validation=ImageDataGenerator()
validation_generator=datagen_validation.flow_from_directory("C:/Users/lenovo/signlang/test",
                                                 target_size=(img_size,img_size),
                                                 color_mode='grayscale',
                                                 batch_size=batch_size,
                                                 class_mode='categorical',
                                                 shuffle=False)

Here we pre-process our dataset: train_generator feeds the training images to the model, with every image converted to grayscale, and validation_generator is used to check the accuracy of the model. A quick sanity check on the generators is sketched below.
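Before training, it is worth confirming the class ordering and batch shapes the generators produce:

x_batch, y_batch = next(train_generator)
print(train_generator.class_indices)  # class name -> index, in alphabetical folder order
print(x_batch.shape, y_batch.shape)   # expected: (64, 64, 64, 1) and (64, number_of_classes)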

preparing our model-

model=Sequential()

#1st conv layer
model.add(Conv2D(64,(3,3),padding='same',input_shape=(64,64,1)))
model.add(BatchNormalization())
model.add(Activation('relu'))
model.add(MaxPooling2D(pool_size=(2,2)))
model.add(Dropout(0.25))

#2nd conv layer
model.add(Conv2D(128,(5,5),padding='same'))
model.add(BatchNormalization())
model.add(Activation('relu'))
model.add(MaxPooling2D(pool_size=(2,2)))
model.add(Dropout(0.25))

#3rd conv layer
model.add(Conv2D(512,(3,3),padding='same'))
model.add(BatchNormalization())
model.add(Activation('relu'))
model.add(MaxPooling2D(pool_size=(2,2)))
model.add(Dropout(0.25))

#4th conv layer
model.add(Conv2D(512,(3,3),padding='same'))
model.add(BatchNormalization())
model.add(Activation('relu'))
model.add(MaxPooling2D(pool_size=(2,2)))
model.add(Dropout(0.25))

#fully connected layers
model.add(Flatten())
model.add(Dense(256))
model.add(BatchNormalization())
model.add(Activation('relu'))
model.add(Dropout(0.25))

model.add(Dense(512))
model.add(BatchNormalization())
model.add(Activation('relu'))
model.add(Dropout(0.25))

#output layer: one unit per gesture class
model.add(Dense(6,activation='softmax'))

opt=Adam(lr=0.0005)   #lr - learning rate
model.compile(optimizer=opt,loss='categorical_crossentropy',metrics=['accuracy'])
model.summary()

Here we prepare our CNN model: four convolutional layers followed by two dense layers, which form the fully connected part of the network.

We use the Adam optimizer with an initial learning rate of 0.0005. The learning rate stays fixed unless a schedule is attached; the ReduceLROnPlateau callback imported above can lower it automatically when the validation loss stops improving, as sketched below.
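A sketch of how the two callbacks imported earlier could be wired up (the callbacks= argument in model.fit below is commented out; pass this list there to enable them; the checkpoint filename is a hypothetical example):

checkpoint = ModelCheckpoint('hand_gesture_best.h5', monitor='val_loss',
                             save_best_only=True)   # hypothetical filename
reduce_lr = ReduceLROnPlateau(monitor='val_loss', factor=0.5,
                              patience=2, min_lr=1e-6)
callbacks = [checkpoint, reduce_lr]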

Training the model-

epochs=15
steps_per_epoch=train_generator.n//train_generator.batch_size
validation_steps=validation_generator.n//validation_generator.batch_size

history=model.fit(
    x=train_generator,
    steps_per_epoch=steps_per_epoch,
    epochs=epochs,
    validation_data=validation_generator,
    validation_steps=validation_steps,
 #   callbacks=callbacks
)

model.save('hand_gesture.h5')

Here we train our model using the model.fit() method; in our runs it gives roughly 90 to 95% accuracy. The training curves can be plotted from the returned history object, as sketched below.
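A sketch for plotting the curves (the loss keys are stable across Keras versions; the accuracy keys are 'acc'/'val_acc' or 'accuracy'/'val_accuracy' depending on the version):

plt.plot(history.history['loss'], label='train loss')
plt.plot(history.history['val_loss'], label='validation loss')
plt.xlabel('epoch')
plt.legend()
plt.show()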

by (471 points)
In this model you use a numeric dataset like (1, 2, 3, 4), but in the live video output the picture shows alphabets like (a, b, c) etc. Why?
by (279 points)
edited by
Yes, because it is just an example: I have used the numeric dataset, but it also works fine for other hand gestures and alphabets. I added pictures of hand gestures here, but the code is written for the numeric dataset; you can change the categories for different datasets.
0 like 0 dislike
by (279 points)
edited by

To check the validation accuracy of our model-

test_datagen = ImageDataGenerator()
val = test_datagen.flow_from_directory(
        'path of validation folder', shuffle=True, target_size=(64, 64),
        batch_size=32, color_mode='grayscale', class_mode='categorical')

_, acc = model.evaluate(val, verbose=0)
print('>%.3f' % (acc*100.0))

Now we take an image from our dataset, give that image to our model, and check whether it gives the right prediction or not.

So here is the code for that-

import numpy as np
from keras.preprocessing import image

test_image = image.load_img('path for our image', target_size=(64,64), color_mode="grayscale")
plt.imshow(test_image)
test_image = image.img_to_array(test_image)
test_image = np.expand_dims(test_image, axis=0)

result = model.predict(test_image)
a = result.argmax()
s = train_generator.class_indices   # maps class name -> index
#print(s)
name = []
for i in s:
    name.append(i)
for i in range(len(s)):
    if(i == a):
        q = name[i]
print(q)
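As a side note, the class-name lookup above can be written more compactly by inverting class_indices once (a sketch, equivalent to the loop above):

idx_to_class = {v: k for k, v in train_generator.class_indices.items()}
print(idx_to_class[result.argmax()])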

So here we can see the model's predictions on images, but our main aim is to check whether this model works on a live webcam or not.

For that we have written the following code-

#first we load our model
from tensorflow.keras.models import load_model
loaded_model=load_model('hand_gesture.h5')

from tensorflow.keras.preprocessing import image
import numpy as np
import cv2

#we are starting our web cam
cap = cv2.VideoCapture(0)

# Category dictionary
categories = {0: 'ZERO', 1: 'ONE', 2: 'TWO', 3: 'THREE', 4: 'FOUR', 5: 'FIVE'}

while True:
    _, frame = cap.read()
    # Simulating mirror image
    frame = cv2.flip(frame, 1)

    # Got this from collect-data.py
    # Coordinates of the ROI
    x1 = int(0.5*frame.shape[1])
    y1 = 10
    x2 = frame.shape[1]-10
    y2 = int(0.5*frame.shape[1])
    # Drawing the ROI
    # The increment/decrement by 1 is to compensate for the bounding box
    cv2.rectangle(frame, (x1-1, y1-1), (x2+1, y2+1), (255,0,0), 1)
    # Extracting the ROI
    roi = frame[y1:y2, x1:x2]

    # Resizing the ROI so it can be fed to the model for prediction
    roi = cv2.resize(roi, (64, 64))
    roi = cv2.cvtColor(roi, cv2.COLOR_BGR2GRAY)
    _, test_image = cv2.threshold(roi, 120, 255, cv2.THRESH_BINARY)
    cv2.imshow("test", test_image)

    # Batch of 1; the keys below follow flow_from_directory's
    # alphabetical class-folder order
    result = loaded_model.predict(test_image.reshape(1, 64, 64, 1))
    prediction = {'FIVE': result[0][0],
                  'FOUR': result[0][1],
                  'ONE': result[0][2],
                  'THREE': result[0][3],
                  'TWO': result[0][4],
                  'ZERO': result[0][5]}
    max_key = max(prediction, key=prediction.get)
    # Draw the predicted label on the full frame, just below the ROI's top edge
    cv2.putText(frame, max_key, (x1, y1+30), cv2.FONT_HERSHEY_SIMPLEX, 1, (255, 0, 0), 2)
    print(max_key)
    cv2.imshow("Frame", frame)

    interrupt = cv2.waitKey(2)
    if interrupt & 0xFF == 27: # esc key
        break

cap.release()
cv2.destroyAllWindows()

Here we use OpenCV to open the webcam, take frames from it, and use them with our model in the following way-

  1. We take the frame and preprocess it: first we resize it and convert it to a grayscale image, just as the training images were prepared.
  2. We draw a rectangle from the ROI coordinates and show our hand inside that rectangle.
  3. Then we resize the crop and remove the background from the image, so the image is ready to give to the model.
  4. We give it to the model; the predict() method gives us the prediction for the image, and we draw that prediction on the frame above the rectangle.
  5. In this way we can get predictions using the webcam.
(Images: the hand inside the rectangle; the cropped and resized ROI; the thresholded image with the background removed.)

Now we pass this grayscale image to our model so the model can predict the gesture.

Convert the result into sound

from gtts import gTTS   # gTTS (Google Text-to-Speech) was not imported above
import os

mytext=q        # the predicted label from earlier
language='en'
my=gTTS(text=mytext,lang=language,slow=False)
my.save('signtovoice.mp3')
os.system('signtovoice.mp3')   # on Windows, opens the mp3 with the default player

CONCLUSION- Hand gesture recognition and voice conversion for deaf and mute people was successfully executed using image processing. The method takes an image as input and gives text and speech as output. The implementation of this system gives up to 90% accuracy and works successfully in most of the test cases.
