DigiVision

DigiVision is a deep learning based application which is entitled to help the visually impaired people. This application acts as a virtual eye for the visually-impaired people.The application automatically generates the textual description of what's happening in front of the camera and conveys it to a person through proper audio.

It is also capable of recognizing faces and it tells user whether a known person is present in front of him or not. If it is not a known face for the user then it gives the user a choice to set it as a known person for all future identification.

Requirements

Tensorflow (>1.9)
Keras
OpenCV
Python 3.5+
gTTS
pygame
pymongo

Datasets

MS COCO 2017 for Image Processing and Captioning.

Dataset for face Recognition is manually collected.

Features/Functions

Setup and Instructions

Install all the required frameworks, libraries and dependecies as mentioned in Requirements above.
Download the COCO dataset if not available, in order to train the model

Or run:

python download.py

Create your own MongoDB Cluster and replace MONGO_URI in line 16 of f_part.py with your own Mongo AccessID.
Get the source code on your pc via git and navigate inside the folder through your terminal.

  git clone https://github.com/altruistcoder/Digivision

Run the project using:

run.py (for gTTS audio and adding names through Canvas/ python gtk)
digivision.py (for Single face detection along with new face addition through python gtk)
digivision_mul.py (for Multiface detection along with all Input/Outputs through Audio)

python <desired_file_name>.py

It will take around 90 minutes to process all images and approx 5 minutes to process Validation images. Takes around 22 minutes for a single epoch during training on batch size of 256 on NVIDIA GTX 960M. Don't need to re-train data on every single run. Once trained, weights gets loaded automatically.

Demo

Click here for demo of run.py file.

Name		Name	Last commit message	Last commit date
Latest commit History 15 Commits
IC_logs		IC_logs
detection		detection
facenet		facenet
images		images
models/20180204-160909		models/20180204-160909
DetectionToolKit.py		DetectionToolKit.py
FaceToolKit.py		FaceToolKit.py
IC_checkpoints.keras		IC_checkpoints.keras
README.md		README.md
cache.py		cache.py
caption_tune.py		caption_tune.py
coco.py		coco.py
demo.mp4		demo.mp4
digivision.py		digivision.py
digivision_mul.py		digivision_mul.py
download.py		download.py
f_part.py		f_part.py
faceadd.py		faceadd.py
gensound.py		gensound.py
gensoundgtts.py		gensoundgtts.py
haarcascade_frontalface_default.xml		haarcascade_frontalface_default.xml
image-cap.ipynb		image-cap.ipynb
p_part.py		p_part.py
run.py		run.py

altruistcoder/Digivision

Folders and files

Latest commit

History

Repository files navigation

DigiVision

Requirements

Datasets

Features/Functions

Setup and Instructions

Demo

About

Topics

Resources

Stars

Watchers

Forks

Languages