Neural Network Charity Analysis

Analysis Overview

The purpose of this project is to use deep-learning neural networks with the TensorFlow platform in Python, to analyze and classify the success of charitable donations.
We use the following methods for the analysis:

preprocessing the data for the neural network model,
compile, train and evaluate the model,
optimize the model.

Resources

Data Source: charity_data.csv
Software: Python 3.7.7, Anaconda Navigator 1.9.12, Conda 4.8.4, Jupyter Notebook 6.0.3

Results

Data Preprocessing

The columns EIN and NAME are identification information and have been removed from the input data.
The column IS_SUCCESSFUL contains binary data refering to weither or not the charity donation was used effectively. This variable is then considered as the target for our deep learning neural network.
The following columns APPLICATION_TYPE, AFFILIATION, CLASSIFICATION, USE_CASE, ORGANIZATION, STATUS, INCOME_AMT, SPECIAL_CONSIDERATIONS, ASK_AMT are the features for our model.
Encoding of the categorical variables, spliting into training and testing datasets and standardization have been applied to the features.

Compiling, Training, and Evaluating the Model

This deep-learning neural network model is made of two hidden layers with 80 and 30 neurons respectively.
The input data has 43 features and 25,724 samples.
The output layer is made of a unique neuron as it is a binary classification.
To speed up the training process, we are using the activation function ReLU for the hidden layers. As our output is a binary classification, Sigmoid is used on the output layer.
For the compilation, the optimizer is adam and the loss function is binary_crossentropy.
The model accuracy is under 75%. This is not a satisfying performance to help predict the outcome of the charity donations.
To increase the performance of the model, we applied bucketing to the feature ASK_AMT and organized the different values by intervals.
We increased the number of neurons on one of the hidden layers, then we used a model with three hidden layers.
We also tried a different activation function (tanh) but none of these steps helped improve the model's performance.

Summary

The deep learning neural network model did not reach the target of 75% accuracy. Considering that this target level is pretty average we could say that the model is not outperforming.
Since we are in a binary classification situation, we could use a supervised machine learning model such as the Random Forest Classifier to combine a multitude of decision trees to generate a classified output and evaluate its performance against our deep learning model.

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
Resources		Resources
checkpoints		checkpoints
AlphabetSoupCharity.h5		AlphabetSoupCharity.h5
AlphabetSoupCharity.ipynb		AlphabetSoupCharity.ipynb
AlphabetSoupCharity_Optimization.h5		AlphabetSoupCharity_Optimization.h5
AlphabetSoupCharity_Optimization.ipynb		AlphabetSoupCharity_Optimization.ipynb
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Resources

Resources

checkpoints

checkpoints

AlphabetSoupCharity.h5

AlphabetSoupCharity.h5

AlphabetSoupCharity.ipynb

AlphabetSoupCharity.ipynb

AlphabetSoupCharity_Optimization.h5

AlphabetSoupCharity_Optimization.h5

AlphabetSoupCharity_Optimization.ipynb

AlphabetSoupCharity_Optimization.ipynb

README.md

README.md

Repository files navigation

Neural Network Charity Analysis

Analysis Overview

Resources

Results

Data Preprocessing

Compiling, Training, and Evaluating the Model

Summary

About

Releases

Packages

Languages

cedoula/Neural_Network_Charity_Analysis

Folders and files

Latest commit

History

Repository files navigation

Neural Network Charity Analysis

Analysis Overview

Resources

Results

Data Preprocessing

Compiling, Training, and Evaluating the Model

Summary

About

Topics

Resources

Stars

Watchers

Forks

Languages