Skip to content

Jake-Jasper/DataCreator

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

48 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Data Creator

This a project to aid with the creation of artificial datasets for learning purposes - probably.

The aim is to create the utility with as few additional modules as possible, and only use the core python utilities.

An example:

image

The image has a mean of 0.58 in the x-axis, and 0.52 in the Y with a correlation of 0.02. Currently the output is saved as a csv file.

Usage:

Copy DataCreator.py, drawing.py, particle.py to you local directory or git clone https://github.com/Jake-Jasper/DataCreator

import DataCreator

Datacreator.main()

Currently, the way to create sub variables, In the example above for example, is to click the "add sub variable button" which will give the following dialog.

image

In the example, the Blue colour is male and black Female.

You can also set the scale of the data in the x and y direction.

Goals

For the upto date notes on what is currently being worked on see here

  • Make it a pip installable package
  • Improve the implementation of secondary variables.
  • Add the ability to load other datasets e.g. the iris dataset.
  • Add option to enable/disable secondary variable.

Acknowledgements

Thankyou to JimShapedCoding for reviewing the project in this video

About

A tool for creating artificial data

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages