Feature-engine

Welcome to Feature-engine

My name is Sole. I am the lead instructor at Train in Data and the maintainer of Feature-engine. Together with a group of passionate data scientists and software developers, I maintain and expand this Python library for feature engineering and feature selection for machine learning, along with its documentation, so you can better prepare your data and build more powerful and interpretable machine learning models.

At Train in Data, we create intermediate and advanced online courses on machine learning, data science and AI software development. They help you boost your data science skills and use this and other popular Python libraries to build faster and more robust machine learning pipelines.

We talk, blog and participate in podcasts about machine learning, software development and open source, so you'll hear about us a lot in the digital sphere ;)

Online Courses

Check out the courses that we teach.

Courses | What you will learn
Feature engineering for machine learning | Learn to create new features, impute missing data, encode categorical variables, transform and discretize features and much more.
Feature selection for machine learning | Learn to select features using wrapper, filter, embedded and hybrid methods, and build simpler and more reliable models.
Hyperparameter optimization for machine learning | Learn about grid and random search, Bayesian optimization, multi-fidelity models, Optuna, Hyperopt, Scikit-Optimize and more.
Machine learning with imbalanced data | Learn about under- and over-sampling, ensemble and cost-sensitive methods, and improve the performance of models trained on imbalanced data.
Feature engineering for time series forecasting | Learn to create lag and window features, impute data in time series, encode categorical variables and much more, specifically for forecasting.
Forecasting with Machine Learning | Learn to perform time series forecasting with machine learning models like linear regression, random forests and XGBoost.
Machine Learning Interpretability | Learn to interpret the predictions of your white box and black box machine learning models.

Books

Find out more about machine learning through our books, and have the code at your fingertips.

Books | Summary
Python Feature Engineering Cookbook, Second Edition | Over 70 Python recipes to implement feature engineering in tabular, transactional, time series and text data.
Feature Selection in Machine Learning with Python | Over 20 methods to select the most predictive features and build simpler, faster, and more reliable machine learning models.

Open-source

The open-source libraries I contribute to.

Library | About | Sponsor us
Feature-engine | Multiple transformers for missing data imputation, categorical encoding, variable transformation and discretization, feature creation and more. | Sponsor us

Our contributors

Get to know who's behind the scenes at Feature-engine.

Instructor | Role
Soledad Galli | Maintainer

Follow us

Follow us on social media or through our website to stay up to date with our latest news.

Media | Summary
Train in Data | Enroll in our courses and find our books.
LinkedIn | I talk about data science, machine learning and how to become a data scientist.
Twitter | I tweet about data science, machine learning and how to become a data scientist.
Facebook | I talk about data science, machine learning and how to become a data scientist.
Instagram | I post about data science, machine learning and how to become a data scientist.
Newsletter | I talk about data science, machine learning and how to become a data scientist.
Blog | I write about data science, machine learning, feature engineering and selection and more.

Sponsor Feature-engine

Help me gather a team of regular software developers and data scientists to fast-track the development of Feature-engine's functionality and documentation. Join our sponsors through GitHub Sponsors or Buy Me a Coffee, and help us democratize data science and machine learning tools and knowledge!


We hope to see you around.

Popular repositories

  1. feature_engine (Public)

    Feature engineering package with sklearn-like functionality

    Python, 1.8k stars, 302 forks

  2. feature-engine-examples (Public)

    Jupyter Notebook, 27 stars, 18 forks

  3. feature_engine-feedstock (Public)

    Forked from conda-forge/feature_engine-feedstock

    A conda-smithy repository for feature_engine.

  4. .github (Public)

    About Feature-engine
