Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

feat: introduce nyc weather features to feature-engineering on fabric single-tech sample #657

Draft
wants to merge 1 commit into
base: main
Choose a base branch
from

Conversation

thurstonchen
Copy link
Contributor

@thurstonchen thurstonchen commented Dec 8, 2023

Type of PR

  • Documentation changes
  • Code changes

Purpose

Introduce NYC weather dataset.

  • to make our model training scenario a bit closer to the practical use case :- )
  • switch to LightGBMRegressor model to have higher model performance metrics.
  • now we create two feature sets, nyctaxi and nycweather, the latter could be reused by other model training requirement, which is a good showcase on why we need feature store.

Besides introducing NYC weather data of year 2022, we also add Jan. 2023 taxi trip and weather data to do the batch inferencing.
Hi @promisinganuj, if you agree this PR, we'll share those new data files to you via Teams, thanks! :- )

Does this introduce a breaking change? If yes, details on what can break

NO

Author pre-publish checklist

  • Added test to prove my fix is effective or new feature works
  • No PII in logs
  • Made corresponding changes to the documentation

Validation steps

  • Put new NYC weather files to the public storage account.
  • Run the Fabric data pipeline again.
  • Run model training and inferencing notebooks, now the latter notebook also requires attaching the same lakehouse used by other notebooks.

Issues Closed or Referenced

N/A

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

1 participant