Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

package review #1

Open
margauxgo opened this issue Apr 2, 2024 · 5 comments
Open

package review #1

margauxgo opened this issue Apr 2, 2024 · 5 comments
Assignees

Comments

@margauxgo
Copy link
Contributor

Hi @larnsce @mianzg could one of you please review this package?

@larnsce
Copy link

larnsce commented Apr 2, 2024

Thanks, @margauxgo. I will take that over.

@larnsce
Copy link

larnsce commented Apr 3, 2024

I already had a brief look. You have been very quick with this package, well done.

One item I noticed, which I suggest you rework is your data import and then the separation of columns in your `data_processing.R:

data_in <- read_csv("data-raw/PA_census_data.csv") |>

You have a semicolon ; separated file here. You can use read_csv2() or read_delim(delim = ";") to read the file properly. Then you don't need to separate the columns.

@larnsce larnsce self-assigned this Apr 3, 2024
@margauxgo
Copy link
Contributor Author

Thanks @larnsce, I just modified that.

Concerning the data format, I am still debating wether to pivot the dataset to long format to group the variables piped_water, well_spring_water, stored_rainwater and other_water_source into a variable named water_supply, the variables bathroom_sewerage, bathroom_septic_tank and bathroom_cesspit into a variable named sanitation and the variables bathroom_ditch, bathroom_waterbodies and bathroom_other into a variable named wastewater, or to keep it in wide format.

For now, I illustrated how to pivot it for simpler analysis in the example and kept it in wide format otherwise.

What do you think?

@mianzg
Copy link

mianzg commented Apr 5, 2024

Thanks @larnsce, I just modified that.

Concerning the data format, I am still debating wether to pivot the dataset to long format to group the variables piped_water, well_spring_water, stored_rainwater and other_water_source into a variable named water_supply, the variables bathroom_sewerage, bathroom_septic_tank and bathroom_cesspit into a variable named sanitation and the variables bathroom_ditch, bathroom_waterbodies and bathroom_other into a variable named wastewater, or to keep it in wide format.

For now, I illustrated how to pivot it for simpler analysis in the example and kept it in wide format otherwise.

What do you think?

I also had a similar debate: openwashdata/whatawaste#2

I would for now keep it in wide format for now.

@mianzg
Copy link

mianzg commented Apr 5, 2024

  1. Please refine the 1st figure with more information, e.g. title, x/y-axis label
  2. What is the references directory?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants