Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Cover multilevel data? #34

Open
briatte opened this issue Apr 6, 2019 · 2 comments
Open

Cover multilevel data? #34

briatte opened this issue Apr 6, 2019 · 2 comments

Comments

@briatte
Copy link
Contributor

briatte commented Apr 6, 2019

The ESS also offers multilevel data.

Perhaps this could find its place in the package? I can try giving it a go if you like.

@cimentadaj
Copy link
Contributor

cimentadaj commented Apr 7, 2019

Hi @briatte

This sounds great. Of course, feel free to working this out. However, before you do so, let's discuss the possible implementation.

Some things I'm not entirely sure about.

  • Can the multilevel data be scraped with the current setup of essurvey?

I'm using xml2 which is straight foward for the round/year data (it's in html, I think). However for the multilevel data it seems that it's coming from a Java based website (if I'm correct). I've done this sort of scraping with RSelenium but I'm not completely sure we would want to add such a big package as a dependency. What are your thoughts on this?

  • What is the precise difference between the multilevel data and the country data? I know it includes different regions but what does the data look like? Is it like a separate data set (such as the SDDF) which simply has the unique ID for every participant and their correspoding NUTS variables? Or is it the complete dataset including the different region variables? Does this have additional variables unavailable in the other dataset?

My main concern is whether this should be a completely different step from import_country or whether it's something similar to what I've discussed in #9 in terms of doing the merging silently.

I think if we answer some of these questions, then the implementation would become feasible.

Hey! Thanks for getting involved, these ideas are really cool and we can hopefully integrate them into the package.

@briatte
Copy link
Contributor Author

briatte commented Apr 8, 2019

Hi @cimentadaj

I'll have to explore, being crap at Java-based scraping :) Perhaps there's a small headless browser that might just work, and could be added as a Suggests rather than Imports.

AFAIK, the multilevel data has only the higher-level variables, plus country-region identifiers to allow merging. No microdata in those files.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants