Run Hydrator for multiple CSV files #94

grrigore · 2021-05-31T10:06:12Z

I have a folder with 100 .csv files of different sizes. Is there a way to hydrate those files without manually adding each file into the Hydrator app?

edsu · 2021-05-31T11:33:53Z

Do your CSV files only contain a column of numbers? Or do they include other columns as well? Also what operating system are you using?

grrigore · 2021-05-31T11:47:30Z

My .csv files contain tweet IDs. I am using Ubuntu.

edsu · 2021-05-31T11:48:39Z

Do the CSV files have a column header? Or are the files just lines of numbers?

grrigore · 2021-05-31T11:50:45Z

This is a preview from a .csv file:
ID, TextBlob score (I can remove this)

1385449730818285569,0.125
1385449730981842946,0
1385449730981957635,-0.0062500000000000056
1385449730948288516,0.26666666666666666
1385449731132989440,-0.016666666666666677
1385449731086708736,0
1385449731267178496,0.3

I am using data from here

edsu · 2021-05-31T12:00:51Z

You will want to ensure that your input file is a text file where each line contains a tweet id and nothing else. So the TextBlob score will need to be removed, as will any column headers.

I don't actually see data with that format in the dataset you linked to. If you are working with a very large dataset (hundreds of millions of tweets) you might want to use twarc instead of Hydrator.
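For a folder of files like the preview above, the cleanup could be scripted rather than done by hand. The following is a minimal sketch, not part of Hydrator or twarc: it assumes the tweet id is always the first column, and the function names `extract_ids` and `convert_folder` as well as the folder names in the usage note are hypothetical.

```python
import csv
from pathlib import Path


def extract_ids(csv_path, out_path):
    """Write the first column of csv_path to out_path, one tweet id per line."""
    count = 0
    with open(csv_path, newline="", encoding="utf-8") as src, \
            open(out_path, "w", encoding="utf-8") as dst:
        for row in csv.reader(src):
            if not row:
                continue  # blank line
            tweet_id = row[0].strip()
            # Keep only all-digit values; this drops the header row
            # and ignores any extra columns such as the TextBlob score.
            if tweet_id.isdigit():
                dst.write(tweet_id + "\n")
                count += 1
    return count


def convert_folder(in_dir, out_dir):
    """Convert every .csv in in_dir to an id-only .txt file in out_dir."""
    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)
    totals = {}
    for csv_file in sorted(Path(in_dir).glob("*.csv")):
        target = out / (csv_file.stem + ".txt")
        totals[csv_file.name] = extract_ids(csv_file, target)
    return totals
```

Calling `convert_folder("csv_files", "id_files")` would write one `.txt` file per input CSV and return the number of ids kept for each file, which makes it easy to spot a file where the id column was not where the script expected it.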

edsu · 2021-05-31T12:24:49Z

Sorry, I should have left this open to see if you have any more questions.

grrigore · 2021-05-31T12:30:51Z

No problem. 🙂 I think twarc is a better tool for what I want. Thank you.

edsu closed this as completed May 31, 2021

edsu reopened this May 31, 2021
