-
Notifications
You must be signed in to change notification settings - Fork 27
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
* Reworking documentation * Readme work * Starting database scripts * Working on database tables * Refining table scripts * Table creation scripts * Bulk of database loading scripts * Updating readme * Updating ISBN table population * Refining order of creating indexes * Communicative load process (#7) * Communicative load process (#9) * add command to make sure openlibrary database is being used when running the script (#6) * create a sql file to load a temp table with file information * create data loader than can take database files in chunks * add new step to load process * remove temp table after script finishes * add file loader and make it create a loadable file for temp database table * add command to auto load filenames into temp table * clean up add load script * add load scripts * rename files to remove the 2 I added * add python file to split up data into smaller chunks * add some sample files to demonstrate this version quickly * reduced the size of sample files * add section for moving the files to the unprocessed folder * incorporate the changes I made into the readme * update notes * update one of the code examples * make loader mark files that have been loaded instead of deleting them from the loader * rephrased message * add time stamps and notices * refine chunk notes * Adjusting bulk loader to use /copy command * Minor readme update --------- Co-authored-by: Chloe-Meinshausen <[email protected]>
- Loading branch information
1 parent
2426fc3
commit 3ad8df5
Showing
28 changed files
with
18,388 additions
and
569 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,4 +1,11 @@ | ||
*.csv | ||
*.csv | ||
*.txt | ||
*.xlsx | ||
*.xlsx | ||
|
||
data/unprocessed/ol_dump_works_*.txt.gz | ||
data/unprocessed/ol_dump_authors_*.txt.gz | ||
data/unprocessed/ol_dump_editions_*.txt.gz | ||
.vscode/settings.json | ||
create_db.bat | ||
copy_commands.sql |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1 @@ | ||
\copy editions from './data/processed/ol_dump_editions.txt' delimiter E'\t' quote '|' csv; |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1 @@ | ||
psql --set=sslmode=require -f openlibrary-db.sql -h localhost -p 5432 -U username postgres |
Empty file.
Empty file.
Oops, something went wrong.