-
Notifications
You must be signed in to change notification settings - Fork 33
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Feature Request: read bson data directly from dbpath (without mongod running) #4
Comments
Great idea! Currently it's not supported since we have different use cases: The mongo dump file seems just a collection of BSON objects, so if there I think you could dump as CSV file using mongoexport as a workaround. If On Tue, May 8, 2012 at 7:52 AM, Alessandro D. Gagliardi <
|
CSV is no good as we have shifting schemata and nested documents and all kinds of other madness that make CSV a mess. I imagine you're already aware of https://github.com/mongodb/mongo-hadoop but I thought I'd mention it just in case. |
yeah, they have a wonderful shard-aware input split implementation and we'd On Wednesday, May 9, 2012, Alessandro D. Gagliardi wrote:
|
Just got message from 10gen engineer that they have a hive connector which currently support static bson file: |
It would be really cool of Hive-mongo could read directly from MongoDB files rather than having to go through a mongod process (this way I could run it directly against backups without having to start mongod on them). If this is too difficult/impossible, the next best thing would be to be able to run it against the bson files produced by mongodump (though at that point, I'm already halfway to exporting the data to another format anyway).
The text was updated successfully, but these errors were encountered: