Modin is a library for unifying the way you interact with your data
Modin can be installed with pip: pip install modin
Pandas on Ray is a library that allows you to effortlessly scale pandas by changing only a single line of code
pandas | Pandas on Ray |
# Normal pandas import
import pandas as pd
df = pd.DataFrame({'col1': [1, 2, 3],
'col2': [1.0, 2.0, 3.0]})
df.sum()
... |
# Pandas on Ray import
import modin.pandas as pd
df = pd.DataFrame({'col1': [1, 2, 3],
'col2': [1.0, 2.0, 3.0]})
df.sum()
... |
Pandas on Ray is currently for experimental use only. Requests and contributions are welcome!
SQL on Ray is currently under development. Coming Soon!
We have implemented a simple example that can be found below. Feedback welcome!
>>> import modin.sql as sql
>>>
>>> conn = sql.connect("db_name")
>>> c = conn.cursor()
>>> c.execute("CREATE TABLE example (col1, col2, column 3, col4)")
>>> c.execute("INSERT INTO example VALUES ('1', 2.0, 'A String of information', True)")
col1 col2 column 3 col4
0 1 2.0 A String of information True
>>> c.execute("INSERT INTO example VALUES ('6', 17.0, 'A String of different information', False)")
col1 col2 column 3 col4
0 1 2.0 A String of information True
1 6 17.0 A String of different information False
- Documentation
- Ask questions on our mailing list [email protected].
- Submit bug reports to our GitHub Issues Page.
- Contributions are welcome! Open a pull request.