A python library for distributed data analytics
Install Charm++ non-SMP build
Install pyccs. You can find the instructions here - https://github.com/charmplusplus/PyCCS
Install arrow. The instructions are here - https://arrow.apache.org/install/
Install charmpandas:
git clone https://github.com/adityapb/charmpandas.git cd charmpandas/src make -j6 cd .. python setup.py install
First use the
examples/datagen.py
script to generate data for the example.:python datagen.py
Edit the
examples/Demo.ipynb
notebook to point to the correct parquet filesRun the server from the
charmpandas/src
directory:./charmrun +p4 ./server.out +balancer MetisLB +LBDebug 3 ++server ++server-port 1234
Then run the Demo notebook