NADEEF (نظيف, Arabic for "clean") is a generalized data cleaning system developed by the data analytics group at Qatar Computing Research Institute.
Launch NADEEF using the following command:
Usage: nadeef.sh [OPTIONS]
Options are:
console : start the NADEEF console.
dashboard : start the NADEEF dashboard.
A demo page is available here.
More detailed instructions can be found in the User Guide.
As a commodity data cleaning system, NADEEF aims to be extensible, generic, and easy to deploy.
Most existing data cleaning methods and systems, whether in industry or academia, employ different types of data quality rules in isolation. Each time a new data cleaning problem arises or a new type of rule is considered, practitioners either build a new system from scratch or go through a painful process of customizing an existing tool. To achieve generality and extensibility, the NADEEF team designed a new data cleaning system that separates a programming interface, through which users specify data quality rules, from a core that applies those rules to the data.
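The separation between a programming interface and a core can be illustrated with a minimal sketch. This is not NADEEF's actual API: the names `Rule`, `Violation`, `FunctionalDependency`, `NotNull`, and `run_core` are hypothetical and chosen only for illustration. The point is that the core depends only on the rule interface, so a new type of rule can be added without changing the core.

```python
from abc import ABC, abstractmethod
from dataclasses import dataclass
from typing import Dict, List, Tuple

Row = Dict[str, str]


@dataclass(frozen=True)
class Violation:
    """Hypothetical record of which rows and columns break a rule."""
    rule_name: str
    row_ids: Tuple[int, ...]
    columns: Tuple[str, ...]


class Rule(ABC):
    """Hypothetical programming interface: the only thing rule authors implement."""
    name: str

    @abstractmethod
    def detect(self, rows: List[Row]) -> List[Violation]:
        ...


class FunctionalDependency(Rule):
    """Rows that agree on `lhs` must also agree on `rhs`."""

    def __init__(self, lhs: str, rhs: str) -> None:
        self.name = f"FD({lhs} -> {rhs})"
        self.lhs, self.rhs = lhs, rhs

    def detect(self, rows: List[Row]) -> List[Violation]:
        violations = []
        first_seen = {}  # lhs value -> (row index, rhs value) of its first occurrence
        for i, row in enumerate(rows):
            key = row[self.lhs]
            if key not in first_seen:
                first_seen[key] = (i, row[self.rhs])
            elif first_seen[key][1] != row[self.rhs]:
                violations.append(
                    Violation(self.name, (first_seen[key][0], i), (self.rhs,))
                )
        return violations


class NotNull(Rule):
    """A second, unrelated rule type that plugs into the same core."""

    def __init__(self, column: str) -> None:
        self.name = f"NotNull({column})"
        self.column = column

    def detect(self, rows: List[Row]) -> List[Violation]:
        return [
            Violation(self.name, (i,), (self.column,))
            for i, row in enumerate(rows)
            if not row.get(self.column)
        ]


def run_core(rows: List[Row], rules: List[Rule]) -> List[Violation]:
    """Hypothetical core: rule-agnostic, it only calls the Rule interface."""
    found: List[Violation] = []
    for rule in rules:
        found.extend(rule.detect(rows))
    return found


rows = [
    {"zip": "10001", "city": "New York"},
    {"zip": "10001", "city": "NYC"},
    {"zip": "60601", "city": ""},
]
violations = run_core(rows, [FunctionalDependency("zip", "city"), NotNull("city")])
for v in violations:
    print(v)
```

In this sketch, supporting another kind of data quality rule means writing one more `Rule` subclass; `run_core` stays unchanged, which is the kind of extensibility the design aims for.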
http://da.qcri.org/publications
NADEEF is released under the terms of the MIT License.
To report issues or request enhancements, please use the issue pages on GitHub, or contact [email protected]. We will do our best to help you sort it out.