Learn2Clean: Optimizing the Sequence of Tasks for Web Data Preparation, Proc. of the The Web Conf, 2019. ,
Reinforcement learning for data preparation with active reward learning, Internet Science -6th International Conference, INSCI 2019, pp.121-132, 2019. ,
Quality Awareness for Data Management and Mining. Habilitation à Diriger des Recherches, Univ. Rennes, vol.1, 2007. ,
Data validation for machine learning, Proc. of SysML, 2019. ,
Array of things: A scientific research instrument in the public way: Platform design and early lessons learned, Proc. of the 2nd International Workshop on Science of Smart City Operations and Platforms Engineering, SCOPE '17, pp.26-33, 2017. ,
Trends in cleaning relational data: Consistency and deduplication. Foundations and Trends in Databases, vol.5, pp.281-393, 2015. ,
MIMIC-III, a freely accessible critical care database, 2016. ,
Activeclean: Interactive data cleaning for statistical modeling, Proc. VLDB Endow, vol.9, issue.12, pp.948-959, 2016. ,
Holoclean: Holistic data repairs with probabilistic inference, vol.10, pp.1190-1201, 2017. ,
Data quality: The role of empiricism, SIGMOD Record, vol.46, issue.4, pp.35-43, 2017. ,
Unit testing data with deequ, Proc. of the 2019 International Conference on Management of Data, SIGMOD '19, pp.1993-1996, 2019. ,
Automating large-scale data quality verification, vol.11, pp.1781-1794, 2018. ,
Don't be scared: use scalable automatic repairing with maximal likelihood and bounded changes, Proc. of the ACM SIGMOD, pp.553-564, 2013. ,
URL : https://hal.archives-ouvertes.fr/hal-01855779