CleanML: A study for evaluating the impact of data cleaning on ml classification tasks P Li, X Rao, J Blase, Y Zhang, X Chu, C Zhang 2021 IEEE 37th International Conference on Data Engineering (ICDE), 13-24, 2021 | 132* | 2021 |
Nearest neighbor classifiers over incomplete information: From certain answers to certain predictions B Karlaš, P Li, R Wu, NM Gürel, X Chu, W Wu, C Zhang arXiv preprint arXiv:2005.05117, 2020 | 43 | 2020 |
Auto-fuzzyjoin: Auto-program fuzzy similarity joins without labeled examples P Li, X Cheng, X Chu, Y He, S Chaudhuri Proceedings of the 2021 international conference on management of data, 1064 …, 2021 | 28 | 2021 |
Diffprep: Differentiable data preprocessing pipeline search for learning over tabular data P Li, Z Chen, X Chu, K Rong Proceedings of the ACM on Management of Data 1 (2), 1-26, 2023 | 14 | 2023 |
Table-gpt: Table-tuned gpt for diverse table tasks P Li, Y He, D Yashar, W Cui, S Ge, H Zhang, DR Fainman, D Zhang, ... arXiv preprint arXiv:2310.09263, 2023 | 12 | 2023 |
Demonstration of panda: a weakly supervised entity matching system R Wu, P Sakala, P Li, X Chu, Y He arXiv preprint arXiv:2106.10821, 2021 | 8 | 2021 |
A Model-Agnostic approach for learning with noisy labels of arbitrary distributions S Hao, P Li, R Wu, X Chu 2022 IEEE 38th International Conference on Data Engineering (ICDE), 1219-1231, 2022 | 3 | 2022 |
Auto-tables: Synthesizing multi-step transformations to relationalize tables without using examples P Li, Y He, C Yan, Y Wang, S Chauduri arXiv preprint arXiv:2307.14565, 2023 | 2 | 2023 |
Discovering Process-Based Drivers for Case-Level Outcome Explanation P Li, H Zhang, X Chu, A Seeliger, C Yu International Conference on Process Mining, 165-178, 2023 | | 2023 |
Experiences and Lessons Learned from the SIGMOD Entity Resolution Programming Contests A De Angelis, M Mazzei, F Piai, P Merialdo, G Simonini, L Zecchini, ... ACM SIGMOD Record 52 (2), 43-47, 2023 | | 2023 |