Publications
2024
Croissant: A Metadata Format for ML-Ready Datasets.
Mubashara Akhtar, Omar Benjelloun, Costanza Conforti, et al.
Spotlight paper at Neurips2024. (top ~3% of submissions)
Croissant: A Metadata Format for ML-Ready Datasets.
Mubashara Akhtar, Omar Benjelloun, Costanza Conforti, et al.
DEEM workshop @SIGMOD, 2024. (Best Short Paper award 🥇)
A Standardized Machine-readable Dataset Documentation Format for Responsible AI.
Nitisha Jain, Mubashara Akhtar, Joan Giner-Miguelez, Rajat Shinde, et al.
ArXiv, 2024.
TANQ: An open domain dataset of table answered questions.
Mubashara Akhtar,* Chenxi Pang,* Andreea Marzoca, Yasemin Altun, Julian Martin Eisenschlos
ArXiv, 2024.
ChartCheck: Explainable Fact-Checking over Real-World Chart Images.
Mubashara Akhtar, Nikesh Subedi, Vivek Gupta, Sahar Tahmasebi, Oana Cocarascu, Elena Simperl
Findings of the 62nd Annual Meeting of the Association for Computational Linguistics (ACL), 2024.
2023
Exploring the Numerical Reasoning Capabilities of Language Models: A Comprehensive Analysis on Tabular Data.
Mubashara Akhtar, Abhilash Shankarampeta, Vivek Gupta, Arpit Patil, Oana Cocarascu, Elena Simperl
Findings of the Annual Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023.
Multimodal Automated Fact-Checking: A Survey.
Mubashara Akhtar, Michael Schlichtkrull, Zhijiang Guo, Oana Cocarascu, Elena Simperl, Andreas Vlachos
Findings of the Annual Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023.
Reading and Reasoning over Chart Images for Evidence-based Automated Fact-Checking.
Mubashara Akhtar, Oana Cocarascu, Elena Simperl
Findings of the Annual Conference of the European Chapter of the Association for Computational Linguistics (EACL), 2023.
Proceedings of the Sixth Fact Extraction and VERification Workshop (FEVER).
Mubashara Akhtar, Rami Aly, Christos Christodoulopoulos, Oana Cocarascu, Zhijiang Guo, Arpit Mittal, Michael Schlichtkrull, James Thorne, Andreas Vlachos
Proceedings of the Sixth Fact Extraction and VERification Workshop (FEVER), 2023.
2022
PubHealthTab: A public health table-based dataset for evidence-based fact checking.
Mubashara Akhtar, Oana Cocarascu, Elena Simperl
Findings of Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL), 2022.