
Publications

2024

Croissant: A Metadata Format for ML-Ready Datasets.

Mubashara Akhtar, Omar Benjelloun, Costanza Conforti, et al.

Spotlight paper at NeurIPS 2024. (top ~3% of submissions)

Croissant: A Metadata Format for ML-Ready Datasets.

Mubashara Akhtar, Omar Benjelloun, Costanza Conforti, et al.

DEEM workshop @SIGMOD, 2024. (Best Short Paper award 🥇)

A Standardized Machine-readable Dataset Documentation Format for Responsible AI.

Nitisha Jain, Mubashara Akhtar, Joan Giner-Miguelez, Rajat Shinde, et al.

ArXiv, 2024.

TANQ: An open domain dataset of table answered questions.

Mubashara Akhtar,* Chenxi Pang,* Andreea Marzoca, Yasemin Altun, Julian Martin Eisenschlos

ArXiv, 2024.

ChartCheck: Explainable Fact-Checking over Real-World Chart Images.

Mubashara Akhtar, Nikesh Subedi, Vivek Gupta, Sahar Tahmasebi, Oana Cocarascu, Elena Simperl

Findings of the 62nd Annual Meeting of the Association for Computational Linguistics (ACL), 2024.

2023

Exploring the Numerical Reasoning Capabilities of Language Models: A Comprehensive Analysis on Tabular Data.

Mubashara Akhtar, Abhilash Shankarampeta, Vivek Gupta, Arpit Patil, Oana Cocarascu, Elena Simperl

Findings of the Annual Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023.

Multimodal Automated Fact-Checking: A Survey.

Mubashara Akhtar, Michael Schlichtkrull, Zhijiang Guo, Oana Cocarascu, Elena Simperl, Andreas Vlachos

Findings of the Annual Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023.

Reading and Reasoning over Chart Images for Evidence-based Automated Fact-Checking.

Mubashara Akhtar, Oana Cocarascu, Elena Simperl

Findings of the Annual Conference of the European Chapter of the Association for Computational Linguistics (EACL), 2023.

Proceedings of the Sixth Fact Extraction and VERification Workshop (FEVER).

Mubashara Akhtar, Rami Aly, Christos Christodoulopoulos, Oana Cocarascu, Zhijiang Guo, Arpit Mittal, Michael Schlichtkrull, James Thorne, Andreas Vlachos

Proceedings of the Sixth Fact Extraction and VERification Workshop (FEVER), 2023.

2022

PubHealthTab: A public health table-based dataset for evidence-based fact checking.

Mubashara Akhtar, Oana Cocarascu, Elena Simperl

Findings of the Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL), 2022.
