DATAI 2024: The 1st International Workshop on Data-driven AI Guangzhou, China, August 25, 2024 |
Conference website | https://data-drivenai.github.io/2024/ |
Submission link | https://easychair.org/conferences/?conf=datai2024 |
Abstract registration deadline | June 1, 2024 |
Submission deadline | June 1, 2024 |
The 1st International Workshop on Data-driven AI (DATAI)
https://data-drivenai.github.io/2024/
The advent of artificial intelligence (AI), particularly through deep learning (DL) and large language models (LLMs), has marked a significant milestone in technological advancement, attributing to its unparalleled accuracy and generalization abilities. The rapid evolution of AI model structures to achieve superior performance underscores the dynamic progression and potential of AI technologies. However, the cornerstone of any AI's success lies not just in its algorithmic prowess but in the quality of data it is trained on. High-quality, accurate, consistent, and representative data sets are imperative for enhancing AI models' learning efficacy, thereby optimizing their generalization capabilities and reducing computational demands.
Beyond just leveraging quality data, AI technology itself plays a pivotal role in enhancing data quality through its powerful tools for data management. From cleaning, labeling, and validation to sophisticated feature engineering, AI ensures data accuracy, integrity, consistency, and reliability. This creates a symbiotic relationship between AI technology and high-quality data, highlighting their mutual dependence and the complementary nature of their interaction. It is this synergy that the 1st International Workshop on Data-driven AI (DATAI) aims to explore, delving into the latest research breakthroughs and presenting innovative techniques and methodologies at the forefront of data-driven AI.
This workshop is dedicated to fostering a comprehensive understanding of the intricate relationship between AI technologies and the data they depend on, focusing on the development of high-quality data specifically tailored for AI technologies, with a particular emphasis on large-scale models. Through engaging researchers, developers, and practitioners in rigorous discussions, the workshop seeks to explore sustained advancements, design innovations, and practical applications of data construction techniques that propel the progress of AI technologies forward.
Submission Guidelines
We welcome innovative, original, unpublished papers, including the following paper categories:
- Full papers (up to 10 pages, including bibliography) that present complete research results.
- Posters (up to 4 pages, including bibliography) that present preliminary findings with the potential to stimulate thorough deliberations.
Papers must be submitted over the EasyChair conference system using the following link: https://easychair.org/conferences/?conf=DATAI2024.
It is expected, that papers are formatted according to the VLDB formatting guidelines, which you can find here: https://vldb.org/pvldb/volumes/17/formatting
List of Topics
- Data discovery for ML
- Data cleaning & integration for ML
- Labeling quality and ML performance
- Data-efficient solutions for ML training
- LLM-based data cleaning & integration
- Multi-modal data lakes (retrieval-)augmented large langauge models
Committees
Program Committee
- Hongzhi Wang
- Nan Tang
Publication
Each accepted paper must be accompanied by at least one full registration, and one author is expected to present the paper at the workshop.
Contact
All questions about submissions should be emailed to Xiaoou Ding (dingxiaoou@hit.edu.cn)