Enabling Fairness Across Multi-modal and Multi-agent Applications

EasyChair Preprint 15978

3 pages•Date: July 3, 2025

Abstract

Modern multi-agent systems leverage a diverse set of AI models, including Large Language Models (LLMs), Vision-Language Models (VLMs), etc., to perform complex multi-modal tasks. However, fair model serving in such heterogeneous environments remains a significant challenge. Existing scheduling methods primarily focus on single-modality fairness, failing to account for varying computational costs across different models and the hierarchical structure of multi-agent applications. In this work, we introduce Hierarchical Multi-Modality Fair Scheduling (HMFS), a novel approach that ensures fairness across applications, agents, and tasks while maintaining high resource utilization.

To enable cross-modality fairness, we propose a Unified Token Representation, which normalizes token costs across different transformer-based models by leveraging latent space embedding dimensions and computational intensity factors. Using this unified metric, we design a Hierarchical Multi-Modality Fair Scheduling algorithm that dynamically prioritizes requests at both application and agent levels, ensuring equitable access to compute resources.

Keyphrases: Edge-Cloud Communication, Scheduling, fairness, multi-agent, multi-modality, transformer

Links:

https://easychair.org/publications/preprint/Xw9k

BibTeX entry

BibTeX does not have the right entry for preprints. This is a hack for producing the correct reference:

@booklet{EasyChair:15978,
  author    = {Rui Zhang and Liting Hu},
  title     = {Enabling Fairness Across Multi-modal and Multi-agent Applications},
  howpublished = {EasyChair Preprint 15978},
  year      = {EasyChair, 2025}}

Download PDF Open PDF in browser