Prism: A Multi-Agent System for Real-Time Multi- Modal Interaction in Mobile and Web Applications

Skanda Suresh; Sudarshan Sridhar; Supreeth B Raj; Purmina Mittalkod; Sukhateertha V1

1

Publication Date: 2024/12/23

Abstract: The integration of multi-agent systems in mobile and web applications has opened new horizons for real- time multi- modal interaction. This paper presents a comprehensive exploration of a multi-agent framework leveraging the Qwen2.5:3B and Gemini 1.5 Flash 8B models to provide robust, scalable, and user-centric solutions. Agents for diverse functionalities—such as Cooking, Notes, Entertainment, Travel Planning, Weather, and SecureFace—are seamlessly integrated into a unified platform to address real-world challenges. The framework emphasizes dynamic adaptability, cross- platform consistency, and enhanced user experience. We also examine the architectural considerations, implementation challenges, and future directions for ensuring the reliability and efficiency of such multi-modal systems, underscoring their potential to transform digital interactions across various domains.

Keywords: Multi-Agent Systems, Multi-Modal Interaction, Mobile Applications, Web Applications, Artificial Intelligence, Cross-Platform Design, Real-Time Responsiveness, SecureFace Technology, User-Centric Design.

DOI: https://doi.org/10.5281/zenodo.14546550

PDF: https://ijirst.demo4.arinfotech.co/assets/upload/files/IJISRT24DEC857.pdf

REFERENCES

  1. Calvaresi, D., Dicente Cid, Y., Marinoni, M., Dragoni, A. F., Najjar, A., Schumacher, M. (2021). Real-time multi-agent systems: rationality, formal model, and empirical results. Autonomous Agents and Multi-Agent Systems, 35(12). Retrieved from https://link.springer.com/article/10.1007/s10458-020-09492-5/
  2. Wang, J., Xu, H., Ye, J., Yan, M., Shen, W., Zhang, J., Huang, F., Sang, J. (2024). Mobile-Agent: Autonomous Multi-Modal Mobile Device Agent with Visual Perception. arXiv preprint arXiv:2401.16158.
  3. Retrieved from https://arxiv.org/abs/2401.16158/
  4. Song, Z., Li, Y., Fang, M., Chen, Z., Shi, Z., Huang, Y.,  Chen,
  5. L. (2024). MMAC-Copilot: Multi-modal Agent Collaboration Operating System Copilot. arXiv preprint arXiv:2404.18074. Retrieved from https://arxiv.org/abs/2404.18074/
  6. Bosse, S. (2022). JAM: The JavaScript Agent Machine for Distributed Computing and Simulation with Reactive and Mobile Multi- agent Systems. arXiv preprint arXiv:2207.11300. Retrieved from https://arxiv.org/abs/2207.11300/
  7. Wang, J., Xu, H., Jia, H., Zhang, X., Yan, M., Shen, W., Zhang, J., Huang, F., Sang, J. (2024). Mobile-Agent-v2: Mobile Device Operation Assistant with Effective Navigation via Multi- Agent Collaboration. arXiv preprint arXiv:2406.01014. Retrieved from https://arxiv.org/abs/2406.01014/