Prism: A Multi-Agent System for Real-Time Multi- Modal Interaction in Mobile and Web Applications
Skanda Suresh; Sudarshan Sridhar; Supreeth B Raj; Purmina Mittalkod; Sukhateertha V1
1
Publication Date:
2024/12/23
Abstract:
The integration of multi-agent systems in mobile
and web applications has opened new horizons for real-
time multi- modal interaction. This paper presents a
comprehensive exploration of a multi-agent framework
leveraging the Qwen2.5:3B and Gemini 1.5 Flash 8B
models to provide robust, scalable, and user-centric
solutions. Agents for diverse functionalities—such as
Cooking, Notes, Entertainment, Travel Planning, Weather,
and SecureFace—are seamlessly integrated into a
unified platform to address real-world challenges. The
framework emphasizes dynamic adaptability, cross-
platform consistency, and enhanced user experience. We
also examine the architectural considerations,
implementation challenges, and future directions for
ensuring the reliability and efficiency of such multi-modal
systems, underscoring their potential to transform digital
interactions across various domains.
Keywords:
Multi-Agent Systems, Multi-Modal Interaction, Mobile Applications, Web Applications, Artificial Intelligence, Cross-Platform Design, Real-Time Responsiveness, SecureFace Technology, User-Centric Design.
DOI:
https://doi.org/10.5281/zenodo.14546550
PDF:
https://ijirst.demo4.arinfotech.co/assets/upload/files/IJISRT24DEC857.pdf
REFERENCES
- Calvaresi, D., Dicente Cid, Y., Marinoni, M., Dragoni, A. F., Najjar, A., Schumacher, M. (2021). Real-time multi-agent systems: rationality, formal model, and empirical results. Autonomous Agents and Multi-Agent Systems, 35(12). Retrieved from https://link.springer.com/article/10.1007/s10458-020-09492-5/
- Wang, J., Xu, H., Ye, J., Yan, M., Shen, W., Zhang, J., Huang, F., Sang, J. (2024). Mobile-Agent: Autonomous Multi-Modal Mobile Device Agent with Visual Perception. arXiv preprint arXiv:2401.16158.
- Retrieved from https://arxiv.org/abs/2401.16158/
- Song, Z., Li, Y., Fang, M., Chen, Z., Shi, Z., Huang, Y., Chen,
- L. (2024). MMAC-Copilot: Multi-modal Agent Collaboration Operating System Copilot. arXiv preprint arXiv:2404.18074. Retrieved from https://arxiv.org/abs/2404.18074/
- Bosse, S. (2022). JAM: The JavaScript Agent Machine for Distributed Computing and Simulation with Reactive and Mobile Multi- agent Systems. arXiv preprint arXiv:2207.11300. Retrieved from https://arxiv.org/abs/2207.11300/
- Wang, J., Xu, H., Jia, H., Zhang, X., Yan, M., Shen, W., Zhang, J., Huang, F., Sang, J. (2024). Mobile-Agent-v2: Mobile Device Operation Assistant with Effective Navigation via Multi- Agent Collaboration. arXiv preprint arXiv:2406.01014. Retrieved from https://arxiv.org/abs/2406.01014/