📞 +91-7667918914 | ✉️ ijarcce@gmail.com
IJARCCE Logo
International Journal of Advanced Research in Computer and Communication Engineering A monthly Peer-reviewed & Refereed journal
ISSN Online 2278-1021ISSN Print 2319-5940Since 2012
IJARCCE adheres to the suggestive parameters outlined by the University Grants Commission (UGC) for peer-reviewed journals, upholding high standards of research quality, ethical publishing, and academic excellence.
← Back to VOLUME 12, ISSUE 4, APRIL 2023

Interactive Visual Foundation Models: Talking and Generating

Siddharth Singh Chouhan, Sujal Jadhav, Vanshita Singh, Pratik Gaikwad

DOI: 10.17148/IJARCCE.2023.124116
Abstract: The generation of images based on the content of a conversation using a visual foundation model. The aim is to develop a system that can generate images that align with the context of a conversation in a more intuitive and creative way. We propose a method that utilizes a pre-trained visual foundation model to extract features from the input text and generate an image that reflects the meaning of the conversation. The model is trained on a large-scale image dataset and a text dataset that is relevant to the target domain. Experimental results show that the proposed method outperforms existing methods in terms of image quality and content alignment with the conversation. The system has potential applications in various areas such as e-commerce, social media, and entertainment, where generating images from text can improve user engagement and experience.

Keywords: Visual Foundation model, AI, Large Language Models (LLM).

How to Cite:

[1] Siddharth Singh Chouhan, Sujal Jadhav, Vanshita Singh, Pratik Gaikwad, “Interactive Visual Foundation Models: Talking and Generating,” International Journal of Advanced Research in Computer and Communication Engineering (IJARCCE), DOI: 10.17148/IJARCCE.2023.124116