Large language models (LLMs) such as ChatGPT and Gemini were originally designed to work with text only. Today, they have evolved into multimodal systems that can work with many types of information at once, understanding and generating images, audio, speech and music as well as text.