Elena' s AI Blog

genAI

Multimodal AI


Multimodal AI is rapidly evolving, pushing the boundaries of what machines can understand and achieve by combining information from multiple modalities like text, images, audio, and video. This post explores the core techniques of realising multimodal AI, existing systems and related research.

Generative AI vs. Large Language Models


Generative AI and Large Language Models (LLMs) are both important concepts in artificial intelligence, but they are not the same. Generative AI refers to different models that can create various types of content, such as text, images, and music. LLMs are a specific type of generative AI that focuses on understanding and producing human language. This post explains their differences, highlights key techniques like Transformers and GANs, and mentions important open-source projects.

Combining Retrieval and Generation in RAG


Let's talk about retrieval-augmented generation (RAG) - an advanced AI technique that enhances generative models with retrieval mechanisms. We'll cover RAG architecture, applications, benefits, challenges, best practices, and current research opportunities.