LIBRISTO
LIBROAMANTO
mandatory
Become part of a community of book lovers from all over the world and get access to a whole bunch of benefits. Create an account for free
0
Austrian Post 5.49 DPD courier 3.99 DPD point 2.99

Ultimate Multimodal Transformer Models

Language EnglishEnglish
Book Paperback
Book Ultimate Multimodal Transformer Models Dr. S. Mahesh Anand
Libristo code: 52743511
Publishers Orange Education Pvt Ltd, May 2026
One Architecture. Infinite Intelligence.Book DescriptionTransformer architectures have become the un... Full description
? points 112 b New New
45.59 VAT included
Expected in stock Expected 05. 06. 2026
Austria Delivery to Austria

Up to 30 days for returns

One Architecture. Infinite Intelligence.

Book Description

Transformer architectures have become the unified foundation of modern AI - powering language models, computer vision systems, and multimodal applications that process text, images, and speech together. Ultimate Multimodal Transformer Models provides a comprehensive, hands-on guide to mastering every major Transformer variant, from foundational encoder-decoder architectures to cutting-edge vision-language models and production GenAI systems.

You begin with the core building blocks of Transformer architecture and text data preparation, then progressively advance through encoder-only models, generative LLMs, RAG, Agentic workflows, and efficient fine-tuning using PEFT, LoRA, and QLoRA. The book then transitions into Vision Transformers, covering ViT, DETR, SAM, CLIP, and Flamingo, before bringing everything together in real-world multimodal applications combining text, vision, and speech using PyTorch and Hugging Face throughout.

What you will learn

● Build and deploy Transformer models for text, vision, and multimodal AI tasks.

● Fine-tune large language models efficiently using PEFT, LoRA, and QLoRA techniques.

● Develop production-ready GenAI applications using RAG pipelines and Agentic AI workflows.

● Apply LLMs to real-world NLP tasks including summarization, question answering, and classification.

Table of Contents

1. The Rise of Transformer Models in Sequence Learning

2. Text Data Preparation for Transformer Models

3. Building Blocks of Transformer Architecture

4. Encoder-only Transformer Configurations

5. Generative Transformers and LLM Architectures

6. Customizing LLMs Using Retrieval-Augmented Generation (RAG)

7. Efficient Fine-Tuning Techniques with PEFT and LoRA

8. Orchestrating LLMs with Tools and Memory

9. Introduction to Vision Transformer Models

10. Vision Transformers for Image Classification

11. Object Detection and Segmentation with Transformer Architectures

12. Vision-Language Models and Multimodal LLMs

13. Real-World Multimodal GenAI Applications

14. Image Generation with Vision Transformers

15. The Future of GenAI with Transformers

       Index

Actress & Polyglot
EWA KASP for
Play video
Ewa Kasp
Libristo has the largest selection of foreign-language books. That’s why I buy my books there.

About the book

Full name Ultimate Multimodal Transformer Models
Language English
Binding Book - Paperback
Date of issue 2026
Number of pages 352
EAN 9788169646161
ISBN 8169646162
Libristo code 52743511
Weight 818
Dimensions 216 x 280 x 19
Give this book today
It's easy
1 Add to cart and choose Deliver as present at the checkout 2 We'll send you a voucher 3 The book will arrive at the recipient's address

Login

Log in to your account. Don't have a Libristo account? Create one now!

 
mandatory
mandatory

Don’t have an account? Discover the benefits of having a Libristo account!

With a Libristo account, you'll have everything under control.

Create a Libristo account
Book advisor Libroamiko
Hi, I'm Libroamiko, can I help?