AI Engineering: Building Multi-Modal Intelligent Systems with Vision, Language, and Audio From LLM Fine-Tuning to Voice Agents, AR Interfaces, and Real-World Deployment - Softcover

Ara, Husn

9798296089038: AI Engineering: Building Multi-Modal Intelligent Systems with Vision, Language, and Audio From LLM Fine-Tuning to Voice Agents, AR Interfaces, and Real-World Deployment

Softcover

ISBN 13: 9798296089038

Verlag: Independently published, 2025

Alle Exemplare dieser ISBN-Ausgabe

2 Gebraucht

Von EUR 36,36

6 Neu

Von EUR 31,89

AI Engineering: Building Multi-Modal Intelligent Systems with Vision, Language, and Audio
From LLM Fine-Tuning to Voice Agents, AR Interfaces, and Real-World Deployment

Unlock the future of artificial intelligence with practical, production-ready multi-modal engineering.

This hands-on guide is built for developers, researchers, and AI professionals who want to go beyond chatbots and dive into building intelligent systems that understand text, images, audio, and human intent — all in one pipeline.

Whether you're fine-tuning large language models (LLMs) or creating voice-driven AR interfaces, this book walks you through the real engineering decisions, tools, and architectures needed to bring multi-modal AI to life.

What You'll Learn:

Fine-tuning Large Language Models (LLMs): Train and adapt models like GPT-2, LLaMA, and Mistral for custom tasks using Hugging Face, LoRA, QLoRA, and PEFT.
Voice Interfaces: Combine Whisper, LLMs, and Bark/Tortoise TTS to build interactive speech-driven assistants.
Computer Vision + Language: Use models like BLIP, CLIP, and DETR to connect what systems see to what they say and understand.
Instruction Tuning & Hyperparameter Optimization: Build smarter, domain-specific models with efficient training workflows.
Multi-Modal Pipelines: Chain audio, image, and text inputs for question answering, summarization, tutoring, and AR/robotic control.
Real-Time Interfaces: Deploy intelligent agents using FastAPI, Streamlit, Gradio, Docker, and Hugging Face Spaces.
Edge & Offline Deployment: Optimize models with ONNX, quantization (4-bit, 8-bit), and TensorRT for low-latency inference on CPU/GPU.

Use Cases Covered:

Smart document summarizers with OCR + TTS
Voice-enabled image assistants
Emotion-aware agents
Virtual tutors
AR-enhanced AI interfaces
Robotic perception + control from voice/image input
Secure, multilingual, and privacy-conscious AI systems

Tools & Frameworks Inside:

Python, PyTorch, Hugging Face Transformers
LangChain, OpenCV, Whisper, TTS, BLIP
ROS, Unity (AR/VR), Gradio, Streamlit
Docker, FastAPI, gRPC, TorchServe

Built for engineers. Written with depth. Designed for real-world impact.

If you're ready to build intelligent multi-modal agents that understand the world like humans do — across speech, vision, and language — this book gives you the complete roadmap.

Perfect for:
Machine learning engineers, data scientists, AI product developers, researchers, robotics engineers, and anyone building cutting-edge AI systems.

Die Inhaltsangabe kann sich auf eine andere Ausgabe dieses Titels beziehen.

Verlag: Independently published
Erscheinungsdatum: 2025
Sprache: Englisch
ISBN 13: 9798296089038
Einband: Taschenbuch
Anzahl der Seiten: 294
Kontakt zum Hersteller: Manufactured by Amazon on behalf of the author
https://www.amazon.de/hz/contact-us

c/o Amazon Media EU S.�.r.l., 38 Avenue John F. Kennedy
Luxembourg
L-1855
Luxemburg

Gebraucht kaufen

Zustand: Wie neu

Unread book in perfect condition...

Diesen Artikel anzeigen

EUR 36,36

EUR 2,31 Versand
Versand innerhalb von USA

In den Warenkorb

Neu kaufen

Diesen Artikel anzeigen

EUR 31,89

EUR 2,31 Versand
Versand innerhalb von USA

In den Warenkorb

Suchergebnisse f�r AI Engineering: Building Multi-Modal Intelligent Systems...

Beispielbild f�r diese ISBN

AI Engineering: Building Multi-Modal Intelligent Systems with Vision, Language, and Audio From LLM Fine-Tuning to Voice Agents, AR Interfaces, and Rea

Ara, Husn

Verlag: Independently published, 2025

ISBN 13: 9798296089038

Neu Softcover

Anbieter: GreatBookPrices, Columbia, MD, USA

Verk�uferbewertung 5 von 5 Sternen

Zustand: New. Bestandsnummer des Verk�ufers 51514388-n

Verk�ufer kontaktieren

Neu kaufen

EUR 31,89

EUR 2,31 Versand
Versand innerhalb von USA

Anzahl: Mehr als 20 verf�gbar

In den Warenkorb

Beispielbild f�r diese ISBN

AI Engineering: Building Multi-Modal Intelligent Systems with Vision, Language, and Audio From LLM Fine-Tuning to Voice Agents, AR Interfaces, and Real-World Deployment

Ara, Husn

Verlag: Independently published, 2025

ISBN 13: 9798296089038

Neu Softcover

Print-on-Demand

Anbieter: California Books, Miami, FL, USA

Verk�uferbewertung 4 von 5 Sternen

Zustand: New. Print on Demand. Bestandsnummer des Verk�ufers I-9798296089038

Verk�ufer kontaktieren

Neu kaufen

EUR 34,29

Versand gratis
Versand innerhalb von USA

Anzahl: Mehr als 20 verf�gbar

In den Warenkorb

Beispielbild f�r diese ISBN

AI Engineering (Paperback)

Husn Ara

Verlag: Independently Published, 2025

ISBN 13: 9798296089038

Neu Paperback

Print-on-Demand

Anbieter: Grand Eagle Retail, Bensenville, IL, USA

Verk�uferbewertung 5 von 5 Sternen

Paperback. Zustand: new. Paperback. AI Engineering: Building Multi-Modal Intelligent Systems with Vision, Language, and AudioFrom LLM Fine-Tuning to Voice Agents, AR Interfaces, and Real-World DeploymentUnlock the future of artificial intelligence with practical, production-ready multi-modal engineering.This hands-on guide is built for developers, researchers, and AI professionals who want to go beyond chatbots and dive into building intelligent systems that understand text, images, audio, and human intent - all in one pipeline.Whether you're fine-tuning large language models (LLMs) or creating voice-driven AR interfaces, this book walks you through the real engineering decisions, tools, and architectures needed to bring multi-modal AI to life.What You'll Learn: Fine-tuning Large Language Models (LLMs): Train and adapt models like GPT-2, LLaMA, and Mistral for custom tasks using Hugging Face, LoRA, QLoRA, and PEFT.Voice Interfaces: Combine Whisper, LLMs, and Bark/Tortoise TTS to build interactive speech-driven assistants.Computer Vision + Language: Use models like BLIP, CLIP, and DETR to connect what systems see to what they say and understand.Instruction Tuning & Hyperparameter Optimization: Build smarter, domain-specific models with efficient training workflows.Multi-Modal Pipelines: Chain audio, image, and text inputs for question answering, summarization, tutoring, and AR/robotic control.Real-Time Interfaces: Deploy intelligent agents using FastAPI, Streamlit, Gradio, Docker, and Hugging Face Spaces.Edge & Offline Deployment: Optimize models with ONNX, quantization (4-bit, 8-bit), and TensorRT for low-latency inference on CPU/GPU.Use Cases Covered: Smart document summarizers with OCR + TTSVoice-enabled image assistantsEmotion-aware agentsVirtual tutorsAR-enhanced AI interfacesRobotic perception + control from voice/image inputSecure, multilingual, and privacy-conscious AI systemsTools & Frameworks Inside: Python, PyTorch, Hugging Face TransformersLangChain, OpenCV, Whisper, TTS, BLIPROS, Unity (AR/VR), Gradio, StreamlitDocker, FastAPI, gRPC, TorchServeBuilt for engineers. Written with depth. Designed for real-world impact.If you're ready to build intelligent multi-modal agents that understand the world like humans do - across speech, vision, and language - this book gives you the complete roadmap.Perfect for: Machine learning engineers, data scientists, AI product developers, researchers, robotics engineers, and anyone building cutting-edge AI systems. This item is printed on demand. Shipping may be from multiple locations in the US or from the UK, depending on stock availability. Bestandsnummer des Verk�ufers 9798296089038

Verk�ufer kontaktieren

Neu kaufen

EUR 38,42

Versand gratis
Versand innerhalb von USA

Anzahl: 1 verf�gbar

In den Warenkorb

Beispielbild f�r diese ISBN

AI Engineering: Building Multi-Modal Intelligent Systems with Vision, Language, and Audio From LLM Fine-Tuning to Voice Agents, AR Interfaces, and Rea

Ara, Husn

Verlag: Independently published, 2025

ISBN 13: 9798296089038

Gebraucht Softcover

Anbieter: GreatBookPrices, Columbia, MD, USA

Verk�uferbewertung 5 von 5 Sternen

Zustand: As New. Unread book in perfect condition. Bestandsnummer des Verk�ufers 51514388

Verk�ufer kontaktieren

Gebraucht kaufen

EUR 36,36

EUR 2,31 Versand
Versand innerhalb von USA

Anzahl: Mehr als 20 verf�gbar

In den Warenkorb

Beispielbild f�r diese ISBN

AI Engineering

Ara, Husn

Verlag: Amazon Digital Services LLC - Kdp, 2025

ISBN 13: 9798296089038

Neu PAP

Anbieter: PBShop.store UK, Fairford, GLOS, Vereinigtes K�nigreich

Verk�uferbewertung 5 von 5 Sternen

PAP. Zustand: New. New Book. Shipped from UK. Established seller since 2000. Bestandsnummer des Verk�ufers L2-9798296089038

Verk�ufer kontaktieren

Neu kaufen

EUR 35,24

EUR 4,89 Versand
Versand von Vereinigtes K�nigreich nach USA

Anzahl: Mehr als 20 verf�gbar

In den Warenkorb

Beispielbild f�r diese ISBN

AI Engineering: Building Multi-Modal Intelligent Systems with Vision, Language, and Audio From LLM Fine-Tuning to Voice Agents, AR Interfaces, and Rea

Ara, Husn

Verlag: Independently published, 2025

ISBN 13: 9798296089038

Neu Softcover

Anbieter: GreatBookPricesUK, Woodford Green, Vereinigtes K�nigreich

Verk�uferbewertung 5 von 5 Sternen

Zustand: New. Bestandsnummer des Verk�ufers 51514388-n

Verk�ufer kontaktieren

Neu kaufen

EUR 35,23

EUR 17,65 Versand
Versand von Vereinigtes K�nigreich nach USA

Anzahl: Mehr als 20 verf�gbar

In den Warenkorb

Beispielbild f�r diese ISBN

AI Engineering: Building Multi-Modal Intelligent Systems with Vision, Language, and Audio From LLM Fine-Tuning to Voice Agents, AR Interfaces, and Rea

Ara, Husn

Verlag: Independently published, 2025

ISBN 13: 9798296089038

Gebraucht Softcover

Anbieter: GreatBookPricesUK, Woodford Green, Vereinigtes K�nigreich

Verk�uferbewertung 5 von 5 Sternen

Zustand: As New. Unread book in perfect condition. Bestandsnummer des Verk�ufers 51514388

Verk�ufer kontaktieren

Gebraucht kaufen

EUR 37,81

EUR 17,65 Versand
Versand von Vereinigtes K�nigreich nach USA

Anzahl: Mehr als 20 verf�gbar

In den Warenkorb

Beispielbild f�r diese ISBN

AI Engineering (Paperback)

Husn Ara

Verlag: Independently Published, 2025

ISBN 13: 9798296089038

Neu Paperback

Print-on-Demand

Anbieter: CitiRetail, Stevenage, Vereinigtes K�nigreich

Verk�uferbewertung 5 von 5 Sternen

Paperback. Zustand: new. Paperback. AI Engineering: Building Multi-Modal Intelligent Systems with Vision, Language, and AudioFrom LLM Fine-Tuning to Voice Agents, AR Interfaces, and Real-World DeploymentUnlock the future of artificial intelligence with practical, production-ready multi-modal engineering.This hands-on guide is built for developers, researchers, and AI professionals who want to go beyond chatbots and dive into building intelligent systems that understand text, images, audio, and human intent - all in one pipeline.Whether you're fine-tuning large language models (LLMs) or creating voice-driven AR interfaces, this book walks you through the real engineering decisions, tools, and architectures needed to bring multi-modal AI to life.What You'll Learn: Fine-tuning Large Language Models (LLMs): Train and adapt models like GPT-2, LLaMA, and Mistral for custom tasks using Hugging Face, LoRA, QLoRA, and PEFT.Voice Interfaces: Combine Whisper, LLMs, and Bark/Tortoise TTS to build interactive speech-driven assistants.Computer Vision + Language: Use models like BLIP, CLIP, and DETR to connect what systems see to what they say and understand.Instruction Tuning & Hyperparameter Optimization: Build smarter, domain-specific models with efficient training workflows.Multi-Modal Pipelines: Chain audio, image, and text inputs for question answering, summarization, tutoring, and AR/robotic control.Real-Time Interfaces: Deploy intelligent agents using FastAPI, Streamlit, Gradio, Docker, and Hugging Face Spaces.Edge & Offline Deployment: Optimize models with ONNX, quantization (4-bit, 8-bit), and TensorRT for low-latency inference on CPU/GPU.Use Cases Covered: Smart document summarizers with OCR + TTSVoice-enabled image assistantsEmotion-aware agentsVirtual tutorsAR-enhanced AI interfacesRobotic perception + control from voice/image inputSecure, multilingual, and privacy-conscious AI systemsTools & Frameworks Inside: Python, PyTorch, Hugging Face TransformersLangChain, OpenCV, Whisper, TTS, BLIPROS, Unity (AR/VR), Gradio, StreamlitDocker, FastAPI, gRPC, TorchServeBuilt for engineers. Written with depth. Designed for real-world impact.If you're ready to build intelligent multi-modal agents that understand the world like humans do - across speech, vision, and language - this book gives you the complete roadmap.Perfect for: Machine learning engineers, data scientists, AI product developers, researchers, robotics engineers, and anyone building cutting-edge AI systems. This item is printed on demand. Shipping may be from our UK warehouse or from our Australian or US warehouses, depending on stock availability. Bestandsnummer des Verk�ufers 9798296089038

Verk�ufer kontaktieren

Neu kaufen

EUR 39,37

EUR 43,53 Versand
Versand von Vereinigtes K�nigreich nach USA

Anzahl: 1 verf�gbar

In den Warenkorb