Appearance
AI Models and LLM Systems
AI Models and LLM Systems Graphics Coverage
Primary chapter graphic: LLM Glossary Map, Generative AI Learning Roadmap, Transformer Encoder Decoder Flow, AI Concepts Map, MCP Server Catalog, Frontier Model One-Pager, Generative AI Tech Stack, Reasoning Model Training Path, Open Model Inference Flow, RAG vs Fine-Tuning, Full Fine-Tuning vs LoRA vs RAG, Open Source AI Stack, How LLMs See Text, Visual Token Context Compression. Accepted graphics: 14. Reviewed non-signal pages: 5. Open graphics in review: 0. QA status lives in graphics audit and visual review ledger.
Corpus pages: p. 38, p. 43, p. 74, p. 131, p. 147, p. 158, p. 175, p. 194-195, p. 200-201, p. 230, p. 242, p. 266-267, p. 277, p. 326-327, p. 335-336, p. 343, p. 364, p. 373-374 Coverage: 24 pages; low-confidence extraction ranges: p. 364, p. 373-374
This chapter is part of Marius's owned architecture build corpus. The text routes decisions; durable implementation signal is carried by accepted graphics, reviewed non-signal decisions, and the linked QA audit.
Chapter Visuals
Accepted graphics carry the canonical design signal for this chapter. Each selected source page is either accepted as a graphic or explicitly marked non-signal in the source-faithful ledger. Review and QA state live in visual inventory, visual review ledger, and graphics audit.
LLM Glossary Map
- source-page: p. 147
- batch: 04
- status: accepted
- reviewer-status: reviewed
- fidelity-score: 0.9
- spec: bbg-p0147-ai-models-and-llm-systems.json
- svg: bbg-p0147-ai-models-and-llm-systems.svg

Generative AI Learning Roadmap
- source-page: p. 77
- batch: 05
- status: accepted
- reviewer-status: reviewed
- fidelity-score: 0.9
- spec: bbg-p0077-architecture-source-map-architecture-source.json
- svg: bbg-p0077-architecture-source-map-architecture-source.svg

Transformer Encoder Decoder Flow
- source-page: p. 130
- batch: 08
- status: accepted
- reviewer-status: reviewed
- fidelity-score: 0.9
- spec: bbg-p0130-search-retrieval-and-rag-search.json
- svg: bbg-p0130-search-retrieval-and-rag-search.svg

MCP Server Catalog
- source-page: p. 158
- batch: 09
- status: accepted
- reviewer-status: reviewed
- fidelity-score: 0.9
- spec: bbg-p0158-ai-models-and-llm-systems.json
- svg: bbg-p0158-ai-models-and-llm-systems.svg

AI Concepts Map
- source-page: p. 194
- batch: 09
- status: accepted
- reviewer-status: reviewed
- fidelity-score: 0.9
- spec: bbg-p0194-ai-models-and-llm-systems.json
- svg: bbg-p0194-ai-models-and-llm-systems.svg

Frontier Model One-Pager
- source-page: p. 73
- batch: 13
- status: accepted
- reviewer-status: reviewed
- fidelity-score: 0.9
- spec: bbg-p0073-api-gateways-and-contracts-api.json
- svg: bbg-p0073-api-gateways-and-contracts-api.svg

Generative AI Tech Stack
- source-page: p. 222
- batch: 13
- status: accepted
- reviewer-status: reviewed
- fidelity-score: 0.9
- spec: bbg-p0222-cloud-infrastructure-and-iac-cloud.json
- svg: bbg-p0222-cloud-infrastructure-and-iac-cloud.svg

Reasoning Model Training Path
- source-page: p. 38
- batch: 14
- status: accepted
- reviewer-status: reviewed
- fidelity-score: 0.9
- spec: bbg-p0038-ai-models-and-llm-systems.json
- svg: bbg-p0038-ai-models-and-llm-systems.svg

Open Model Inference Flow
- source-page: p. 276
- batch: 15
- status: accepted
- reviewer-status: reviewed
- fidelity-score: 0.9
- spec: bbg-p0276-architecture-source-map-architecture-source.json
- svg: bbg-p0276-architecture-source-map-architecture-source.svg

RAG vs Fine-Tuning
- source-page: p. 366
- batch: 15
- status: accepted
- reviewer-status: reviewed
- fidelity-score: 0.9
- spec: bbg-p0366-platform-selection-and-tradeoffs-platform.json
- svg: bbg-p0366-platform-selection-and-tradeoffs-platform.svg

Full Fine-Tuning vs LoRA vs RAG
- source-page: p. 326
- batch: 17
- status: accepted
- reviewer-status: reviewed
- fidelity-score: 0.9
- spec: bbg-p0326-ai-models-and-llm-systems.json
- svg: bbg-p0326-ai-models-and-llm-systems.svg

Open Source AI Stack
- source-page: p. 24
- batch: 23
- status: accepted
- reviewer-status: reviewed
- fidelity-score: 0.9
- spec: bbg-p0024-deployment-release-and-versioning-deployment.json
- svg: bbg-p0024-deployment-release-and-versioning-deployment.svg

How LLMs See Text
- source-page: p. 364
- batch: 25
- status: accepted
- reviewer-status: reviewed
- fidelity-score: 0.9
- spec: bbg-p0364-ai-models-and-llm-systems.json
- svg: bbg-p0364-ai-models-and-llm-systems.svg

Visual Token Context Compression
- source-page: p. 373
- batch: 26
- status: accepted
- reviewer-status: reviewed
- fidelity-score: 0.9
- spec: bbg-p0373-ai-models-and-llm-systems.json
- svg: bbg-p0373-ai-models-and-llm-systems.svg

Open Review Queue
- none
Reviewed Non-Signal Pages
- AI Models And LLM Systems: LLM + Vector Search Map: source p. 195; batch 08; status non-signal/reviewed; ledger reason in visual-review-ledger.json
- AI Models And LLM Systems: LLM + Tool Map: source p. 230; batch 08; status non-signal/reviewed; ledger reason in visual-review-ledger.json
- AI Models And LLM Systems: LLM + Tool Map: source p. 242; batch 11; status non-signal/reviewed; ledger reason in visual-review-ledger.json
- AI Models And LLM Systems: LLM + Embedding Map: source p. 277; batch 12; status non-signal/reviewed; ledger reason in visual-review-ledger.json
- AI Models And LLM Systems: Embedding + Pattern Map: source p. 131; batch 16; status non-signal/reviewed; ledger reason in visual-review-ledger.json
Use When
- A workflow needs language understanding, extraction, classification, summarization, or generation.
Avoid When
- A deterministic rule is cheaper, clearer, and more reliable.
Core Model
- Model systems combine prompt, context, tool constraints, evaluation, fallback, and cost controls.
- Prefer explicit ownership over accidental coupling. Every boundary should say who owns correctness, cost, data, recovery, and change.
- Use corpus page pointers for inspection, and keep the chapter notes focused on reusable design decisions.
Implementation Guidance
- Define task type, acceptable error, source context, output schema, review threshold, and regression set.
- Write the smallest useful design note: purpose, inputs, outputs, state, failure behavior, observability, and rollback.
- Choose the first implementation that can be tested against the real workflow without hiding a known production risk.
Tradeoffs
- Fine-tuning can specialize behavior; retrieval keeps knowledge fresh without changing weights.
- Centralization reduces duplicated work but can become a bottleneck when every team needs exceptions.
- Specialized infrastructure helps at scale, but it must earn its operational cost.
Failure Modes
- A model is evaluated by vibes instead of test cases tied to business risk.
- The diagram shows boxes but not ownership, retry behavior, data freshness, or user-visible failure.
- The system has no proof path for the highest-risk assumption.
Decision Checklist
- Measure accuracy, refusal behavior, hallucination rate, latency, cost, and human review load.
- Name the owner, source of truth, timeout, retry policy, and evidence that the path works.
- Add one regression check for the failure mode most likely to recur.
Neutral Automation Examples
- A classifier suggests categories with confidence and routes low-confidence items to a reviewer.
- A neutral internal automation starts with fixtures, then adds credentials, permissions, and production scheduling only after the boundary is tested.
- A customer-facing workflow keeps irreversible actions behind explicit approval until metrics show it is safe to automate further.