Assignment Review¶
Q1: Why do tokens matter?¶
Tokens are the units processed by an LLM. Every prompt, log file, YAML file, or response consumes tokens. More tokens increase cost, latency, and may exceed the model's context window.
DevOps Example¶
100 log lines = fewer tokens
10000 log lines = more tokens
More tokens = more Bedrock cost.
Q2: Context Window is similar to?¶
RAM
Server RAM
≈
LLM Context Window
If RAM is full → problems.
If Context Window is full → older information gets dropped or request fails.
Q3: Prompt vs System Prompt¶
Prompt is casual user input but system prompt decides behavior
| Type | Purpose |
|---|---|
| Prompt | User request |
| System Prompt | Defines AI behavior and rules |
Example:
System:
You are a Kubernetes Expert.
User:
Why is my pod restarting?
Q4: Temperature¶
Your answer:
Low
✅ Correct.
Reason:
Troubleshooting
RCA
Automation
Runbooks
must be consistent.
We don't want AI being creative when diagnosing production issues.
Q5: Hallucinations¶
Hallucinations are dangerous because AI may generate incorrect troubleshooting steps, wrong root causes, or non-existent commands. Following such advice in production could worsen incidents.
Example:
AI:
Run kubectl delete namespace production
if hallucinated → disaster.
Mini Project Review¶
You are a Senior Kubernetes SRE with 8 years of experience.
Analyze the Kubernetes pod failure using the logs below.
Pod Name: <pod-name>
Namespace: dev
Logs:
<logs>
Provide:
1. Root Cause Analysis
2. Severity (Low/Medium/High/Critical)
3. Probable Cause
4. Impact
5. Recommended Fix
6. kubectl Commands to Validate
7. Preventive Measures
Return the response in markdown format.
This is much closer to what enterprises use.
Module 2 Status¶
✅ Tokens
✅ Context Window
✅ Prompts
✅ System Prompts
✅ Temperature
✅ Inference
✅ Hallucinations