LLM Serving on Yotta Labs with AWS Trainium
1. Platform Overview & Hardware Selection
NVIDIA | [AWS] ← click to switchSpec
Value

2. AWS Trainium vs NVIDIA GPU — Key Differences
Aspect
NVIDIA GPU
AWS Trainium1
Key Takeaways
3. Launching a Trainium Pod
3.1 Select the Image
Field
Value

3.2 Ports
Port
Service
3.3 Deploy
4. Connecting & Verifying the Environment
4.1 SSH into the Pod
4.2 Activate the Conda Environment
4.3 Verify Neuron Devices
4.4 Verify All Required Packages
5. Using the Prestarted vLLM Service
5.1 Check if vLLM is already running
5.2 Confirm the active model
5.3 Run inference (curl)
Quick Reference
Verified Package Versions
Package
Version
Last updated
Was this helpful?