
ลูกค้า: Red Hat US
รูปแบบ: หนังสืออิเล็กทรอนิกส์
ขนาด: 5.18 เมกะไบต์
ภาษา: อังกฤษ
วันที่: 06.11.2025
Get started with AI Inference
This e-book introduces the fundamentals of inference performance engineering and model optimization, with a focus on quantization, sparsity, and other techniques that help reduce compute and memory requirements for artificial intelligence (AI) models. It highlights the benefits of using a Red Hat® open approach, validated model repository, and tools like the LLM Compressor and Red Hat AI Inference Server. Download to get started.