This article shows CPU-only inference with a modern server processor – the AMD EPYC 9554. The LLM model is Meta’s Llama 4 Scout 17B…
llama-bench the Phi-4 14B and AMD EPYC 9554 CPU
This article shows CPU-only inference with a modern server processor – the AMD EPYC 9554. The LLM model is Microsoft’s Phi-4 14B with different…
llama-bench the Qwen3 Coder 30B A3B and AMD EPYC 9554 CPU
This article shows CPU-only inference with a modern server processor – the AMD EPYC 9554. The LLM model is Alibaba’s Qwen3 Coder 30B A3B…
llama-bench the Qwen3 32B and AMD EPYC 9554 CPU
This article shows CPU-only inference with a modern server processor – the AMD EPYC 9554. The LLM model is Alibaba’s Qwen3 32B with different…
llama-bench the Gemma 3 27B and AMD EPYC 9554 CPU
This article shows CPU-only inference with a modern server processor – the AMD EPYC 9554. The LLM model is Google’s Gemma 3 27B Instruct…
llama-bench the Mistral Large 123B and AMD EPYC 9554 CPU
This article shows CPU-only inference with a modern server processor – the AMD EPYC 9554. The LLM model is Mistral Large Instruct 123B 2411…
llama-bench the Qwen 2.5 Coder 32B and AMD EPYC 9554 CPU
This article shows CPU-only inference with a modern server processor – the AMD EPYC 9554. The LLM model is Qwen 2.5 Coder 32B with…
llama-bench the DeepSeek R1 Distill Llama 70B and dual AMD EPYC 7282
This article shows CPU-only inference with a relatively old server processor – the AMD EPYC 7282. The LLM model is DeepSeek R1 Distill Llama…
llama-bench the DeepSeek R1 Distill Llama 70B and AMD EPYC 9554 CPU
This article shows CPU-only inference with a modern server processor – the AMD EPYC 9554. The LLM model is DeepSeek R1 Distill Llama 70B…
LLM inference benchmarks with llama.cpp and dual Xeon Gold 5317 CPUs
LLMs, or large language models, are very popular these days, and many people and organizations have begun to rely on them. This article continues in the…