Wanted: LLM Inference service providers in Europe

Lunics · October 2024

I'm looking for offers and pointers for LLM inference service via API, in particular from European API providers that adhere to GDPR. The LLMs offered via API should currently include Llama-3.1-70B-instruct FP16.

PS: For LLM providers in general, https://artificialanalysis.ai/ is a good resource.

PPS: I am aware of IONOS, Mistral, OVHCloud, Nebius, LightOn / Orange Business, T-Systems

Keywords: Llama, AI, GenAI, generative AI, vLLM, openAI API, tokens

Void · October 2024

TensorDock / @lentro

Lunics · October 2024

@Void said:
TensorDock / @lentro

They seem to offer GPUs but no inference service - yet

k4zz · October 2024

IBM cloud

foxtwo · October 2024

@Lunics said:

@Void said:
TensorDock / @lentro

They seem to offer GPUs but no inference service - yet

SGLang works great for most of the models, including Llama.

https://github.com/sgl-project/sglang

foxtwo · October 2024

@Lunics said:
I'm looking for offers and pointers for LLM inference service via API, in particular from European API providers that adhere to GDPR. The LLMs offered via API should currently include Llama-3.1-70B-instruct FP16.

Out of curiosity, but why BF16?
Many benchmarks show that there are no differences between BF16 and FP8.

Void · October 2024

@Lunics said:

@Void said:
TensorDock / @lentro

They seem to offer GPUs but no inference service - yet

I know. They were offering free API access for Llama 3.1 8B hosted in their infrastructure, so maybe a custom plan? That’s the closest I’ve seen in this forum regarding your requirements, so it never hurts to ask.

LeifurGunnarsson · October 2024

Why not just use something like groqcloud?

CharityHost_org · October 2024

Runpod.io but you have to develop to add models yourself although some models are pretty much out of the box already in their pod instances templates.

ehhthing · October 2024

Given the current uncertainty about EU laws around AI I doubt there are many companies in this space.

luis_cortecs · December 2024

https://cortecs.ai/

Howdy, Stranger!

Categories

In this Discussion

Wanted: LLM Inference service providers in Europe

Comments

Howdy, Stranger!

Quick Links

Categories

In this Discussion

Wanted: LLM Inference service providers in Europe

Comments