GPU Instances in the Cloud - Rent Nvidia L4
Powerful performance for AI training, AI inference, rendering, high-intensity calculations, and multimedia.
Run advanced workloads directly in the cloud – without buying and operating your own hardware. Our GPU instances are built for enterprises, developers, and creators who demand high performance, low latency, and full control.
- Servers & data storage in EU (Umeå, Sweden)
- Hourly billing
- Affordable
Why Hexabyte GPU?
Modern hardware – Nvidia L4 (PCIe Gen 4) with 24GB GDDR6 and energy-efficient Ada Lovelace architecture.
Flexible instances – Scale up or down as needed.
Local infrastructure – Low response times within the EU, servers located in Umeå, Sweden.
Transparent pricing – The most performance for the money.
Perfectly suited for
- AI & Machine Learning – AI training, AI inference and data processing.
- Rendering & Video – Stable Diffusion, 3D rendering, video encoding and real-time streaming.
- High-performance computing – Financial models, simulations and research projects.
- Virtual workstations – CAD, GIS and other GPU-heavy applications.
NVIDIA L4
24GB GDDR6
Massive graphics memory for large models and datasets.
PCIe Gen 4 Passthrough
Full control over the entire GPU, no sharing with other customers.
Up to 30.3 TFLOPS FP32
Raw computing power for AI and rendering.
AI-optimized Ada Lovelace architecture
Specialized in AI inference, AI training and video processing.
Energy-efficient performance
2.5x better performance per watt compared to previous generations.
Low response times within the EU
Server hall in Umeå, Sweden.
Pricing example Nvidia L4 GPU instances
The L4 family consists of GPU servers with Nvidia L4 GPUs, AMD EPYC 7713 CPUs, DDR4 RAM, and high-IOPS local NVMe disks in RAID.
l4.gpu1
- 1x Nvidia L4 GPU
- 4 vCPUs
- 16GB RAM
- 100GB NVMe Storage
l4.gpu2
- 1x Nvidia L4 GPU
- 8 vCPUs
- 32GB RAM
- 500GB NVMe Storage
l4.gpu3
- 1x Nvidia L4 GPU
- 16 vCPUs
- 64GB RAM
- 1TB NVMe Storage
l4.dual-gpu
- 2x Nvidia L4 GPUs
- 32 vCPUs
- 128GB RAM
- 2.5TB NVMe Storage
Own public IP addresses
1 IPv4 address and 1 IPv6 address are included per instance.
Renewable electricity
Our servers are powered by electricity from Umeå Energi, which supplies renewable electricity. In 2022, the mix consisted of 72% hydropower, 10% wind power, 16% biopower, and 2% solar energy. Read more on Umeå Energi's website.
NVMe SSD instead of hard drives
We know that our customers prioritize speed, so all data storage is done on enterprise-grade NVMe SSDs. Faster and more power-efficient! All data is replicated across three separate disks.
We are our own internet provider
Internet connectivity for our services is delivered by us via our own AS number. A stable and secure connection – without traffic restrictions!
FAQ
What is a GPU instance in the cloud?
A GPU instance in the cloud is a virtual machine at a cloud provider with a GPU (graphics card) attached. The GPU can be either a shared vGPU or a full GPU via PCIe passthrough.
Why do I need a GPU instead of a regular CPU-based VPS?
A regular VPS with a CPU is fine for general-purpose applications, but some workloads require the parallel processing power of a GPU. With a GPU instance, you can:
- Train and run AI and ML models significantly faster
- Render graphics and 3D in real time
- Run video transcoding at scale
- Take advantage of CUDA, TensorRT, and other NVIDIA-optimized libraries
In short: CPU for general work, GPU for accelerated computing.
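As a rough back-of-envelope illustration, the sketch below estimates the theoretical time for a large FP32 matrix multiplication at the L4's peak throughput of 30.3 TFLOPS. The CPU throughput figure is a hypothetical assumption for comparison, not a measured number:

```python
# Back-of-envelope estimate: theoretical time for a large FP32
# matrix multiplication at the Nvidia L4's peak throughput.
# The CPU throughput below is a hypothetical ballpark, not a benchmark.

N = 8192                  # square matrix dimension
flops = 2 * N**3          # multiply-adds in an N x N matmul

GPU_TFLOPS = 30.3         # Nvidia L4 peak FP32 (from the spec above)
CPU_TFLOPS = 1.0          # assumed ballpark for a modern server CPU

gpu_seconds = flops / (GPU_TFLOPS * 1e12)
cpu_seconds = flops / (CPU_TFLOPS * 1e12)

print(f"Total work: {flops / 1e12:.2f} TFLOP")
print(f"GPU (peak):    {gpu_seconds * 1000:.1f} ms")
print(f"CPU (assumed): {cpu_seconds * 1000:.1f} ms")
```

Real-world throughput is lower than the peak figure, but the ratio illustrates why massively parallel workloads belong on the GPU.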
What kind of storage do the GPU instances use?
All our GPU instances come with dedicated NVMe storage locally in the server. This disk is used as a scratch disk or primary storage and delivers extremely high IOPS and low latency compared to traditional network storage. For workloads like AI training, data mining, and video processing, this means much faster access to your data.
How does billing work?
We offer flexible hourly billing where you only pay for the time your instance is running. For customers who want long-term operation, we also display the maximum monthly price. Your saved payment card is charged automatically, and there are no lock-in periods – you can start and stop your GPU instances whenever you want.
Companies can apply for an invoice as a payment method by submitting a support ticket in the cloud platform.
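The billing model described above – hourly metering capped at a maximum monthly price – can be sketched as follows. The rates used here are hypothetical placeholders, not Hexabyte's actual prices:

```python
def billed_amount(hours_running: float, hourly_rate: float, monthly_cap: float) -> float:
    """Hourly billing capped at a maximum monthly price.

    You pay per hour the instance is running, but never more than
    the stated monthly maximum. All rates here are hypothetical.
    """
    return min(hours_running * hourly_rate, monthly_cap)

# Hypothetical example rates (NOT actual Hexabyte prices):
HOURLY = 8.0      # currency units per hour
CAP = 4000.0      # maximum charge per month

print(billed_amount(50, HOURLY, CAP))    # ran ~2 days: pay per hour
print(billed_amount(744, HOURLY, CAP))   # ran the whole month: cap applies
```

The second call shows the point of the cap: an instance left running all month never costs more than the stated monthly maximum.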
Can I install my own software and drivers?
You get full root access to your GPU instance and can install your own drivers, libraries, and software. This means you can set up everything from Docker containers with PyTorch/TensorFlow to full virtual workstations (vWS) with e.g. Remote Desktop or NICE DCV.
How does support work?
Support is handled via tickets directly in our cloud system, and you can write in both Swedish and English. Our technicians are located in Sweden and will help you with operational issues, instance management, and troubleshooting.
Do I share the GPU with other customers?
No. All of our GPU plans include only dedicated NVIDIA L4 GPUs, connected via PCIe passthrough. This means you don't share GPU resources with other customers – unlike some cloud providers that offer lower-performance shared vGPUs. With us, performance is predictable and guaranteed.
Is the Nvidia L4 suitable for AI training?
Absolutely. The NVIDIA L4 is great for both AI inference and model training. The card supports FP32, FP16, and INT8, making it efficient for both training and optimized inference. You can use popular frameworks like TensorFlow, PyTorch, and JAX without any problems.
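Because the card supports FP32, FP16, and INT8, your choice of precision directly determines how large a model fits in the 24GB of GDDR6. A quick sketch of the arithmetic, using a hypothetical 7-billion-parameter model as the example:

```python
# How much VRAM do the model weights alone need at each precision?
# The 7B-parameter model size is a hypothetical example.

BYTES_PER_PARAM = {"fp32": 4, "fp16": 2, "int8": 1}
VRAM_GB = 24          # Nvidia L4 graphics memory
PARAMS = 7e9          # hypothetical 7B-parameter model

def weights_gb(params: float, dtype: str) -> float:
    """Memory needed just for the model weights, in GB."""
    return params * BYTES_PER_PARAM[dtype] / 1e9

for dtype in BYTES_PER_PARAM:
    gb = weights_gb(PARAMS, dtype)
    verdict = "fits" if gb < VRAM_GB else "does not fit"
    print(f"{dtype}: {gb:.0f} GB of weights -> {verdict} in {VRAM_GB} GB")
```

Note that activations, optimizer state, and KV caches add further memory on top of the weights, so the real headroom is smaller than this weights-only estimate suggests.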