Overview

Groq provides ultra-fast language model inference through its specialized inference hardware (Language Processing Units, or LPUs) and optimized model-serving infrastructure. The service offers high-speed text generation with dramatically lower latency than traditional cloud providers, making it well suited to real-time applications and high-throughput scenarios.
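As a minimal illustration of the low-latency workflow, the sketch below streams tokens from Groq's OpenAI-compatible endpoint using the standard openai Python SDK. The model name `llama-3.1-8b-instant` and the `GROQ_API_KEY` environment variable are assumptions; substitute whatever your account exposes.

```python
import os

from openai import OpenAI

# Groq exposes an OpenAI-compatible endpoint, so the standard openai SDK
# works by overriding the base URL. The API key is read from the environment.
client = OpenAI(
    base_url="https://api.groq.com/openai/v1",
    api_key=os.environ["GROQ_API_KEY"],
)

# Stream tokens as they are generated to take advantage of the low latency.
stream = client.chat.completions.create(
    model="llama-3.1-8b-instant",  # example model; pick one your account exposes
    messages=[{"role": "user", "content": "Explain vector embeddings in two sentences."}],
    stream=True,
)
for chunk in stream:
    delta = chunk.choices[0].delta.content
    if delta:
        print(delta, end="", flush=True)
print()
```

Streaming is optional; a non-streaming call returns the full completion in one response, but streaming makes the speed advantage visible in interactive use.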

Service Information

| Property | Value |
|----------|-------|
| Service Name | Groq |
| Status | Enabled |
| Compatible Nodes | Call LLM, HyperFlow LLM PDF transformer |

When to Use This Service

Groq is ideal for:

- Real-time applications (chat interfaces, streaming assistants) where response latency matters
- High-throughput scenarios that need many completions quickly
- Workloads built on popular open-source models that benefit from faster serving

API Requirements

A valid Groq API key is required. You can generate one in the Groq console (https://console.groq.com) and supply it in the service configuration before running Groq-backed nodes.

Available Models

Model availability depends on your Groq API access. Check the model dropdown in the Call LLM node for the currently available options.

Groq typically serves optimized builds of popular open-source models with significantly faster inference than general-purpose providers.
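You can also query your account's model list directly. The sketch below uses the same OpenAI-compatible `/models` endpoint; the `GROQ_API_KEY` environment variable is an assumption.

```python
import os

from openai import OpenAI

client = OpenAI(
    base_url="https://api.groq.com/openai/v1",
    api_key=os.environ["GROQ_API_KEY"],
)

# The models endpoint returns only the models your key can access,
# which should match what the Call LLM dropdown shows.
for model in client.models.list().data:
    print(model.id)
```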

Parameters

| Parameter | Type | Default | Description |
|-----------|------|---------|-------------|
| Model | Choice | (varies) | Select a Groq-optimized language model |
| Temperature | Number | 0.7 | Response creativity control (0.0-2.0) |
| Max tokens | Number | 1000 | Maximum response length |
| System message | Text | (empty) | System-level instructions for model behavior |
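To show how these parameters map onto an actual request, here is a hedged sketch against Groq's OpenAI-compatible API. The system message becomes a `system`-role message; the model name and prompt are illustrative assumptions.

```python
import os

from openai import OpenAI

client = OpenAI(
    base_url="https://api.groq.com/openai/v1",
    api_key=os.environ["GROQ_API_KEY"],
)

response = client.chat.completions.create(
    model="llama-3.1-8b-instant",  # Model: example only; use what the dropdown offers
    temperature=0.7,               # Temperature: 0.0-2.0, higher values increase variety
    max_tokens=1000,               # Max tokens: upper bound on response length
    messages=[
        # System message: system-level instructions for model behavior
        {"role": "system", "content": "You are a concise technical assistant."},
        {"role": "user", "content": "Summarize Groq's inference service in one sentence."},
    ],
)
print(response.choices[0].message.content)
```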

Key Features