top of page


Building Conversational Agents with Their Own Identity
Building Conversational Agents with Their Own Identity
Tech Team
10 nov


The Hidden Parameter That Cut Our LLM Response Times by 68%
TL;DR: We wanted to understand what actually matters when deploying LLMs in production beyond marketing claims. We benchmarked GPT-OSS 120B across Cerebras , Groq , Azure , and AWS in October 2025. Vendor/press claims show Cerebras attaining up to ~3,000 tokens/sec on wafer-scale inference (peak hardware), while independent single-request benchmarks put AWS Bedrock at ~230 tokens/sec for steady per-session throughput. Crucially, tuning Bedrock's reasoning_effort parameter ma
Tech Team
6 nov


How Promtior Is Using Cursor Rules to Build Smarter With AI
What are Cursor Rules, How to Apply Them & How Promtior Uses Them.
Tech Team
10 sept


The AI Agent's Secret Weapon: Introducing the Model Context Protocol (MCP)
A new standard is changing how AI connects with business tools. In this article, we explain what MCP is, how we're applying it at Promtior, and why it could redefine the way your organization uses AI.
Tech Team
28 may
bottom of page