Tech Talks

Building Conversational Agents with Their Own Identity

Tech Team

10 nov 2025

The Hidden Parameter That Cut Our LLM Response Times by 68%

TL;DR: We wanted to understand what actually matters when deploying LLMs in production beyond marketing claims. We benchmarked GPT-OSS 120B across Cerebras , Groq , Azure , and AWS in October 2025. Vendor/press claims show Cerebras attaining up to ~3,000 tokens/sec on wafer-scale inference (peak hardware), while independent single-request benchmarks put AWS Bedrock at ~230 tokens/sec for steady per-session throughput. Crucially, tuning Bedrock's reasoning_effort parameter ma

Tech Team

6 nov 2025

How Promtior Is Using Cursor Rules to Build Smarter With AI

What are Cursor Rules, How to Apply Them & How Promtior Uses Them.

Tech Team

10 sept 2025

The AI Agent's Secret Weapon: Introducing the Model Context Protocol (MCP)

A new standard is changing how AI connects with business tools. In this article, we explain what MCP is, how we're applying it at Promtior, and why it could redefine the way your organization uses AI.

Tech Team

28 may 2025