Random Llama
Random Llama
ProductsSolutionsBlogCase StudiesContact
Get a Quote
Weekly Newsletter

Get AI & productivity insights weekly

Privacy-first tools, workflow tips, and early product access. No spam — unsubscribe anytime.

Random Llama Software

Texas-built weird tools and custom web platforms—fast shipping, no creepy tracking, no enterprise bloat.

Links
  • Home
  • Products
  • Case Studies
  • Blog
  • Solutions
  • Credentials
  • Contact
Services
  • Custom CMS
  • Booking Engines
  • Mobile Apps
  • AI Integration
Connect
  • Privacy Policy
  • Terms of Service
  • Cookie Policy

© 2026 Random Llama Software, LLC. All rights reserved. Privacy Policy

Back to Blog
ai-toolsopen-source

Cursor Got Caught and Gemini Is Coming for Your Memory

Robert HattalaMarch 29, 2026

Cursor launched Composer 2 on March 19 and called it "state-of-the-art programming intelligence." A developer found kimi-k2p5-rl-0317 buried in an API response a few days later. Composer 2 was Kimi K2.5 with extra fine-tuning, and Cursor never said that.

Cursor's Composer 2 Was Built on a Chinese Open-Source Model

Moonshot AI, backed by Alibaba, released Kimi K2.5 as an open-weight model earlier this year. Cursor used it as the base for Composer 2, added reinforcement learning on top, and shipped it without attribution.

Cursor's VP of dev education said "only ~1/4 of the compute came from the base." The co-founder called the omission a "miss." That's a polite way of saying they got caught by a developer reading API logs.

This matters beyond the drama. If a $50B company is shipping a top-tier coding assistant on a Chinese open-source foundation without disclosing it, the model provenance question just got serious for enterprise buyers.

Gemini Now Wants Your ChatGPT and Claude Memories

Google shipped an import tool on March 26 that lets you pull chat history and memories from ChatGPT or Claude straight into Gemini. Upload a ZIP or paste a summarization prompt, and Gemini absorbs your context.

The biggest switching cost between AI assistants isn't features. It's losing months of accumulated personalization. Google just removed that friction with one tool.

Anthropic built something similar for Claude about three weeks earlier. The memory portability race is on. For users that's good. For anyone betting on lock-in as a moat, it's a problem.

Gemini 3.1 Flash-Lite Is $0.25 per Million Tokens

Google also released Gemini 3.1 Flash-Lite recently with 2.5x faster response times and a price of $0.25 per million input tokens. For apps making lots of AI calls, that changes your infrastructure math in a real way.

Sub-dollar-per-million-tokens is becoming the default for commodity tasks. We're already routing lighter classification jobs through Flash-class models at Random Llama. The latency and cost both make production sense now.

The Cursor story and the Gemini memory importer are part of the same shift. The model layer is commoditizing fast, open-source foundations are everywhere, and the battle is moving to trust and data portability. Build on the product layer. Model dependencies are not a moat.

Related posts

Amazon Drops 25B on Anthropic and AI Infra Gets Serious

Amazon commits another 25B to Anthropic, regulators target frontier AI, and clinical trials get a governance playbook. Here is what matters.

April 21, 2026

Anthropic Hit 30B, Google Went Cheap, AI Agents Spook Security

Anthropic reportedly passed OpenAI at 30B annualized AI revenue. Google shipped a dirt cheap Gemini model. AI agents are making security folks sweat.

April 20, 2026

AI Agents Show Up for Real Work and MCP Gets Cracked Open

Three AI stories worth reading this week. EY put agents on 130K auditors, OpenAI aimed at drug discovery, and Anthropic MCP spec took a beating.

April 19, 2026
All posts