Random Llama

ai-tools, open-source

Cursor Got Caught and Gemini Is Coming for Your Memory

Robert Hattala, March 29, 2026

Cursor launched Composer 2 on March 19 and called it "state-of-the-art programming intelligence." A few days later, a developer found the model identifier kimi-k2p5-rl-0317 buried in an API response. Composer 2 was Kimi K2.5 with extra fine-tuning, and Cursor had not disclosed that.

Cursor's Composer 2 Was Built on a Chinese Open-Source Model

Moonshot AI, backed by Alibaba, released Kimi K2.5 as an open-weight model earlier this year. Cursor used it as the base for Composer 2, added reinforcement learning on top, and shipped it without attribution.

Cursor's VP of dev education said "only ~1/4 of the compute came from the base." The co-founder called the omission a "miss." That's a polite way of saying they got caught by a developer reading API logs.

This matters beyond the drama. If a $50B company is shipping a top-tier coding assistant on a Chinese open-source foundation without disclosing it, the model provenance question just got serious for enterprise buyers.
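For buyers who want to check provenance themselves, one rough approach is to scan vendor responses for model identifiers that don't match the advertised name. The sketch below is only an illustration: the response shape, the "model" key, and the "composer-2" label are all made up for the example and are not Cursor's actual API. Only the identifier string comes from the story above.

```python
from typing import Any, Iterator


def find_model_ids(payload: Any, key: str = "model") -> Iterator[str]:
    """Yield every string stored under `key` anywhere in a nested JSON-like value."""
    if isinstance(payload, dict):
        for name, value in payload.items():
            if name == key and isinstance(value, str):
                yield value
            else:
                # Keep walking into nested objects and arrays.
                yield from find_model_ids(value, key)
    elif isinstance(payload, list):
        for item in payload:
            yield from find_model_ids(item, key)


# Hypothetical response shape, invented for illustration only.
response = {
    "id": "resp-1",
    "choices": [{"text": "..."}],
    "meta": {"model": "kimi-k2p5-rl-0317"},
}

advertised = "composer-2"  # hypothetical label for the advertised product
unexpected = [m for m in find_model_ids(response) if m != advertised]
print(unexpected)
```

A check like this only catches identifiers a vendor happens to leave in the payload, so it supplements a disclosure requirement in the contract rather than replacing one.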

Gemini Now Wants Your ChatGPT and Claude Memories

Google shipped an import tool on March 26 that lets you pull chat history and memories from ChatGPT or Claude straight into Gemini. Upload a ZIP or paste a summarization prompt, and Gemini absorbs your context.

The biggest switching cost between AI assistants isn't features. It's losing months of accumulated personalization. Google just removed that friction with one tool.

Anthropic built something similar for Claude about three weeks earlier. The memory portability race is on. For users that's good. For anyone betting on lock-in as a moat, it's a problem.

Gemini 3.1 Flash-Lite Is $0.25 per Million Tokens

Google also recently released Gemini 3.1 Flash-Lite, with 2.5x faster response times and a price of $0.25 per million input tokens. For apps making lots of AI calls, that changes the infrastructure math.
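To make that math concrete, here is a back-of-the-envelope sketch. The $0.25 per million input tokens is the price quoted above. The call volume and tokens per call are assumptions picked for illustration, and output-token pricing is left out because it isn't covered here.

```python
PRICE_PER_MILLION_INPUT_USD = 0.25  # input price quoted in the post


def monthly_input_cost(
    calls_per_day: int,
    tokens_per_call: int,
    price_per_million: float = PRICE_PER_MILLION_INPUT_USD,
    days: int = 30,
) -> float:
    """Estimate monthly spend on input tokens only."""
    total_tokens = calls_per_day * tokens_per_call * days
    return total_tokens / 1_000_000 * price_per_million


# Assumed workload: 100,000 calls a day at 500 input tokens each.
cost = monthly_input_cost(calls_per_day=100_000, tokens_per_call=500)
print(f"${cost:,.2f}")  # prints $375.00
```

That is 1.5 billion input tokens a month for $375, before output tokens. Swap in your own volumes to see where the number lands.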

Sub-dollar-per-million-token pricing is becoming the default for commodity tasks. At Random Llama, we're already routing lighter classification jobs through Flash-class models. Both the latency and the cost make production sense now.
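The routing idea can be sketched in a few lines. This is a simplified illustration, not our production code: the task names, the tier labels, and the 4,000-character cutoff are all hypothetical placeholders, and no real model API is called.

```python
from dataclasses import dataclass

# Hypothetical set of task types considered "light" enough for a cheap tier.
LIGHT_TASKS = {"classification", "tagging", "language_detection"}


@dataclass(frozen=True)
class Job:
    task: str
    prompt: str


def pick_tier(job: Job, max_light_chars: int = 4000) -> str:
    """Return a tier label: short, light tasks go to the cheap, fast tier."""
    if job.task in LIGHT_TASKS and len(job.prompt) <= max_light_chars:
        return "flash-class"
    return "larger-model"


print(pick_tier(Job("classification", "Is this email spam?")))  # flash-class
print(pick_tier(Job("code_review", "Review this diff")))        # larger-model
```

Keeping the decision in one small function means the tier names map to whatever model is cheapest this quarter, which matters when the model layer keeps shifting.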

The Cursor story and the Gemini memory importer are part of the same shift. The model layer is commoditizing fast, open-source foundations are everywhere, and the battle is moving to trust and data portability. Build on the product layer. Model dependencies are not a moat.
