Cursor vs Claude Code

Theo - t3․gg · 21 Claims

Neutral

Claude Code was built to work in the terminal, meeting developers where they already work rather than forcing a new IDE or workflow.

The author explains Claude Code's design philosophy of terminal-first approach as a deliberate choice to lower adoption friction.

Source: Claude Code vs Codex vs Cursor (an honest comparison)

Disagree

Claude Code is as much a marketing tool as it is a developer tool, with many features optimized for Twitter screenshots rather than practical productivity.

The author states features like pet mode and sub-agent displays are designed to look impressive in screenshots, and that Anthropic optimizes for 'the feeling of productivity.'

Source: Claude Code vs Codex vs Cursor (an honest comparison)

Disagree

Claude Code's strategy is to burn more tokens to solve problems rather than finding token-efficient solutions.

The author argues Anthropic's philosophy is 'if more tokens solve the problem, use more tokens,' citing sub-agent mode as an example that burns tokens aggressively.

Source: Claude Code vs Codex vs Cursor (an honest comparison)

Agree

OpenAI is trying much harder to be token-efficient while maintaining accuracy, using half as many tokens as competitors to get better scores.

The author cites artificial analysis benchmarks showing GPT-55 used half as many tokens to get a better score compared to Anthropic models.

Source: Claude Code vs Codex vs Cursor (an honest comparison)

Disagree

Anthropic actively discourages external integrations with Claude Code and wants users locked into their CLI experience.

The author states Anthropic wants users in their CLI using their features and prevents programmatic API access without charging more money.

Source: Claude Code vs Codex vs Cursor (an honest comparison)

Agree

The Codex app server is open source, enabling developers to build on top of it, which is how the author's T3 Code app was built.

The author says his agentic coding app 'would not exist if it wasn't for the Codex app server, which is part of the open source Codex CLI.'

Source: Claude Code vs Codex vs Cursor (an honest comparison)

Disagree

Cursor has historically not done a great job building integrations and SDKs, with their CLI and Agent Client Protocol being far behind.

The author describes attempting to integrate Cursor into T3 Code and finding the cursor CLI 'super behind' with hardcoded, outdated models.

Source: Claude Code vs Codex vs Cursor (an honest comparison)

Disagree

Anthropic employees use a different internal version of Claude Code with the Mythos model and different system prompts than what users receive.

The author claims the tool Anthropic employees use internally is not the same as the public version, leading to poorly tested features for end users.

Source: Claude Code vs Codex vs Cursor (an honest comparison)

Agree

OpenAI employees use the exact same Codex app, models, and features that end users get.

The author states 'the Codex app that we're using... is the exact same thing that they're using at OpenAI,' citing conversations with friends at the company.

Source: Claude Code vs Codex vs Cursor (an honest comparison)

Agree

Cursor uses the same tools internally that users get and obsessively tests every detail before releasing to the public.

The author states 'They use the shit out of the things that they are doing. They obsessively test every detail before taking the exact thing they're using internally and throwing it at us.'

Source: Claude Code vs Codex vs Cursor (an honest comparison)

Disagree

The Claude Code desktop app is heavily underinvested in and clearly not used by Anthropic employees themselves.

The author demonstrates login failures, lack of thread sync, and poor UX in the desktop app, concluding 'they straight up do not use the desktop app.'

Source: Claude Code vs Codex vs Cursor (an honest comparison)

Disagree

Anthropic built the Claude Code desktop app only because people kept asking for it, not because they intend for people to use it as their primary interface.

The author states 'They didn't build this app for you to use it. They built this app so that it would exist if people stop asking for it.'

Source: Claude Code vs Codex vs Cursor (an honest comparison)

Agree

Codex (OpenAI) takes a minimal, practical approach focused on getting work done rather than feeling exciting or flashy.

The author contrasts Codex's minimal UI with Claude Code's flashy animations, noting Codex 'isn't trying to feel like a slot machine. It's just trying to get work done.'

Source: Claude Code vs Codex vs Cursor (an honest comparison)

Agree

Codex uses computer use to verify changes by having the model look at what it changed, which is more efficient than spinning up multiple agents to check code.

The author argues that computer use verification ends up being 'a lot less work than spinning up 15 agents to go triple check every line of code.'

Source: Claude Code vs Codex vs Cursor (an honest comparison)

Disagree

Anthropic's public models haven't improved since December, with Opus 46 and 47 being regressions rather than improvements.

The author states 'I still feel like 46 and especially 47 are more so regressions than they are improvements' and that 'Opus got worse twice.'

Source: Claude Code vs Codex vs Cursor (an honest comparison)

Agree

OpenAI models improved significantly from 52 to 55 with massive jumps in capability, while Anthropic's models stagnated.

The author states '52 to 53 to 54 to 55 were massive improvements' and contrasts this with Anthropic where 'Opus got worse twice and OpenAI models got better three times.'

Source: Claude Code vs Codex vs Cursor (an honest comparison)

Disagree

Anthropic compensates for model stagnation by adding many flashy features to Claude Code to create the appearance of progress.

The author says 'Anthropic has to make up for this somehow... by doing a shitload of cool looking features and trying to make moments happen.'

Source: Claude Code vs Codex vs Cursor (an honest comparison)

Agree

Cursor's cloud environment is far ahead of competitors, spinning up full graphical Linux instances for testing changes with computer use.

The author describes Cursor's cloud sandbox as spinning up 'a full graphical interface Linux instance' that can run the full app and use computer use to test changes.

Source: Claude Code vs Codex vs Cursor (an honest comparison)

Agree

Cursor enables Slack-based agent triggering where users can ask a bot to fix problems and receive video proof of the fix in the thread.

The author demonstrates that from Slack, someone can ask the cursor bot to 'spin up an agent to fix this' and it 'responds in the thread with a video of the fix.'

Source: Claude Code vs Codex vs Cursor (an honest comparison)

Neutral

Claude Code is best suited for unmotivated or less experienced developers who want to feel productive, while Codex is better for experienced engineers who want a tool that gets out of the way.

The author summarizes his recommendation: 'Cloud code for unmotivated devs or bad devs that want to feel like they're productive. Codex for skeptical devs that want to use these tools in a way that is productive.'

Source: Claude Code vs Codex vs Cursor (an honest comparison)

Agree

Cursor is the most enterprise-ready end-to-end solution, especially for teams with non-technical members who need to trigger AI agents from Slack.

The author recommends Cursor cloud as 'one of the best solutions to set up for your team so nontechnical people can kick things off in Slack.'

Source: Claude Code vs Codex vs Cursor (an honest comparison)