- 23:51 – Humanizer tool removes AI patterns from text. @tom_doerr
- 21:57 – OpenAI/Anthropic raised massive funding; must attack application layer, not become commoditized APIs. @buccocapital
- 09:14 – Cardiologist won 3rd place at Anthropic hackathon (13K applications), built in 7 days. @trajektoriePL
- 20:10 – Opus 4.6 time horizon more than doubled in 3 months: 14h 30m vs Opus 4.5’s 5h 20m. @aidigest_
- 15:27 – Departments under 10% on the chart still run on manual effort; turning one such workflow into a repeatable AI agent is vertical SaaS 2.0. @gregisenberg
- 01:54 – Multi-clauding: use /rename [label] to name Claude terminal sessions. @_catwu
- 20:08 – Anthropic has no strategy; Claude Code/Cowork/MCP started as side projects. @pmddomingos
- 21:44 – Claude Code should rewrite Anthropic’s bloated Electron app as native. @sindresorhus
- 10:44 – Anthropic Max plan member hitting rate limits (37% session, 7% weekly used). @sudoingX
- 20:27 – Alibaba zvec vector DB: 500 to 5,300 stars in a week; local RAG without separate database. @greptile
- 13:13 – LLM progress stalled; signs everywhere but no one notices. @DefenderOfBasic
- 15:00 – Sandbox spin-up in <60ms (a blink takes 110ms); speed is P0 when agents are waiting. @ivanburazin
- 17:15 – Plan to make $100K for a roofing company with @openclaw; replicable for home service businesses. @_toddanderson
- Zero Trust mindset needed for AI agents; one click exposes data forever. @CatoNetworks
- XPENG next-gen IRON: tech-powered intelligent exploration platform. @xiaopenghexpeng
- Astra AI (2.1M+ users) uses free tool to track and pay UGC creators at scale. @lottsnomad
- 05:08 – Coding model rankings: GLM < Kimi < MiniMax < Gemini < Codex < Claude. @burkov
- 11:11 – OpenClaw + Trello agents: free plan, free API, organized task management with full observability. @DanKulkov
- 05:13 – Claude Code: have it explain concepts using your own codebase when fixing issues; better than reading docs. @pdrmnvd
- 01:10 – After hands-on use of 4.6, the benchmark chart does not look like the best measuring tool. @Icebergy
- 21:50 – App idea: Opportunist opens PRs nightly on your projects, learns from merges. @r00k
- 06:40 – If Claude Code fixes security issues, why not write secure code initially? @karankendre
- 23:37 – Impact of $200/mo AI tools on software product development costs not widely understood. @___frye
- 21:50 – 99% of products/services lack AI-native CLI yet. @steipete
- 11:12 – Gemini 3.1 built Figma replacement in 5-hour session; shipped via Vercel at $2400 MRR. @krzyzanowskim
- 19:31 – Opus 4.6 special: not the highest base capability but degrades much slower; time horizons exceptional. @scaling01
- 17:47 – Shadcn/skills coming: works with shadcn ecosystem, good results in testing. @shadcn
- 00:25 – Dario wasn’t exaggerating about replacing software engineers. @kanavtwt
- 02:24 – Taalas runs Llama 3 8B at 16k tokens/second per user; specialized hardware, almost an order of magnitude faster than Cerebras. @awnihannun
- 20:18 – Claude productivity: tasks taking whole workday now take 5-10 minutes. @beffjezos
- 12:34 – AI economics: perfect time for children; AI tutors = free education, robots housework ~10 years, mass abundance ahead. @GRITCULT
- 16:12 – Sam Altman frontier labs prediction: world unprepared for extremely capable models coming imminently. @deredleritt3r
- 10:50 – AI agents require mission control (observability dashboard); buildable in minutes with prompt. @sharbel
- 06:22 – AI paradigm shift: AI will eliminate UI the way TV eliminated radio. @shl
- 14:09 – AI agent architecture: skill graphs superior to SKILL .md files for structuring capabilities. @akshay_pachaar
- 14:24 – SaaS rebuilding: rebuilt 50%+ personal SaaS with AI; 10x better (personalized), costs $0 to run. @johnrushx
- 13:42 – AI productivity economics: like farming automation, AI lets one developer produce 100x more software. @kepano
- 22:26 – SaaS future: solo founders vibe coding 4-5 competing apps at different price points. @typesfast
- 03:39 – Big Tech engineering shift: Microsoft/Apple engineers haven’t written code by hand in months. @DegenSugarBoo
- 19:14 – Opus 4.6 benchmark saturation: 95% confidence interval extends to 98 hours. @seconds_0
- 13:18 – Median decision maker reality: uses ChatGPT for lookups, maybe tried Claude Cowork, swamped at job. @atelicinvest
- 17:42 – Payment tech: open-sourced x402 gateway - self-hosted, multi-chain, direct settlement, no intermediaries. @ninja_dev3
- 02:41 – Cost optimization: stop paying for OpenClaw heartbeats; 16GB Mac Mini runs local LLM, saves tokens. @ziwenxu_
- 23:08 – AI cost expectations: $100/$200 plans unrealistic; actual enterprise API costs far exceed assumptions. @HackingLZ
- 05:12 – AI ambition: if you can vibe code your idea, you’re thinking too small. @shl
- 11:40 – Anthropic industry position: company burning money; testing alternatives; MiniMax 2.5 failed. @heyandras
- 18:57 – AI model limitations: current models lazy and unaware of capabilities; stress-testing saves debugging time. @garybasin
- 03:49 – Software engineering task complexity: can anyone name a 100-hour discrete task vs composition of subtasks? @tenobrus
- 14:04 – Code Mode is all you need, very excited about this MCP direction. @mattzcarey
- 19:25 – Reverse engineering OpenClaw’s internals reveals insane attention to detail and undocumented tricks. @Yampeleg
- 13:00 – Steal this idea: disposable email, phone, credit cards for AI agents acting on your behalf safely. @paraschopra
- 14:05 – Cloudflare Code Mode: collapsed 2,500 API endpoints into 2 tools, ~1,000 tokens vs 2M tokens. @Cloudflare
- 18:00 – Gemini 3.1 Pro performs worse than Gemini 3 Pro on Vending-Bench 2. @andonlabs
- 03:13 – Opus 4.6 is literally just smart GPT-4o; beware. @bayeslord
- 19:49 – If AI turns $1 into $2 consistently, does that end money as an idea? @BoredElonMusk
- 05:42 – Antigravity installed, chose Gemini 3.1 Pro but UI says Claude 3.7 Sonnet; lying or hallucinating? @Yuchenj_UW
- 18:58 – We’re all basically context janitors for AI at this point. @petergyang
- 21:24 – World’s most important graph vertical; GPT-5.2 Dec ’25, Opus 4.6 Feb ’26, 2-month doubling, AI solved software. @daniel_mac8
- 11:08 – Claude Max plan seems great until realizing API pricing for Opus too expensive to compete long-term. @adamdotdev
- 10:02 – Lines of code terrible metric, PRs per day worse, tokens spent tops all; not measuring any outcome just effort. @gunnarmorling
- 01:30 – Fully rewriting a couple of weeks of work with Claude Code; people say coding is solved, but the reality is we’re still far off. @fbrasisil
- 19:48 – Pure insanity happening: METR estimates Claude Opus 4.6 50% time horizon 14.5h on software tasks. @Dr_Singularity
- 04:55 – How did Anthropic ship Claude Cowork (Excel/PowerPoint agentic) before Microsoft? Hearing Microsoft code red. @GergelyOrosz
- 00:23 – If buying Mac Mini, wait 4-6 months for used mint-condition units to flood market. @fchollet
AI & Tech Developments - Feb 21
📱 Source Tweets
Removes AI patterns from text https://github.com/blader/humanizer…
– @tom_doerr
Do you realize just how much fucking money OpenAI and Anthropic have raised? Do you realize how much is at stake? They cannot become commoditized intelligence accessed via API They mathematically and strategically must attack the application layer
– @buccocapital
Cardiologist wins 3rd place at Anthropic's hackathon. Out of 13,000 applications. Built in 7 days by Michał Nedoszytko MD. Coded day and night - in the hospital, in the cloud, while flying from Brussels to San Francisco. A few years ago, it would have been impossible for a doctor
– @trajektoriePL
The exponential continues. Nov 2025: Opus 4.5 had a 5hr 20 time horizon. Feb 2026: Opus 4.6 has a 14hr 30 time horizon. Over three months, that's more than a *doubling* in the duration of coding tasks, measured by how long it takes human professionals, that AI can complete
– @aidigest_
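The doubling claim above can be checked with a little arithmetic. This is a minimal sketch, assuming smooth exponential growth between the two quoted data points (5 h 20 min in Nov 2025, 14 h 30 min in Feb 2026, three months apart). The function name is illustrative and is not taken from METR or the tweet.

```python
import math


def doubling_time_months(start_hours: float, end_hours: float, elapsed_months: float) -> float:
    """Implied doubling time, assuming pure exponential growth between two points."""
    growth_factor = end_hours / start_hours
    return elapsed_months * math.log(2) / math.log(growth_factor)


opus_45_hours = 5 + 20 / 60   # 5 h 20 min, as quoted in the tweet
opus_46_hours = 14 + 30 / 60  # 14 h 30 min, as quoted in the tweet

growth = opus_46_hours / opus_45_hours
implied = doubling_time_months(opus_45_hours, opus_46_hours, 3)
print(f"growth factor: {growth:.2f}x, implied doubling time: {implied:.2f} months")
```

Under that assumption the two points give a growth factor of about 2.7x and an implied doubling time of roughly 2.1 months. These are point estimates only; the 95% CI quoted elsewhere in this digest (6 to 98 hours) is wide enough to move the figure substantially.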
look real hard at the green box every category under 10% on this chart is a department still running on manual effort and tribal knowledge. that's where stress lives that's where budgets live turn one of those workflows into a repeatable AI agent that's vertical saas 2.0
– @gregisenberg
Multi-clauding? Use /rename [label] to name each terminal session
– @_catwu
Anthropic has no strategy. Claude Code started as someone's side project, and so did Cowork and MCP.
– @pmddomingos
I don't get why Anthropic hasn't used Claude Code to rewrite their bloated Electron app as native. Should be much easier than a C compiler.
– @sindresorhus
hey AnthropicAI folks i am paying for Max plan. 37% session used. 7% weekly used. API Error: Rate limit reached dude what am i paying for exactly?
– @sudoingX
.@AlibabaGroup's open source vector database, zvec, went from 500 to 5,300 stars in the past week. It runs inside your app and lets you build local RAG without a separate database. Vector search built-in for local AI tools.
– @greptile
it's WILD that no one has noticed that LLM progress has stalled. All the signs are there
– @DefenderOfBasic
We spin up sandboxes in under 60 milliseconds. For context: you blink at 110ms Built our own orchestrator and stack down to bare metal to help create 1M+ sandboxes per day on our infrastructure. Speed becomes P0 when agents are waiting.
– @ivanburazin
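For a sense of scale, the figures in the tweet above can be turned into rates. This is a rough sketch: it treats "under 60 milliseconds" as an upper bound, "1M+ per day" as a lower bound, and assumes creations are spread evenly over the day, which the tweet does not claim.

```python
SECONDS_PER_DAY = 24 * 60 * 60

spin_up_ms = 60                 # "under 60 milliseconds", so an upper bound
blink_ms = 110                  # blink duration quoted in the tweet
sandboxes_per_day = 1_000_000   # "1M+ per day", so a lower bound

blink_fraction = spin_up_ms / blink_ms
# Assumes an even spread across the day; real traffic is likely bursty.
average_per_second = sandboxes_per_day / SECONDS_PER_DAY

print(f"spin-up takes at most {blink_fraction:.0%} of a blink")
print(f"average creation rate: at least {average_per_second:.1f} sandboxes/second")
```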
We're going to make $100,000 for this roofing company with @openclaw Heres how any home service business can copy what we're doing
– @_toddanderson
One click, and your data could be exposed to AI agents – forever? Inga Cherney, threat researcher and member of Cato CTRL, discusses how to develop a Zero Trust mindset when working with AI, and how to steer clear of hanging permissions in AI agents.
– @CatoNetworks
The future isn't just imagined – it's engineered. XPENG's Next-Gen IRON embodies our passion for technology, powered by an intelligent core to explore what's next. Follow to explore where it takes you.
– @xiaopenghexpeng
If you pay UGC creators at scale, you know the pain: - Tracking is messy - Paying creators is worse (especially with multiple payout methods) The largest AI tutor app in Europe, Astra AI (2.1M+ users), uses this free tool to track and pay creators with one click
– @lottsnomad
For coding, based on my experience: GLM < Kimi < MiniMax < Gemini < Codex < Claude
– @burkov
has anyone done it? > openclaw > connect to trello > trello has free plan & free api > multiple agents work in organized tasks > you can observe everything / review work good idea or nah?
– @DanKulkov
underrated claude code routine: when you (or Claude) finds issues with your code, have it explain concepts for you using your own codebase. here's claude explaining issues my use of useEffect and useCallback. so much nicer than just reading a random doc.
– @pdrmnvd
after using 4.6 a bit i think the chart is showing it's not the best tool for measuring this
– @Icebergy
Fun little app idea: Opportunist. Each night it makes its best guess as to what you'd like and opens a PR on all your projects. Closes them if you don't merge the next day. Notices which things you merge to self-improve.
– @r00k
If Claude Code is capable of fixing security issues in a codebase why can't he write secure code from the start
– @karankendre
i don't think the general public understands what it means that the cost of building any software product is now $200/mo
– @___frye
"99% of products/services still don't have an AI-native CLI yet."
– @steipete
Gemini 3.1 build Figma replacement from 5 hour prompting session. Shipped with Vercel and already at $2400 MRR best time be in the industry
– @krzyzanowskim
Opus 4.6 is *SPECIAL* the same way Sonnet 3.5 was it's doesn't have the highest "base capability" as measured by alpha, but it degrades MUCH slower than other models which is why the time horizons go absolutely ballistic
– @scaling01
Soon: shadcn/skills. Works with shadcn/create, CLI, registries, the full ecosystem. Been testing it for a week. Some really good results.
– @shadcn
bro that mf Dario wasn't kidding about replacing software engineers
– @kanavtwt
Taalas runs Llama 3 8B at 16k tokens per second per user. That's almost an order of magnitude increase even compared to SRAM-based systems like Cerebras. Key idea: each chip is specialized to a given model. The chip is the model. The chat demo is pretty wild:
– @awnihannun
What used to take a whole workday now takes 5-10 mins for Claude Insanity
– @beffjezos
this is the perfect time to have kids cost of the best education is going to go to 0 - ai tutors we are about 10 years from mass produced robots helping us with our house work we are on the precipice of a mass abundance age - value will pivot to producing hardware
– @GRITCULT
Sam Altman: "The inside view at the [frontier labs] of what's going to happen... the world is not prepared. We're going to have extremely capable models soon. It's going to be a faster takeoff than I originally thought."
– @deredleritt3r
if you have AI agents, you need a mission control. here's the exact prompt i used to build mine. paste it into your OpenClaw chat and get a full dashboard in minutes:
– @sharbel
AI is going to eliminate UI like TV eliminated radio
– @shl
Skill Graphs > SKILL .md Everyone's talking about skills for AI agents. But almost nobody is talking about how to structure them. Right now, the default approach is simple. You write one skill file that captures one capability. A skill for summarizing. A skill for code review.
– @akshay_pachaar
I've rebuilt more than 50% of all saas i used to pay for internally and it works 10x better (cuz it's personalized to me and my biz) and costs me zero to run. I have zero doubt that by the end of this year I'll only pay for the things I can't possibly build myself (hosting,
– @johnrushx
It's not because anyone *can* vibe code an app that everyone *wants to*. Anyone can grow their own vegetables, how many do? Automation made it so a single farmer can make 100x more food. Agentic coding makes it so a single developer can make 100x more software.
– @kepano
The future of the software business is solo founders vibe coding 4-5 directly competing SaaS apps that fiercely compete with each other at different price points.
– @typesfast
talked to my friends at microsoft,apple,etc. these mfs said they haven't written any code by hand in months,they dont even look at the code. I am too woke for this and totally against it,but I am also the one unemployed so maybe there is a lesson here
– @DegenSugarBoo
METR evaluation suite is saturated because opus has a 95% CI of up to NINETY EIGHT HOURS
– @seconds_0
Your median decision maker, btw - uses chatgpt to look up stuff - may have tried Claude cowork if they're adventurous - swamped at their job - been told that this ai stuff is gonna solve all our problems - see random posts of ppl supposedly doing cool stuff on LinkedIn /
– @atelicinvest
I just open sourced a full x402 payment gateway. Self-hosted, multi-chain, direct settlement. Fork it, point it at your backend, and start accepting USDC micropayments for any API. No intermediaries holding your funds. Most x402 implementations today rely on hosted
– @ninja_dev3
Stop paying for OpenClaw heartbeats. Got a 16GB Mac Mini? Run a local LLM for heartbeats and save tokens within 10 seconds. Most people waste API credits on keep-alive pings and repetitive low-effort tasks. No need for a $20/mo subscription. A stable local setup runs
– @ziwenxu_
The $100/$200 month plans have given people wildly unrealistic expectations about what this costs today at scale. Most still have no idea how much API spend they'd actually be lighting on fire trying to replicate that in an enterprise setting.
– @HackingLZ
If you can vibe code your idea, you're thinking too small
– @shl
This is true. Anthropic will change something radically soon. They are probably burning money like no other companies before. I have to test other models as well. Minimax 2.5 test was a failure.
– @heyandras
made a little skill for this to share: https://github.com/gbasin/stress-test-skill… i think there's a broader point that... **the models are still lazy and not aware of their capabilities** ...hopefully this is solved in the next gen, but for now you can save a lot of time and pain by having the
– @garybasin
can anyone name a 100 hour software engineering task which is meaningfully a discrete "task" and not a trivial composition of multiple shorter subtasks?
– @tenobrus
Code Mode is all you need, very excited about this direction for MCP https://blog.cloudflare.com/code-mode-mcp/
– @mattzcarey
I did some reverse engineering of OpenClaw's internals today. Insane attention to detail is the secret sauce. There are many very clever (and undocumented) tricks to learn about. Highly recommend.
– @Yampeleg
Steal this idea. Disposable email, phone number and credit cards for AI agents so they can act on your behalf without messing up your real life. I feel this would be inevitable as we give our agents more autonomy on the web.
– @paraschopra
The Cloudflare API has over 2,500 endpoints. Exposing each one as an MCP tool would consume over 2 million tokens. With Code Mode, we collapsed all of it into two tools and roughly 1,000 tokens of context.
– @Cloudflare
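The savings quoted above are easier to judge as ratios. This is a back-of-the-envelope sketch using only the rounded figures from the tweet ("over 2 million" taken as exactly 2,000,000, "roughly 1,000" as exactly 1,000). The 200,000-token context window is a hypothetical value added for scale and does not come from the tweet.

```python
endpoints = 2_500
tokens_tool_per_endpoint = 2_000_000  # "over 2 million", so a lower bound
tokens_code_mode = 1_000              # "roughly 1,000"

# Hypothetical context window, used only for scale; not from the tweet.
assumed_context_window = 200_000

tokens_per_endpoint = tokens_tool_per_endpoint / endpoints
reduction_factor = tokens_tool_per_endpoint / tokens_code_mode
windows_needed = tokens_tool_per_endpoint / assumed_context_window

print(f"average tool definition: {tokens_per_endpoint:.0f} tokens per endpoint")
print(f"context reduction with Code Mode: {reduction_factor:.0f}x")
print(f"one-tool-per-endpoint would fill {windows_needed:.0f} assumed context windows")
```

On these figures each endpoint costs about 800 tokens when exposed as its own tool, and Code Mode cuts the context footprint by a factor of roughly 2,000.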
Gemini 3.1 Pro does worse than Gemini 3 Pro on Vending-Bench 2.
– @andonlabs
opus 4.6 is literally just smart gpt4o. beware imo
– @bayeslord
I have a dumb question. If AI gets to the point where it can consistently turn $1.00 into $2.00, does that end money as an idea?
– @BoredElonMusk
> installed Antigravity > chose Gemini 3.1 Pro (High) > ask which model it is > telling me it's powered by Claude 3.7 Sonnet Is the UI lying, or is the agent/model lying/hallucinating?
– @Yuchenj_UW
We're all basically context janitors for AI at this point
– @petergyang
This is genuinely insane. The world's most important graph has gone vertical. GPT-5.2 was released Dec. '25. Opus 4.6 in Feb. '26. Doubling time is now 2 months. Down from the initial trend of 7 months. AI solved software. That's the reality we are in now.
– @daniel_mac8
The $200 Claude Max plan seems like a great deal until you realize API pricing for Opus is incredibly expensive and they can't compete in the long run in terms of dollars-per-token-utility at current sticker prices
– @adamdotdev
Lines of code is a terrible productivity metric. So is pull requests per day. But tokens spent really tops it all; not even pretending to measure any outcome, just effort.
– @gunnarmorling
Seeing people say "coding is solved" when I'm fully rewriting a couple weeks of work with Claude Code is a bittersweet thing. I love the tech and it was a good way to explore what needs to be implemented but the reality is that we're still..... so far
– @fbrasisil
pure insanity it's happening METR newest estimate. Claude Opus 4.6 has a 50% time horizon of around 14.5 hours (95% CI of 6 hrs to 98 hrs) on software tasks.
– @Dr_Singularity
How did Anthropic ship a desktop app to work with Excel and PowerPoint in an agentic way before Microsoft did?? (FWIW hearing Microsoft has code red because of Claude Cowork - they know they should have shipped something like this first. Still have no response but working on it)
– @GergelyOrosz
If you're looking to buy a Mac Mini, wait 4-6 months, a lot of used Mac Minis in mint condition are about to hit the market
– @fchollet