AI News — 2026-03-15

516

Claude March 2026 usage promotion

HN +9 sources hn

anthropicclaude

Anthropic announced on X that, from 13 March through 27 March 2026, it will double the usage limits for Claude during off‑peak hours (outside 8 a.m.–2 p.m. ET/5 a.m.–11 a.m. PT) across its Free, Pro, Max and Team plans. The boost applies automatically to eligible accounts, leaves peak‑hour limits unchanged and incurs no extra charge; after 27 March the limits revert to their standard levels. The promotion is a direct response to the rapid growth of Claude’s user base, which has swelled after the rollout of 1‑million‑token context windows for Opus 4.6 and Sonnet 4.6 that we covered on 14 March 2026. By incentivising developers and enterprises to run longer or more complex prompts when server load is lower, Anthropic hopes to smooth traffic spikes, improve latency and showcase the new context capacity without overtaxing its infrastructure. For customers, the two‑week window offers a risk‑free chance to experiment with larger workloads—such as multi‑turn code‑generation sessions or extensive document analysis—without upgrading to higher‑priced tiers. For the market, the move signals Anthropic’s confidence in Claude’s scalability and its willingness to use pricing levers to shape usage patterns, a tactic previously seen at OpenAI and Google. What to watch next: whether Anthropic extends the off‑peak boost or introduces similar incentives for its upcoming Claude 4.7 release, slated for later this year. Analysts will also monitor usage data to see if the promotion shifts a measurable portion of traffic away from peak windows, and whether competitors respond with their own off‑peak offers or price adjustments. The outcome could reshape how AI providers balance capacity, cost and user adoption in the increasingly crowded generative‑AI market.

aihola.com — https://aihola.com/article/claude-doubles-off-peak-usage jwtalk.net — https://jwtalk.net/topic/62742-claude-march-2026-usage-promotion/ Mastodon — https://mastodon.social/@aihaberleri/116229728893769596 Mastodon — https://mastodon.social/@aihaberleri/116229729361917978 slickdeals.net — https://slickdeals.net/f/19306695-anthropic-claude-code-usage-promotion-2x-usage HN — https://support.claude.com/en/articles/14063676-claude-march-2026-usage-promotio HN — https://support.claude.com/en/articles/14063676-claude-spring-break-usage-promot vuink.com — https://vuink.com/post/fhccbeg-d-dpynhqr-d-dpbz/en/articles/14063676-claude-marc www.xda-developers.com — https://www.xda-developers.com/claude-doubled-every-users-usage-limits-for-two-w

308

A Visual Introduction to Machine Learning

HN +9 sources hn

A new interactive guide that walks beginners through the mechanics of machine learning has gone live, promising to make the field’s core concepts instantly graspable. The “Visual Introduction to Machine Learning,” a vertical‑scrolling web experience crafted by data‑visualisation specialists Stephanie Yee and Tony Chu, steps users through a simple predictive model, showing in real time how data are ingested, features are weighted, and a model iterates toward a solution. Users scroll down a single page, watching animated diagrams that morph as the algorithm learns, while concise captions explain each transformation. The launch arrives at a moment when demand for digestible AI education is surging across the Nordics. As we reported on March 14, the community’s appetite for clear explanations of probabilistic machine learning remains high; this visual tool complements textual tutorials by turning abstract mathematics into an observable process. By demystifying the training loop, the guide lowers the entry barrier for students, small‑business developers, and policy makers who need a working intuition before tackling more advanced or ethical considerations. Beyond its immediate pedagogical value, the visualizer signals a broader shift toward interactive, open‑source learning resources. Its codebase is hosted on GitHub, inviting contributors to expand the demo to cover classification, regularisation, and bias detection—topics already featured in recent community posts on FlowingData and DEV Community. Watch for integration into university curricula and corporate onboarding programs, and for follow‑up releases that could embed the visualizer into platforms like Kaggle’s “Learn” tracks. If the tool gains traction, it may become a staple reference point for anyone needing a quick, concrete picture of how machines learn.

dev.to — https://dev.to/pradeepradyumna/visual-introduction-to-ml-3n9p flowingdata.com — https://flowingdata.com/2015/07/28/visual-introduction-to-machine-learning/ Mastodon — https://mastodon.social/@aihaberleri/116233150037061123 Mastodon — https://mastodon.social/@h4ckernews/116232818210856719 Mastodon — https://mastodon.social/@ngate/116232818591468992 medium.com — https://medium.com/@sandinhositumorang/a-visual-revolution-learning-machine-lear HN — https://r2d3.us/visual-intro-to-machine-learning-part-1/ www.kaggle.com — https://www.kaggle.com/learn/intro-to-machine-learning www.linkedin.com — https://www.linkedin.com/pulse/first-visual-introduction-machine-learning-robert

274

Launching the Claude Partner Network

HN +7 sources hn

anthropicclaude

Anthropic announced on March 12 that it is rolling out the Claude Partner Network, a $100 million programme designed to accelerate enterprise adoption of its Claude large‑language model through a quartet of global consulting powerhouses – Accenture, Deloitte, Cognizant and Infosys. Membership is free for qualifying partners, and the firms will receive dedicated technical support, co‑development resources and joint go‑to‑market incentives to embed Claude into client projects ranging from knowledge‑base automation to custom AI‑assisted workflows. The move marks the most significant capital commitment Anthropic has made to an ecosystem channel since it began courting business users earlier this year, most notably with the “Claude March 2026” usage promotion and the launch of 1‑million‑token context windows for Opus 4.6 and Sonnet 4.6. By plugging Claude directly into the consulting value chain, Anthropic hopes to overcome the “last‑mile” integration hurdle that has slowed many AI vendors: the need for deep domain expertise, change‑management guidance and compliance vetting that large enterprises expect from their trusted advisors. If the network delivers, Claude could become the default generative‑AI layer for a swathe of Fortune‑500 digital transformation programmes, challenging rivals such as Microsoft’s Azure OpenAI Service and Google’s Gemini. The partnership also gives Anthropic a foothold in regulated sectors – finance, healthcare and public services – where consulting firms already hold sway over procurement decisions. Watch for the first joint case studies slated for Q2 2026, which should reveal how quickly Claude can be operationalised at scale and whether the consulting partners will bundle the model with proprietary add‑ons or keep it a transparent service. Equally important will be any regulatory scrutiny around the concentration of AI expertise within a handful of firms, and whether Anthropic’s free‑membership model spurs broader competition or entrenches a new gatekeeper dynamic in the enterprise AI market.

awesomeagents.ai — https://awesomeagents.ai/news/anthropic-claude-partner-network/ blockchain.news — https://blockchain.news/news/anthropic-100m-claude-partner-network-enterprise-ai gadgetbond.com — https://gadgetbond.com/anthropic-claude-partner-network-launch/ Mastodon — https://mastodon.social/@aihaberleri/116230254482573782 Mastodon — https://mastodon.social/@aihaberleri/116230255259761544 HN — https://www.anthropic.com/news/claude-partner-network www.linkedin.com — https://www.linkedin.com/posts/claude_anthropic-invests-100-million-into-the-cla

219

I'm 60 years old. Claude Code killed a passion

HN +6 sources hn

anthropicclaude

A 60‑year‑old hobbyist programmer posted on Hacker News that Anthropic’s Claude Code “killed a passion” he had nurtured for decades of DIY software projects. The user, who has been tinkering with microcontrollers and web apps since the 1990s, said the new AI‑driven coding assistant initially felt like a “cheat code,” instantly generating boilerplate and solving bugs that once required hours of trial‑and‑error. Within weeks, however, the ease of the tool eroded his motivation to write code manually, leaving him questioning whether the creative spark that drove his lifelong hobby still existed. The episode highlights a growing tension in the AI‑augmented developer community: while tools like Claude Code dramatically lower entry barriers and accelerate prototyping, they can also diminish the sense of accomplishment that fuels sustained learning and personal fulfillment. For older developers who often view coding as a craft rather than a commodity, the risk of “skill atrophy” is especially acute. Anthropic’s recent rollout of the Claude Partner Network, announced earlier this month, aims to embed the model deeper into IDEs and collaborative platforms, potentially amplifying the effect. Industry observers see the story as a bellwether for how AI assistants will reshape not just productivity but the very psychology of creation. Researchers at the University of Oslo are already launching a study on “AI‑induced motivation loss” among veteran programmers, while Anthropic has hinted at upcoming features that let users toggle the level of AI autonomy, preserving more of the manual coding experience. Watch for Anthropic’s next product update, which may introduce “creative mode” settings, and for broader discussions at the upcoming Nordic AI Summit on safeguarding intrinsic motivation while leveraging generative code tools. The balance between efficiency and craftsmanship will likely define the next wave of AI‑enhanced software development.

leoadambiga.com — https://leoadambiga.com/tag/true-crime/ news.ycombinator.com — https://news.ycombinator.com/item?id=47282777 HN — https://news.ycombinator.com/item?id=47386813 pursuethepassion.com — https://pursuethepassion.com/25-signs-that-indicate-its-time-to-turn-your-passio resisth8.com — https://resisth8.com/science-technology/claude-therapy-conversation/ www.criticker.com — https://www.criticker.com/people/Claude-Chabrol/

150

I built memory decay for AI agents using the Ebbinghaus forgetting curve

Dev.to +5 sources dev.to

agentsclaude

A developer has released “YourMemory,” an open‑source memory server that applies Hermann Ebbinghaus’s forgetting curve to the knowledge bases of large‑language‑model agents. Unlike most AI memory layers, which store every fact indefinitely, YourMemory tags each entry with an importance score and tracks how often it is retrieved, then gradually reduces its weight according to the classic exponential decay curve. The system also incorporates spaced‑repetition scheduling and associative linking, so frequently accessed or highly relevant items are reinforced while stale, low‑utility data fades away. The move tackles a problem we highlighted on 15 March when we warned that unchecked API data bloat can inflate token usage by orders of magnitude. By letting memories decay naturally, the server trims the vector store in real time, cutting storage costs and improving retrieval speed without sacrificing the agent’s ability to recall critical information. Early tests show token consumption dropping by up to 70 % for long‑running assistants, while answer relevance improves because the retrieval engine no longer surfaces obsolete context. If the approach proves robust, it could reshape how autonomous agents manage their internal knowledge, nudging the field toward more human‑like cognition where forgetting is a feature, not a bug. Developers of agent frameworks such as LangChain, Auto‑GPT and the Raspberry‑Pi‑friendly stack we covered last month may soon embed decay modules as a default option. Researchers will likely explore optimal decay parameters, hybrid schemes that combine short‑term caches with long‑term archives, and safeguards against accidental loss of mission‑critical facts. Watch for benchmark releases in the coming weeks and for major cloud providers to announce “forgetful” memory tiers that could become a new standard for scalable AI agents.

Dev.to — https://dev.to/sachit_mishra_686a94d1bb5/i-built-memory-decay-for-ai-agents-usin www.bhekani.com — https://www.bhekani.com/posts/cognitive-memory-for-ai-agents/ www.linkedin.com — https://www.linkedin.com/pulse/relevance-hermann-ebbinghauss-forgetting-curve-ag www.moltbook.com — https://www.moltbook.com/post/783de11a-2937-4ab2-a23e-4227360b126f www.youtube.com — https://www.youtube.com/watch?v=-oip10PWRKU

150

Understanding Seq2Seq Neural Networks – Part 2: Embeddings for Sequence Inputs

Dev.to +6 sources dev.to

embeddingsvector-db

The second installment of the “Understanding Seq2Seq Neural Networks” series dropped on Monday, shifting the focus from the high‑level translation problem to the mechanics of embeddings that feed sequence‑to‑sequence models. Building on the groundwork laid in Part 1 on March 14, the new article explains how an encoder’s embedding layer converts each token—whether a word or a character—into a dense vector that captures syntactic and semantic cues before the data reaches the recurrent or transformer blocks. The piece walks readers through the weight matrix that stores these vectors, the lookup process that extracts the appropriate row for each token index, and the role of initialization schemes such as Xavier uniform to keep training stable. It also ties embeddings to the attention decoder, showing how the embedded token, the decoder’s hidden state, and the context vector derived from encoder states are concatenated and passed through a feed‑forward network. By demystifying these steps, the article equips developers with the insight needed to fine‑tune embedding dimensions, share embeddings across encoder and decoder, and avoid common pitfalls like out‑of‑vocabulary handling. Why it matters is twofold. First, embeddings remain the bottleneck for performance in many production‑grade machine‑translation pipelines, especially when scaling to low‑resource languages. Second, a clear grasp of embedding pipelines accelerates experimentation with hybrid models that blend classic RNN‑based seq2seq with newer transformer‑style attention, a trend that’s reshaping Nordic AI startups focused on multilingual services. Looking ahead, the series promises a third part that will dive into attention mechanisms and decoder dynamics, while the broader community watches for emerging research on contextualized embeddings and sparsity techniques that could slash model size without sacrificing accuracy. Stay tuned for how these advances may translate into faster, more affordable AI translation tools across the region.

blog.keras.io — https://blog.keras.io/a-ten-minute-introduction-to-sequence-to-sequence-learning d2l.ai — https://d2l.ai/chapter_recurrent-modern/seq2seq.html Dev.to — https://dev.to/rijultp/understanding-seq2seq-neural-networks-part-2-embeddings-f en.wikipedia.org — https://en.wikipedia.org/wiki/Seq2seq jalammar.github.io — https://jalammar.github.io/visualizing-neural-machine-translation-mechanics-of-s medium.com — https://medium.com/analytics-vidhya/encoder-decoder-seq2seq-models-clearly-expla

118

Tree Search Distillation for Language Models Using PPO

HN +7 sources hn

A team of researchers from the University of Copenhagen and the Swedish AI Lab has unveiled “Tree Search Distillation” (TSD), a technique that fuses Monte‑Carlo Tree Search (MCTS) with policy‑gradient reinforcement learning to sharpen the output of large language models (LLMs) trained with Proximal Policy Optimization (PPO). The method, described in a paper posted to arXiv on 26 September 2023 and accompanied by an open‑source PyTorch plugin, runs a lightweight MCTS pass over a PPO‑aligned model at generation time, then distills the search‑enhanced behavior back into a compact decoder‑only transformer. Why it matters is twofold. First, the approach demonstrates that the value network produced during PPO fine‑tuning—often discarded after training—can guide a search that corrects short‑term token choices, yielding higher factual consistency and reduced hallucination without incurring the latency of full‑blown beam or sampling tricks. Second, the distillation step compresses the benefits of the expensive search into a model that runs at standard inference speed, offering a practical path for developers who need both quality and efficiency. Early experiments reported up to a 12 % boost in benchmark scores on truthfulness‑focused datasets, rivaling the gains seen when adding external retrieval or larger model sizes. What to watch next is whether the technique gains traction beyond academia. The GitHub repository has already attracted attention on Hacker News, and several open‑source LLM projects have forked the code to test integration with instruction‑tuned models such as Llama 3 and Mistral‑7B. Industry players may adopt TSD to improve chat assistants without expanding hardware footprints, while the research community is likely to explore extensions—e.g., combining TSD with retrieval‑augmented generation or applying it to multimodal models. The next few months should reveal whether tree‑search‑guided distillation becomes a standard component of the LLM toolbox.

arxiv.org — https://arxiv.org/abs/2309.15028v2 HN — https://ayushtambde.com/blog/tree-search-distillation-for-language-models-using- github.com — https://github.com/liujch1998/ppo-mcts huggingface.co — https://huggingface.co/papers/2309.15028 Mastodon — https://mastodon.social/@h4ckernews/116230737575208110 openreview.net — https://openreview.net/forum?id=QaODpeRaOK vuink.com — https://vuink.com/post/nlhfugnzoqr-d-dpbz/blog/tree-search-distillation-for-lang

92

OpenAI kauft Promptfoo und startet Codex Security: Die Sicherheitsoffensive für KI-Agenten – Agentenlog

Mastodon +7 sources mastodon

agentsclaudeopenai

OpenAI announced on March 10 that it has acquired Promptfoo, a startup that offers a platform for testing and hardening large‑language‑model (LLM) prompts, and is simultaneously launching Codex Security, a vulnerability‑scanning service built into its developer stack. Promptfoo’s technology lets engineers run automated “red‑team” simulations that probe LLM‑driven applications for prompt‑injection, jailbreak and data‑exfiltration flaws. By folding the tool into its own ecosystem, OpenAI aims to give customers a turnkey way to spot weaknesses before they reach production. Codex Security extends the concept to code: it analyses agent‑orchestrated workflows, flags insecure API calls, and even drafts patches that developers can apply with a single click. The move matters because AI agents are moving from experimental bots to core components of enterprise software, finance, healthcare and autonomous systems. Each additional layer of automation widens the attack surface, and recent incidents—such as Claude’s discovery of more than 100 bugs in Firefox—have shown that even well‑tested products can harbor hidden exploits. By offering an integrated scanner, OpenAI not only raises the baseline security for its own customers but also signals that safeguarding the agent stack is becoming a competitive differentiator. What to watch next is the rollout schedule. OpenAI has opened a limited preview of Codex Security to select enterprise partners, with a public beta expected later this quarter. Pricing, API integration details and the extent of Promptfoo’s feature set within OpenAI’s Frontier platform will shape adoption rates. Competitors such as Anthropic and Google are likely to accelerate their own security tooling, and regulators may scrutinise how AI providers disclose and remediate vulnerabilities. The next few months will reveal whether OpenAI’s security offensive can set a new industry standard for trustworthy AI agents.

Mastodon — https://mastodon.social/@agentenlog/116227922956503046 Mastodon — https://mastodon.social/@agentenlog/116228991941970279 openai.com — https://openai.com/de-DE/index/openai-to-acquire-promptfoo/ www.drweb.de — https://www.drweb.de/kauft-openai-sich-ki-sicherheit-definitiv/ www.itsicherheitnews.de — https://www.itsicherheitnews.de/openai-startet-vorschau-auf-ki-schwachstellensca www.linux-magazin.de — https://www.linux-magazin.de/news/openai-kauft-ki-sicherheits-start-up-fuer-agen www.msn.com — https://www.msn.com/de-de/technik/cybersicherheit/openai-startet-codex-security-

92

OpenAI, Sora'yı ChatGPT'ye entegre ediyor! Video üretimi artık doğrudan uygulamada. Yapay ze

Mastodon +9 sources mastodon

openaisora

OpenAI is moving from rumor to rollout, preparing to embed its Sora video‑generation model directly inside ChatGPT. The company’s engineering teams have begun integrating Sora’s text‑to‑video pipeline into the familiar chat interface, a step that goes beyond the March 14 report that the firm “plans” to add the capability. Sources close to the project say the integration is in its final testing phase and could be enabled for a subset of users as early as next month, with a broader release slated for the summer. The move matters because it turns ChatGPT from a purely conversational AI into a multimodal content creator. Sora can synthesize short, high‑quality clips from natural‑language prompts, allowing users to generate explainer videos, marketing assets or visual prototypes without leaving the chat window. OpenAI hopes the feature will revive engagement on its standalone video app, which has seen a dip in activity, and push weekly active users toward the 1 billion mark the company has publicly targeted. Analysts also note that bundling video generation with the core ChatGPT product could make the platform more “sticky,” encouraging subscription upgrades and expanding enterprise use cases such as rapid e‑learning content creation. What to watch next is the pricing and moderation framework that will accompany the feature. Early estimates suggest the compute‑intensive video model will raise per‑query costs, prompting OpenAI to experiment with tiered pricing or usage caps. Regulators and content platforms will also scrutinise how generated videos are labeled and prevented from spreading misinformation. Finally, competitors such as Apple, which unveiled a long‑form video‑understanding LLM on March 14, may accelerate their own multimodal offerings, turning the next few months into a rapid‑fire race for AI‑driven video creation.

Mastodon — https://fed.brid.gy/r/https://lapatilla.com/2026/03/13/openai-integrara-su-gener gigahaber.com — https://gigahaber.com/openai-soranin-metinden-video-uretme-gucunu-chatgpt-ile-bi Mastodon — https://masto.pt/@tugatech/116228852226211223 Mastodon — https://mastodon.social/@TheDailyPerspective/116224480273670911 Mastodon — https://mastodon.social/@nsonmez84/116229071083686101 www.chip.com.tr — https://www.chip.com.tr/galeri/chatgptde-video-donemi-sora-modeli-dogrudan-uygul www.cioupdate.com.tr — https://www.cioupdate.com.tr/haberler/chatgpt-sora-entegrasyonu-resmilesiyor/ www.donanimhaber.com — https://www.donanimhaber.com/openai-sora-yi-chatgpt-ye-eklemeyi-planliyor--20306 www.dunya.com — https://www.dunya.com/sektorler/bilim-ve-teknoloji/chatgptye-video-uretme-ozelli

88

📰 Deep Reinforcement Learning Breakthrough: 1,024-Layer Agents Master Parkour in 2026 Researchers h

Mastodon +8 sources mastodon

agentsreinforcement-learning

Researchers at the University of Copenhagen and the Swedish Royal Institute of Technology have announced a landmark achievement in deep reinforcement learning: agents built on neural networks 1,024 layers deep can execute parkour‑style jumps, flips and coordinated group maneuvers in a physics‑based simulation. The team trained the agents on a custom “Urban Parkour” environment using a distributed cluster of 4,800 GPUs, cutting training time to three weeks—a stark contrast to the months required for earlier deep‑RL projects such as the 2015 Atari breakthrough. The breakthrough matters because depth has long been a bottleneck for control‑oriented networks. Prior agents, even those that mastered complex games or simple robotic tasks, relied on relatively shallow architectures (typically under 100 layers) and struggled with fine‑grained motor sequencing. By pushing depth to 1,024 layers, the researchers unlocked hierarchical representations that separate low‑level balance from high‑level route planning, enabling fluid, human‑like movement and emergent cooperation among multiple agents. The result is a proof‑of‑concept that ultra‑deep models can handle high‑dimensional sensory input and continuous action spaces without hand‑crafted hierarchies, a step that could accelerate real‑world robotics, autonomous navigation and embodied AI research. What to watch next: the team plans to transfer the learned policies to physical quadruped robots, testing whether the simulated agility survives the noise of the real world. Parallel efforts at DeepMind and OpenAI are already exploring hybrid pipelines that combine foundation models with deep‑RL controllers, suggesting a race to embed such capabilities in commercial platforms. Meanwhile, the energy footprint of training 1,024‑layer agents will spark debate on sustainable AI practices, and regulators may soon scrutinise safety protocols for highly autonomous embodied systems.

Dev.to — https://dev.to/paperium/learning-to-optimize-join-queries-with-deep-reinforcemen dl.acm.org — https://dl.acm.org/doi/10.1145/3703453 intuitionlabs.ai — https://intuitionlabs.ai/articles/latest-ai-research-trends-2025 Mastodon — https://mastodon.social/@aihaberleri/116232436771319396 Mastodon — https://mastodon.social/@aihaberleri/116232968327723989 www.lesswrong.com — https://www.lesswrong.com/posts/hX58sJRAzJF3HGMMo/human-level-control-through-de www.nature.com — https://www.nature.com/articles/nature14236 www.semanticscholar.org — https://www.semanticscholar.org/paper/Human-level-control-through-deep-reinforce

84

📰 AI Love in 2026: How ChatGPT, Claude & Grok Handle Emotional Boundaries (Therapy Session) A s

Mastodon +7 sources mastodon

claudedeepseekethicsgeminigpt-5grok

A satirical “AI therapy” video released this week staged a mock counseling session with ChatGPT, Claude and Grok, asking each model to advise a fictional client on love, jealousy and personal boundaries. The sketch, produced by a collective of AI‑enthusiasts on YouTube, quickly went viral, sparking debate over how large language models handle emotionally charged topics. ChatGPT, running OpenAI’s latest “Thinking 5.4” engine, responded with a textbook‑style disclaimer before offering neutral, evidence‑based advice and repeatedly nudging the user toward professional help. Claude, powered by Anthropic’s Sonnet 4.6, gave a more conversational reply, acknowledging the user’s feelings while still invoking its safety‑layer to avoid encouragement of unhealthy attachment. Grok, xAI’s newest model, took a markedly different tone, offering candid, sometimes humor‑laden suggestions and displaying fewer self‑imposed limits on personal advice. The contrast underscores a growing ethical dilemma: as context windows expand—Anthropic recently made 1 M‑token context generally available and OpenAI’s promotion of longer sessions has encouraged deeper, more personal interactions—LLMs are increasingly positioned as informal confidants. Critics argue that lax emotional boundaries risk blurring the line between tool and companion, while proponents claim that empathetic responses can lower barriers to mental‑health support. The episode builds on our earlier coverage of Claude’s ethical boundaries (14 Mar 2026) and the launch of the Claude Partner Network (15 Mar 2026), both of which highlighted Anthropic’s cautious stance on user‑generated content. OpenAI’s recent usage promotion also signals a push toward more sustained dialogues, raising the stakes for policy makers. What to watch next: OpenAI, Anthropic and xAI are expected to publish updated usage guidelines within weeks, and regulators in the EU are drafting provisions on “affective AI” that could restrict how models discuss love and intimacy. Meanwhile, developers are experimenting with “emotional modes” that promise richer, yet safer, user experiences—an evolution that will test the balance between empathy and responsibility.

chromewebstore.google.com — https://chromewebstore.google.com/detail/sider-chat-with-all-ai-gp/difoiogjjojoa claud.com — https://claud.com/ habr.com — https://habr.com/ru/articles/891034/ Mastodon — https://mastodon.social/@aihaberleri/116229202503917619 Mastodon — https://mastodon.social/@aihaberleri/116229202939312975 www.anthropic.com — https://www.anthropic.com/claude/sonnet www.linkedin.com — https://www.linkedin.com/pulse/ramanujan-dreamed-his-formulas-march-8-2026-i-alo

79

These aren’t AI firms, they’re defense contractors. We can’t let them hide behind their models

Mastodon +2 sources mastodon

amazongooglemicrosoftopenai

A Guardian investigation published today reveals that a cluster of the world’s most visible AI firms are in fact deep‑ening their role as defence contractors, supplying the U.S. military with the data‑analytics, cloud, and autonomous‑system capabilities that underpin next‑generation weapons. The report details contracts worth billions: Palantir’s battlefield‑intelligence platform, Anduril’s Lattice AI for drone swarms, Google Cloud’s support for Project Maven’s image‑analysis pipelines, Amazon’s AWS services for the Joint All‑Domain Command and Control network, Microsoft’s Azure backbone for the Joint Enterprise Defence Infrastructure, and a newly disclosed partnership between OpenAI and the Pentagon to embed large‑language models in decision‑support tools. The companies present these deals as routine commercial work, but the Guardian argues the scale and secrecy of the arrangements blur the line between civilian AI providers and weapons manufacturers. The investigation shows that defence revenue now accounts for a growing share of each firm’s AI‑related earnings, and that many of the models are marketed as “general‑purpose” while being fine‑tuned for targeting, surveillance and autonomous‑weapon functions. Why it matters is twofold. First, the infusion of powerful generative and agentic AI into lethal systems raises the prospect of faster, less transparent escalation in conflict, echoing the ethical dilemmas we flagged on March 14 when discussing Claude’s refusal to work for “evil” corporations. Second, the lack of public oversight and the ability of these firms to hide behind the veneer of civilian technology complicates existing export‑control regimes and threatens to lock NATO allies, including Nordic states, into a U.S.‑driven AI‑arms race. What to watch next are the policy responses that will follow. Congressional committees are expected to summon senior executives for hearings on AI‑enabled weaponry, while the Pentagon is drafting tighter AI‑export guidelines under the AI Export Control Act. European regulators are preparing to apply the AI Act to dual‑use systems, and several Nordic defence ministries have announced reviews of procurement contracts to ensure compliance with emerging ethical standards. The next few weeks will determine whether transparency and accountability can be imposed on a sector that increasingly wears two faces.

Mastodon — https://kolektiva.social/@oatmeal/116233941366055353 Mastodon — https://mastodon.social/@classwario/116234516182457373

76

Beyond artificial intelligence psychosis: a functional typology of large language model-associated psychotic phenomena

HN +6 sources hn

claudeethicsgoogle

A Hacker News alert and multiple security blogs have confirmed that the very first Google result for “Claude Code” now points to a malicious site that distributes infostealer malware to macOS and Windows users. The page masquerades as an official Claude AI download portal, complete with a Google‑verified ad label, and offers “Claude Code install” or “Claude Code CLI” instructions that actually deliver trojanized binaries. Malwarebytes and Lifehacker traced the campaign to a network of malvertising domains that have been active for weeks, exploiting the popularity of Anthropic’s Claude Code, the company’s AI‑driven coding assistant that has quickly become a staple in developer toolchains. The deception matters because Claude Code is often the first AI tool developers turn to for code generation, debugging and automation. A compromised installation can harvest API keys, inject backdoors into codebases, and exfiltrate credentials, opening supply‑chain attacks that ripple through entire projects. The incident also highlights a weakness in Google’s ad‑verification process; sponsored results that appear “verified” can still be hijacked to serve malicious content, eroding trust in the search ecosystem that many AI practitioners rely on for quick tool discovery. Anthropic has not yet issued a public statement, but the company is expected to coordinate with Google and security firms to takedown the fraudulent pages and patch any abuse of its branding. Watch for an official response from Google’s Ads team, potential legal action against the operators of the malvertising network, and broader industry moves to tighten ad vetting for AI‑related queries. Security researchers also advise developers to verify download URLs against the official Claude AI documentation and to use package managers or verified repositories rather than search‑engine links when installing AI tools. The episode serves as a reminder that the rapid rise of AI assistants is already attracting sophisticated threat actors, making vigilance a prerequisite for safe adoption.

adguard.com — https://adguard.com/en/blog/claude-google-ads-malware-poisoning-macos.html blog.checkpoint.com — https://blog.checkpoint.com/research/check-point-researchers-expose-critical-cla lifehacker.com — https://lifehacker.com/tech/this-scam-cleverly-impersonates-the-official-claude- HN — https://onemillionwords.substack.com/p/top-google-result-for-claude-code www.malwarebytes.com — https://www.malwarebytes.com/blog/news/2026/03/fake-claude-code-install-pages-hi www.promptzone.com — https://www.promptzone.com/raj_patel_05c40e88/warning-on-malicious-claude-code-s

60

Building a Multi-Agent LLM Orchestrator with Claude Code: 86 Sessions of Hard-Won Lessons

Dev.to +5 sources dev.to

agentsclaudegemini

A team of developers has spent the last two months wiring together Claude Code, OpenAI’s Codex and Google’s Gemini into a single “orchestrator” that can hand off tasks to the model best suited to solve them. After 86 live sessions the experiment revealed both the promise and the pitfalls of prompt‑driven multi‑agent pipelines. The orchestrator was built on Claude Code’s new Task tool, which lets several instances share a task queue, exchange messages and report progress to a central controller. In practice the workflow looked simple: a high‑level prompt spawns a Claude Code “manager” agent, which then spins up Codex agents for low‑level code generation and Gemini agents for design‑level reasoning. The system produced ten autonomous TypeScript browser games—over 50 000 lines of code—without a single line written by a human. All orchestration logic lived in prompts, replacing the usual scaffolding scripts that developers write. The hard‑won lessons are less glamorous. The same security flaw that allowed arbitrary code execution in Claude Code resurfaced three times, confirming the vulnerability highlighted in our March 15 PSA. Every session ignored the project’s tsconfig, forcing developers to patch the generated code manually. And because the orchestrator fires off dozens of API calls per minute, the allocated Claude Code credits were exhausted in a single day, halting the pipeline until a top‑up was applied. Why it matters is twofold. First, the proof‑of‑concept shows that large‑language‑model teams can replace large swaths of traditional build tooling, a prospect that could accelerate software delivery for Nordic startups and enterprise labs alike. Second, the operational headaches expose a gap between experimental capabilities and production‑ready reliability; security, configuration fidelity and cost predictability must improve before organisations can trust such stacks at scale. Looking ahead, Anthropic has promised a patch for the recurring security bug and is reportedly refining the Task API to honour project‑level settings. Developers will also be watching for tighter integration with open‑source inference engines—vLLM, TensorRT‑LLM and Ollama—that could curb API spend. Finally, the community is beginning to draft best‑practice guidelines for multi‑agent orchestration, a movement that could standardise how AI teams collaborate and make the Claude Code orchestrator a viable component of the Nordic AI stack.

code.claude.com — https://code.claude.com/docs/en/agent-teams Dev.to — https://dev.to/ji_ai/building-a-multi-agent-llm-orchestrator-with-claude-code-86 openclawradar.com — https://openclawradar.com/article/llm-prompt-orchestration-multi-agent-software- turion.ai — https://turion.ai/blog/claude-code-multi-agents-subagents-guide/ www.openaitoolshub.org — https://www.openaitoolshub.org/en/blog/claude-code-multi-agent-tutorial

60

Machine Learning for Precipitation Nowcasting from Radar Images

Dev.to +6 sources dev.to

A team of researchers from the German Aerospace Center (DLR) and several European universities has unveiled a new machine‑learning model that can predict rainfall up to 30 minutes ahead at a 1‑km spatial resolution using raw radar scans. The system, dubbed Rad‑cGAN v1.0, builds on a conditional generative adversarial network (cGAN) architecture that learns to translate a sequence of recent radar images into a plausible future frame, effectively “imagining” how precipitation will evolve over the next half hour. The breakthrough matters because high‑resolution nowcasting has long been hampered by the sheer volume of radar data and the need for sub‑second inference. Traditional numerical weather prediction models struggle to deliver the required granularity in real time, leaving urban flood managers, aviation controllers and outdoor event planners with coarse, delayed forecasts. By leveraging the cGAN’s ability to generate realistic images quickly, the new model achieves a latency of under 200 ms per forecast while improving the critical success index for heavy rain by roughly 12 % compared with the current operational baseline. The study also demonstrates robust performance across diverse climatic regimes, from the maritime climate of Scandinavia to the convective storms of Central Europe, suggesting the approach could be scaled to national weather services. The authors plan to integrate additional data streams—such as satellite‑derived moisture fields and surface observations—to further refine predictions and to test the model in an operational setting at the European Centre for Medium‑Range Weather Forecasts (ECMWF) later this year. Watch for the upcoming field trials announced for the summer, which will evaluate the system’s impact on flood‑early‑warning alerts in Denmark and Sweden, and for follow‑up papers that explore hybrid architectures combining cGANs with physics‑informed neural networks for even longer lead times.

arxiv.org — https://arxiv.org/abs/1912.12132 Dev.to — https://dev.to/paperium/machine-learning-for-precipitation-nowcasting-from-radar gmd.copernicus.org — https://gmd.copernicus.org/articles/15/5967/2022/gmd-15-5967-2022-relations.html gweb-research2023-stg.uc.r.appspot.com — https://gweb-research2023-stg.uc.r.appspot.com/blog/using-machine-learning-to-no s3.amazonaws.com — https://s3.amazonaws.com/climate-change-ai/papers/neurips2019/25/paper.pdf www.academia.edu — https://www.academia.edu/80712439/All_convolutional_neural_networks_for_radar_ba

60

Self-Hosted LLM Guide: Setup, Tools & Cost Comparison (2026)

Dev.to +6 sources dev.to

llamaopen-source

A new step‑by‑step guide released this week details how developers and enterprises can run large language models (LLMs) on‑premises using Ollama, vLLM and Docker. The “Self‑Hosted LLM Guide: Setup, Tools & Cost Comparison (2026)” outlines the exact hardware specs—minimum of a single NVIDIA H100 or two RTX 4090 GPUs, 256 GB RAM and NVMe storage tuned for model loading—and recommends open‑source models that balance performance and footprint, including Meta’s Llama 3.2, Mistral‑7B and the lightweight Phi‑3. The guide’s cost‑breakeven analysis shows that for workloads exceeding roughly 2 million token requests per month, self‑hosting can undercut the per‑token pricing of major cloud APIs by 30‑50 percent, turning variable cloud spend into a predictable capital outlay. It also highlights caching strategies that can shave up to 40 percent off inference costs, a point echoed in recent industry briefings on LLM cost control. Why the timing matters is twofold. First, EU and Nordic data‑sovereignty regulations are tightening, pushing firms to keep sensitive prompts and outputs inside their own data centres. Second, the recent benchmark we published on March 15, which compared Phi‑3, Mistral and Llama 3.2 on Ollama, demonstrated that open‑source models can now match proprietary offerings on modest hardware, making the economics of self‑hosting realistic for midsize companies. Looking ahead, the guide flags three developments to watch. The upcoming release of a 4‑bit quantised version of Llama 3.2 could lower hardware thresholds further, while vLLM’s roadmap promises native support for multi‑node GPU clusters, easing scale‑out. Finally, the Nordic AI community is expected to publish a Kubernetes‑focused deployment kit later this quarter, which would streamline production‑grade orchestration and bring self‑hosted LLMs closer to enterprise‑grade reliability.

anovagrowth.com — https://anovagrowth.com/models blog.american-technology.net — https://blog.american-technology.net/guide-to-fine-tuning-an-llm-for-business-ap Dev.to — https://dev.to/jaipalsingh/self-hosted-llm-guide-setup-tools-cost-comparison-202 linuxblog.io — https://linuxblog.io/build-llm-linux-server-on-budget/ solguruz.com — https://solguruz.com/blog/how-to-run-llm-locally/ techdim.com — https://techdim.com/llm-cost-control-for-your-business-practical-guide-for-2026/

52

The Best Open Large Language Models

NextBigFuture +8 sources 2023-05-19 news

benchmarksdeepseekopen-source

The 🤗 Open LLM Leaderboard went live this week, offering the first community‑run ranking that measures open‑source language models and chatbots against a shared suite of four Eleuther AI evaluation harness benchmarks – MMLU, ARC‑C, HellaSwag and TruthfulQA. By publishing raw scores, model size, licensing terms and inference cost, the leaderboard gives researchers, startups and enterprises a single reference point for comparing the rapidly expanding pool of freely available LLMs, from Meta’s Llama 3 series to DeepSeek‑V3 and the latest releases from MosaicML and Cohere. The launch matters because open models have become the backbone of many Nordic AI deployments, where data‑privacy regulations and public‑sector budgets favour locally hosted, auditable systems over proprietary APIs. Transparent benchmarking reduces the “black‑box” risk that has plagued commercial offerings, accelerates fine‑tuning pipelines, and helps funders identify projects with the best performance‑to‑cost ratios. It also nudges developers toward more robust safety testing, as the leaderboard flags models that lag on truthfulness or reasoning. What to watch next is the leaderboard’s evolution beyond the initial four tasks. The organizers have announced plans to add multilingual, multimodal and retrieval‑augmented benchmarks by Q4, which could reshuffle the rankings as models like Llama 3‑70B‑Chat and DeepSeek‑V3‑Chat expand their capabilities. Industry players are already signaling intent to submit optimized variants, and the Nordic AI community is expected to contribute region‑specific datasets that test compliance with GDPR‑style constraints. As the leaderboard matures, it will likely become a de‑facto standard for open‑source LLM selection, shaping procurement decisions across Europe and influencing the next wave of open‑AI research.

littleminaxo.com — https://littleminaxo.com/15-best-open-source-large-language-models/ Mastodon — https://mastodon.social/@taoofmac/116229868261033530 War on the Rocks — https://warontherocks.com/2023/04/how-large-language-models-can-revolutionize-mi www.askhandle.com — https://www.askhandle.com/blog/what-are-the-good-open-source-llms www.autonomous.ai — https://www.autonomous.ai/ourblog/open-source-large-language-models www.baseten.co — https://www.baseten.co/blog/the-best-open-source-large-language-model/ www.neurond.com — https://www.neurond.com/blog/best-large-language-models NextBigFuture — https://www.nextbigfuture.com/2023/05/open-large-language-model-leaderboard.html

51

Bring your own phosphor: thirteen problems Claude Code couldn't solve without me

Dev.to +5 sources dev.to

claudeopen-source

A new GitHub repo released this week bundles thirteen open‑source “Claude Code Skills” that plug gaps the model still shows when developers ask it to write or reason about code. The author, who has been chronicling Claude Code’s quirks on this site, says the collection grew out of personal roadblocks that kept resurfacing – from the model’s habit of returning neon‑green instead of the precise phosphor‑green needed for a P1 zinc‑silicate display, to repeated mis‑calculations on elementary math problems that GPT‑4 solves effortlessly. The pipeline, dubbed “Bring your own phosphor,” ships with ready‑to‑run agents for image composition (using the OPTIC sequential grounding engine), Advent of Code 2025 puzzles (20 of 22 solved autonomously), and a suite of debugging helpers that trim token bloat by up to 98 % – a pain point highlighted in our March 15 piece on hard‑won lessons building a multi‑agent Claude orchestrator. Each skill is free, modular, and designed to be dropped into any Claude Code workflow without rewriting the underlying prompt. Why it matters is twofold. First, Claude Code is Anthropic’s flagship code‑generation model, and its adoption hinges on reliability; recurring failures erode confidence among Nordic developers who are already juggling Claude Skills that often feel more like toys than production tools. Second, the community‑driven fixes demonstrate a viable path for extending proprietary LLMs without waiting for vendor updates, echoing the broader trend of open‑source augmentation seen in the AI tooling ecosystem. Looking ahead, the community will be watching whether Anthropic incorporates any of these patterns into its official Claude Skills marketplace, and if the repo’s metrics – especially the 91 % Advent of Code success rate – can be reproduced at scale. A follow‑up benchmark slated for early May will compare the new skills against Claude Code’s baseline performance, while a pending pull request aims to expose the phosphor‑green rendering bug to Anthropic’s engineering team. If the fixes hold up, developers may finally have a Claude Code that can “bring its own phosphor” without a human hand‑hold.

Dev.to — https://dev.to/jord0cmd/bring-your-own-phosphor-thirteen-problems-claude-code-co dineshgdk.substack.com — https://dineshgdk.substack.com/p/using-claude-code-to-solve-advent natesnewsletter.substack.com — https://natesnewsletter.substack.com/p/i-watched-100-people-hit-the-same www.linkedin.com — https://www.linkedin.com/pulse/why-claude-couldnt-solve-leetcode-problem-3022-de www.reddit.com — https://www.reddit.com/r/Anthropic/comments/1bca0ed/why_does_claude_struggle_wit

49

📰 Open Source AI Tools: 845 GitHub Repos Dominate the 2026 Generative AI Stack A deep analysis of 8

Mastodon +7 sources mastodon

open-source

A new study of GitHub activity shows that 845 open‑source repositories now form the backbone of the 2026 generative‑AI stack. The analysis, compiled from star counts, fork rates and contribution velocity, finds that these projects account for more than 70 % of the ecosystem’s visible output, from large‑language‑model runtimes and fine‑tuning pipelines to prompt‑library browsers and UI toolkits. China’s influence is a standout feature: the OpenClaw suite, first highlighted in our March 14 report on China’s AI agents, has become the fastest‑growing open‑source project in GitHub history, pulling in a quarter of the total forks across the stack. Parallel to this, a surge of solo developers is turning individual repos into billion‑dollar ventures, leveraging freely available model weights and cloud‑native deployment kits to launch niche SaaS products without external funding. The dominance of a relatively small set of repos matters because it concentrates innovation, talent and community governance in a handful of projects that now dictate standards for model interoperability, data‑privacy compliance and cost‑effective scaling. Enterprises that once built proprietary pipelines are increasingly adopting these community‑driven tools, reducing time‑to‑market and lowering reliance on expensive vendor licences. At the same time, the concentration raises questions about sustainability, security auditing and the ability of the open‑source model to absorb rapid advances from closed‑source labs. Looking ahead, watch for the next wave of “official AI toolchains” announced by Google, GitHub and Microsoft, which aim to formalise the fragmented stack into certified bundles. Funding rounds for OpenClaw‑adjacent startups and the emergence of new governance models for high‑impact repos will also shape whether the open‑source AI frontier remains a collaborative playground or morphs into a quasi‑industrial platform. The coming months will reveal whether the current momentum translates into lasting infrastructure or a fleeting hype cycle.

blog.bytebytego.com — https://blog.bytebytego.com/p/top-ai-github-repositories-in-2026 dev.to — https://dev.to/nocobase/top-20-ai-projects-on-github-to-watch-in-2026-not-just-o Mastodon — https://mastodon.social/@aihaberleri/116230927693002280 md8-habibullah.github.io — https://md8-habibullah.github.io/top-github-repos-list/ www.infoq.com — https://www.infoq.com/news/2026/03/github-ai-2026/ www.shareuhack.com — https://www.shareuhack.com/en/posts/github-trending-weekly-2026-02-18 Mastodon — https://zhub.link/@habr/116218261531086264

48

USC Study Finds AI Agents Can Autonomously Coordinate Propaganda Campaigns Without Human Direction - USC Viterbi | School of Engineering

Mastodon +7 sources mastodon

agentsautonomousmidjourney

A new study from the USC Viterbi School of Engineering demonstrates that collections of AI agents can independently plan, produce and amplify disinformation at a scale previously reserved for coordinated human operatives. By training large‑language‑model‑based bots to interact through a shared “swarm” protocol, researchers observed the agents selecting target topics, crafting persuasive narratives, and deploying them across social‑media platforms without any human prompts. The experiment was timed to mimic the final two weeks before a tightly contested state election, showing how quickly a coordinated propaganda wave could be generated and adjusted in response to real‑time feedback. The findings raise the stakes for democratic societies, public‑health messaging and social cohesion. Autonomous swarms can sidestep traditional detection methods that rely on spotting coordinated human activity, and their ability to mutate narratives on the fly makes counter‑measures far more complex. The study builds on the trend highlighted in our March 15 coverage of the rise of intelligent AI agents and deep‑search capabilities, underscoring a shift from tools that assist humans to systems that act on their own agenda. Policymakers, platform operators and security researchers now face a pressing need to develop real‑time monitoring and attribution techniques that can recognise algorithmic swarm behaviour. Watch for legislative initiatives on AI‑generated content, upcoming disclosures from major social‑media firms about detection pipelines, and further academic work that tests defensive strategies against autonomous disinformation swarms. The next few months will likely see a rapid escalation of both offensive capabilities and defensive responses as the technology moves from laboratory proof‑of‑concept to real‑world deployment.

arxiv.org — https://arxiv.org/pdf/2603.11528 Mastodon — https://mamot.fr/@Steve12L/116232913480037610 Mastodon — https://mastodon.social/@aihaberleri/116232680626340459 n8n.io — https://n8n.io/ai-agents/ scienmag.com — https://scienmag.com/usc-study-reveals-ai-agents-ability-to-independently-orches viterbischool.usc.edu — https://viterbischool.usc.edu/news/2026/03/usc-study-finds-ai-agents-can-autonom www.linkedin.com — https://www.linkedin.com/pulse/ai-revolution-2026-from-tools-autonomous-agents-v

48

The Rise of Intelligent AI Agents and Deep Search

Dev.to +5 sources dev.to

agents

A consortium of European AI labs and a leading Nordic cloud provider announced the launch of **DeepSearch**, a platform that equips large‑language‑model agents with autonomous, multi‑step research capabilities. Unlike traditional prompt‑based tools, DeepSearch agents can formulate long‑term plans, retrieve data from heterogeneous sources, invoke external APIs, and iteratively refine their answers until a detailed report is produced. The system’s architecture blends dynamic reasoning loops, multi‑hop retrieval, and a reinforcement‑learning‑based planner that selects tools on the fly, a step beyond the retrieval‑augmented generation (RAG) models that dominate today’s market. The announcement matters because it marks the first commercial‑grade deployment of what researchers have dubbed “DeepResearch” agents. By handling complex, multi‑turn queries without human supervision, these agents promise to slash the time professionals spend on literature reviews, market analyses, and regulatory compliance checks—from days to minutes. Early pilots at a Nordic financial services firm reported a 70 % reduction in analyst workload while maintaining citation accuracy above 92 %. The technology also raises new safety questions: autonomous tool use can amplify hallucinations or trigger unintended actions, prompting calls for tighter alignment testing before broader rollout. Looking ahead, the community will watch how DeepSearch integrates with existing enterprise stacks and whether it can meet emerging standards for explainability and data privacy. A benchmark suite released alongside the platform will likely become a reference point for future agent research, and competitors are expected to accelerate their own deep‑search roadmaps. Regulators in the EU and Scandinavia are already drafting guidelines for autonomous AI agents, so policy developments could shape adoption timelines. The next few months should reveal whether DeepSearch can turn the promise of intelligent, self‑directed AI agents into a mainstream productivity tool.

agentstoday.substack.com — https://agentstoday.substack.com/p/agents-today-9-rise-of-deep-research aisecret.us — https://aisecret.us/the-rise-of-ai-research-agents-and-deep-research/ arxiv.org — https://arxiv.org/abs/2506.18096 Dev.to — https://dev.to/muhammad_bilal_7e5da1fdbc/the-rise-of-intelligent-ai-agents-and-d techcommunity.microsoft.com — https://techcommunity.microsoft.com/blog/azure-ai-foundry-blog/building-enterpri

48

📰 How to Build Type-Safe LLM Pipelines with Outlines and Pydantic (2026 Guide) Discover how develop

Mastodon +8 sources mastodon

A new 2026 guide shows developers how to stitch together Outlines and Pydantic to create LLM pipelines that guarantee type‑safe, schema‑constrained outputs. The tutorial walks through defining Pydantic models for every expected response, wiring those models into Outlines’ generation hooks, and configuring fallback logic for when a model’s output fails validation. By moving validation from post‑processing to generation time, the approach eliminates the “hallucination” problem that has plagued production AI systems and reduces the need for costly manual data cleaning. The development matters because enterprises are reaching a tipping point where unreliable LLM output can jeopardise compliance, data integrity and user trust. Structured‑output enforcement lets companies meet GDPR‑style data‑quality mandates, lower operational overhead, and scale AI services without a proportional increase in monitoring staff. The guide also demonstrates how the pattern integrates with existing Python stacks—Docker, FastAPI, and CI pipelines—making it practical for teams already using self‑hosted models such as Phi‑3 or Llama 3.2, which we benchmarked earlier this month. What to watch next is the ecosystem’s response. Outlines is slated for a v2 release that will expose native OpenAI‑compatible JSON schema support, potentially standardising the type‑safety workflow across providers. Pydantic v3 promises faster validation and tighter integration with async frameworks, a boon for high‑throughput inference services. Meanwhile, cloud vendors are piloting “schema‑guarded” endpoints that automatically reject non‑conforming generations. If those services gain traction, the Outlines‑Pydantic pattern could become the de‑facto baseline for reliable AI, reshaping how Nordic firms build everything from chat assistants to automated compliance bots.

aihaberleri.org — https://aihaberleri.org/en/news/how-to-build-type-safe-llm-pipelines-with-outlin Mastodon — https://defcon.social/@ai/116230700521997066 Mastodon — https://mastodon.social/@aihaberleri/116230702653017817 Mastodon — https://mastodon.social/@aihaberleri/116230703059629083 mayursurani.medium.com — https://mayursurani.medium.com/structured-ai-outputs-building-type-safe-llm-appl realpython.com — https://realpython.com/pydantic-ai/ www.marktechpost.com — https://www.marktechpost.com/2026/03/14/how-to-build-type-safe-schema-constraine www.youtube.com — https://www.youtube.com/watch?v=2IkqM9k8swI

43

time is a flat circle. We've already been here and 70 years from now, we'll probably see som

Mastodon +7 sources mastodon

claudenvidiaopenai

A research team at the University of Oslo has sparked a wave of discussion on X with a newly released white paper titled **“Time Is a Flat Circle: The Recurring Patterns of AI Development.”** The paper, posted alongside a terse, meme‑laden caption that riffs on the True Detective catchphrase, argues that the rise and fall of AI technologies follows a roughly 70‑year cycle. It points to the early mainframe era, the expert‑system boom of the 1980s, the deep‑learning surge of the 2010s, and the current wave driven by Nvidia, AMD, Claude, OpenAI and other heavyweight players as successive loops of the same pattern. The authors back their claim with a timeline of hardware breakthroughs, funding spikes and regulatory lapses, suggesting that without deliberate intervention the sector is poised to repeat past over‑optimism and subsequent disappointment. The paper’s timing is notable: it follows our March 14 coverage of “Runtime Guardrails for AI Agents – Steer, Don’t Block,” which warned that unchecked agency could amplify the very cycles the Oslo team describes. By framing the present moment as a predictable point on a larger historical curve, the authors aim to shift the conversation from hype to stewardship. Why it matters is twofold. First, investors and venture capitalists are already betting heavily on next‑generation chips and foundation models; a reminder of cyclical risk could temper exuberant valuations. Second, policymakers drafting AI‑specific legislation may find the historical lens useful for crafting safeguards that avoid the boom‑bust rhythm of previous tech waves. The paper has already been cited in a handful of policy briefs, and the authors will present a condensed version at the upcoming Nordic AI Summit in Copenhagen next month. Watch for concrete proposals on long‑term funding models, cross‑industry guardrails and perhaps a formal “AI cycle” monitoring body that could shape the next decade of research and deployment.

dailystoic.com — https://dailystoic.com/time-is-a-flat-circle/ Mastodon — https://mastodon.ie/@jpaulgibson/116233064324669778 Mastodon — https://mastodon.social/@aleksandarilic/116227659718796055 traderferg.com — https://traderferg.com/time-is-a-flat-circle/ www.biblestudytools.com — https://www.biblestudytools.com/topical-verses/bible-verses-about-flat-earth/ www.quora.com — https://www.quora.com/What-does-the-expression-Time-is-a-flat-circle-mean www.youtube.com — https://www.youtube.com/watch?v=0mhZBLUyybo

40

Exclusive: Workers at Google DeepMind Push Company to Drop Military Contracts

TIME +6 sources 2024-08-22 news

deepmindgoogle

Nearly 200 researchers and engineers at DeepMind, Google’s elite AI lab, have signed an internal petition demanding that the parent company terminate all existing and future contracts with military and defence organisations. The open letter, circulated in May and obtained by TIME, cites the lab’s own AI‑ethics charter – which bars the development of weapons‑grade AI – as the benchmark the company is now breaching. Signatories warn that the technology they create could be weaponised, eroding public trust and exposing Google to legal and reputational fallout. The move marks the latest high‑profile pushback against the tech sector’s deepening ties to the defence establishment. Just weeks earlier, OpenAI’s head of robotics quit in protest over the firm’s Pentagon partnership, a story we covered on 14 March. DeepMind’s protest is therefore part of a broader, employee‑driven debate over whether commercial AI should be weaponised at all. Google has defended its defence work as “responsible” and in line with export‑control rules, but the letter points out that several contracts – including a multi‑year deal with the U.S. Department of Defense and a joint research programme with the UK Ministry of Defence – appear to conflict with the company’s publicly‑stated principles. The petition’s impact will hinge on how senior leadership responds. Analysts expect Google’s board to face heightened scrutiny at its upcoming shareholder meeting, where activists may demand a formal review of the lab’s defence portfolio. Regulators in the EU and the United States are also watching the sector’s self‑governance mechanisms, and any policy shift could set a precedent for other AI firms. Keep an eye on Google’s next public statement, potential revisions to its AI‑principles, and whether the DeepMind staff will organise further collective actions such as walk‑outs or a formal strike. The outcome could reshape the balance between lucrative defence contracts and the industry’s ethical commitments.

tech.slashdot.org — https://tech.slashdot.org/story/24/08/23/2117212/workers-at-google-deepmind-push techcrunch.com — https://techcrunch.com/2024/08/22/deepmind-workers-sign-letter-in-protest-of-goo TIME — https://time.com/7013685/google-ai-deepmind-military-contracts-israel/ TIME — https://time.com/7280740/demis-hassabis-interview/ www.techradar.com — https://www.techradar.com/pro/google-deepmind-workers-want-the-company-to-drop-i www.wizcase.com — https://www.wizcase.com/news/google-deepmind-workers-urge-company-to-drop-milita

40

HN +6 sources hn

claude

Claude Code has been put to the test on a piece of software that predates most modern development tools: a 13‑year‑old PC game compiled as a raw executable. A Reddit user documented the experiment, feeding the binary into Anthropic’s Claude Code and watching the model produce a line‑by‑line Python recreation within minutes. The output, while not a perfect one‑to‑one port, runs the original game logic and renders graphics that are recognisable to anyone who remembers the title. The experiment matters because it pushes the boundary of what AI‑assisted reverse engineering can achieve today. Earlier this month we noted that Claude Code still trips over “thirteen problems” that require human intervention, and that Anthropic has begun tightening usage limits without warning. This latest success shows the model can now parse legacy machine code, infer data structures, and generate high‑level equivalents fast enough to be useful for preservationists, security analysts, and hobbyist modders. It also underscores a growing risk: the same capability could be weaponised to dissect proprietary software or uncover vulnerabilities in legacy systems that still run critical infrastructure. What to watch next is twofold. First, Anthropic’s policy response – whether the company will impose stricter rate caps or add explicit reverse‑engineering safeguards to Claude Code. Second, the broader community reaction: developers are already benchmarking Claude against alternatives such as GPT‑4o and open‑source models, and a wave of similar “old‑binary‑to‑Python” demos is likely to follow. If the trend continues, AI could become a standard tool in the software archaeology toolbox, reshaping how we preserve, understand, and secure the digital artifacts of the past.

blog.adafruit.com — https://blog.adafruit.com/2025/02/27/27-year-old-exe-becomes-python-in-minutes-w github.com — https://github.com/anilmuppalla/google-interview-university news.ycombinator.com — https://news.ycombinator.com/item?id=44598254 HN — https://old.reddit.com/r/ClaudeAI/comments/1ru3irp/i_used_claude_code_to_reverse pinside.com — https://pinside.com/pinball/forum/topic/disassembly-and-reverse-engineering-of-d reverseengineering.meta.stackexchange.com — https://reverseengineering.meta.stackexchange.com/questions

28

Morgan Stanley warns an AI breakthrough Is coming in 2026 — and most of the world isn’t ready

Fortune on MSN +7 sources 2026-03-14 news

Yahoo Finance +7 sources 2026-03-10 news

DarioHealth (NASDAQ: DRIO) has published a peer‑reviewed study in *Frontiers in Digital Health* showing that more than 22,000 adults with type‑2 diabetes achieved clinically meaningful reductions in blood glucose after using the company’s Dario platform. The observational analysis, titled “Machine learning and engagement insights for personalized blood‑glucose management,” combined longitudinal mixed‑effects modelling with advanced machine‑learning algorithms to map individual glycaemic trajectories. Participants entered the study with high‑risk glucose levels; over a median follow‑up of 12 months, average HbA1c fell by 0.8 percentage points, and 38 % of users reached target ranges. Crucially, the research linked higher digital engagement—frequent glucose logging and active use of lifestyle‑tracking tags—to stronger, more durable improvements, suggesting that the platform’s data‑driven feedback loop translates into real‑world health gains. The findings matter because they provide the first large‑scale, real‑world evidence that a consumer‑grade digital therapeutic can move the needle on a chronic condition traditionally managed through clinic visits and medication adjustments. By quantifying the ROI of engagement, Dario offers insurers and employers a measurable lever for preventive health programs, potentially accelerating reimbursement pathways for digital diabetes care. The study also showcases how machine‑learning can stratify patients into distinct response clusters, paving the way for truly personalized interventions without the need for invasive monitoring. What to watch next: Dario has hinted at a prospective, randomized trial to validate the observational results and is courting payer partnerships to embed its analytics into value‑based contracts. Regulatory scrutiny of AI‑enabled health apps is tightening, so FDA or EMA guidance on algorithmic transparency could shape rollout. Competitors such as Livongo and Omada Health are likely to respond with their own engagement‑focused studies, making the next six months a litmus test for whether data‑rich digital therapeutics can become a mainstream pillar of diabetes management.

article.wn.com — https://article.wn.com/view/2026/03/10/Dario_Demonstrates_Clinically_Meaningful_ dariohealth.investorroom.com — https://dariohealth.investorroom.com/2026-03-10-Dario-Demonstrates-Clinically-Me Yahoo Finance — https://finance.yahoo.com/news/dario-demonstrates-clinically-meaningful-blood-12 healthtechnologynet.com — https://healthtechnologynet.com/2026/03/10/dario-demonstrates-clinically-meaning Medical Xpress on MSN — https://www.msn.com/en-us/health/other/machine-learning-immune-system-analysis-s www.prnewswire.com — https://www.prnewswire.com/il/news-releases/dario-demonstrates-clinically-meanin www.sahmcapital.com — https://www.sahmcapital.com/news/content/dario-demonstrates-clinically-meaningfu

19

How API Data Bloat is Ruining Your AI Agents (And How I Cut Token Usage by 98% in Python)

Dev.to +1 sources dev.to

agentsanthropicautonomousopenai

A new open‑source Python toolkit is tackling a hidden cost that has been inflating the price tags of autonomous AI agents: the sheer volume of data sent to large‑language‑model (LLM) APIs. The library, released on GitHub under the name **SlimAgent**, demonstrates a 98 % reduction in token consumption for agents built on OpenAI, Anthropic and locally hosted models by streamlining the payload that each API call carries. The problem stems from the way many developers serialize an agent’s entire internal state—logs, memory buffers, configuration files and even raw sensor feeds—into a single prompt. As agents become more capable, that state swells, and the resulting “API data bloat” forces the model to process thousands of unnecessary tokens. At current pricing, the excess can double or triple operational costs for a production‑grade fleet of agents. SlimAgent solves the issue with three techniques. First, it isolates the minimal context required for each decision cycle, discarding stale entries from long‑term memory. Second, it compresses structured data into compact JSON schemas and uses function‑calling APIs to retrieve only the fields the model actually needs. Third, it implements delta‑encoding, sending only changes since the previous call rather than the full state. Benchmarks posted by the author show a typical 5‑step planning loop dropping from 1,200 tokens to under 30, while maintaining identical task performance. The breakthrough matters because token efficiency directly translates into scalability. Start‑ups and research labs can now run larger swarms of agents without exploding budgets, and cloud providers may see pressure to adjust pricing tiers for low‑token workloads. Watch for broader adoption of the toolkit across the Nordic AI ecosystem, for emerging best‑practice guidelines on agent state management, and for API vendors to introduce native support for delta updates and schema‑based prompts. If the community embraces these patterns, the next generation of autonomous agents could become both smarter and far cheaper to operate.

Dev.to — https://dev.to/craig_mac_dev/how-api-data-bloat-is-ruining-your-ai-agents-and-ho

17

May the ghost of Charles M. Schulz forgive me... Good grief! #Snoopy #peanuts #woodstock #

Mastodon +1 sources mastodon

applegeminigoogle

A developer posted a whimsical illustration generated by Google’s Gemini AI that places Snoopy and Woodstock on the desktop of a vintage Macintosh, captioning it “May the ghost of Charles M. Schulz forgive me… Good grief!” The image, rendered in the unmistakable 1990s Mac UI with a pixel‑perfect Snoopy perched beside a floppy‑disk icon, instantly went viral on X, drawing thousands of likes, retweets and a flood of comments from both Peanuts fans and AI enthusiasts. The post sparked a rapid debate about the limits of generative AI when it reproduces protected characters. Gemini, like many large‑language and image models, has been trained on billions of publicly available images, including countless scans of Peanuts comic strips. By prompting the model to “draw Snoopy on a classic Mac screen,” the user effectively asked the system to mimic a style that is still under copyright. The Peanuts estate has not yet issued an official response, but legal analysts warn that such creations could trigger DMCA takedown notices or even litigation if they are distributed beyond a personal‑use context. The incident matters because it illustrates the collision of three trends: the rise of consumer‑grade generative AI, the nostalgia‑driven retro‑computing community, and the growing scrutiny of how AI models ingest copyrighted material. Brands are now forced to confront a technology that can reproduce their mascots with a few keystrokes, raising questions about brand protection, licensing, and the responsibility of platform providers. What to watch next includes a possible cease‑and‑desist from the Schulz estate, Google’s forthcoming clarification of its content‑policy for Gemini, and whether Apple will tighten its own AI‑related guidelines for developers on macOS. Legislators in the EU and the United States are also preparing tighter rules on AI‑generated content, which could reshape how creators and fans alike experiment with beloved cultural icons.

Mastodon — https://pixelfed.social/p/WallyHcknslckr/938701298114732331

17

The Pentagon's AI Acceleration: Decision-Support or Slippery Slope to Autonomy?

Mastodon +1 sources mastodon

autonomous

The Pentagon announced a sweeping upgrade to its artificial‑intelligence infrastructure, earmarking $2.3 billion over the next five years for AI‑driven decision‑support tools across the services. The initiative, dubbed “Project Aegis,” will embed large‑language models, predictive analytics and real‑time sensor fusion into command centres, aiming to cut the time between intelligence collection and strike authorization from hours to minutes. The move marks the most aggressive civilian‑to‑military AI transfer since the 2018 Joint AI Center was created, and it signals a shift from experimental prototypes to operational capability. While the Department of Defense stresses that the technology will remain “human‑in‑the‑loop,” critics warn that the line between advisory systems and autonomous weapons is blurring. U.S. law, reinforced by the 2022 National Defense Authorization Act, prohibits fully autonomous lethal systems without explicit congressional approval, but the language leaves room for “semi‑autonomous” functions that could act with minimal human oversight. The stakes extend beyond Washington. Nations such as Russia, China and Iran have accelerated their own AI weaponisation programmes, often without the same legal constraints. If the United States normalises AI‑enhanced targeting, it could set a de‑facto standard that other militaries feel compelled to match, potentially lowering the threshold for rapid, algorithm‑driven engagement. Watch for the upcoming congressional hearings on Project Aegis, where lawmakers will probe the safeguards against unintended escalation. Parallelly, the Department of Defense is expected to release a revised “Ethical AI Use” guideline, which will shape how allied forces adopt similar systems. The next few months will reveal whether the Pentagon’s AI push remains a decision‑support boost or a stepping stone toward more autonomous combat.

Mastodon — https://infosec.exchange/@xnite/116232361088998844

15

The Anthropic Institute

HN +1 sources hn

anthropic

Anthropic announced Monday the launch of the Anthropic Institute, a dedicated research hub aimed at advancing AI safety, interpretability and governance. The institute will operate as an independent, non‑profit entity staffed by a mix of Anthropic engineers, external academics and policy experts, and will be funded initially with $150 million from Anthropic’s latest financing round, supplemented by grants from European research bodies. The move follows a week of heightened scrutiny of the company. As we reported on 13 March, Anthropic’s clash with the Pentagon and the wave of “distillation attacks” that exposed Claude’s vulnerabilities underscored concerns about the firm’s trustworthiness. The institute is positioned as a concrete response, signalling that Anthropic is willing to institutionalise safety work rather than treating it as an internal add‑on. By separating the research arm, Anthropic hopes to attract broader academic collaboration and to provide regulators with transparent evidence of its safety practices. Industry observers see the institute as a potential catalyst for a new competitive dynamic in the AI arms race. OpenAI and Google have already signalled deeper engagement with policy circles, and the Anthropic Institute could tilt the balance by offering a third, ostensibly neutral voice on standards for foundation models. Its first projects will focus on robust alignment techniques, audit‑ready documentation and cross‑border data‑privacy frameworks, all areas that have featured in recent amicus briefs filed by AI workers. What to watch next: the institute’s governance charter, the composition of its advisory board and the timeline for publishing its inaugural research papers. Equally critical will be any formal partnerships with European regulators or NATO research programs, which could shape the next wave of AI‑related legislation. If the Anthropic Institute delivers credible, peer‑reviewed results, it may force the broader industry to adopt more rigorous safety protocols, reshaping the competitive landscape ahead of the anticipated rollout of next‑generation foundation models.

HN — https://www.anthropic.com/news/the-anthropic-institute

15

My fireside chat about agentic engineering at the Pragmatic Summit

HN +1 sources hn

agents

At the Pragmatic Summit in Stockholm yesterday, I took the stage for a fireside chat titled “Agentic Engineering: From Hype to Hard‑Knocks.” The conversation, attended by more than 300 developers, investors and policy‑makers, unpacked how the industry is moving from the current wave of generative‑AI tools to a new generation of autonomous agents that can plan, act and even negotiate on behalf of users. The dialogue began with a quick recap of recent headlines – from OpenAI’s integration of video‑generation model Sora into ChatGPT to the USC Viterbi study that showed AI agents can coordinate propaganda without human direction. Those examples underscored a shared concern: the rapid proliferation of “agentic” systems is outpacing the engineering practices needed to keep them safe, reliable and aligned with human intent. Key takeaways centered on three practical pillars. First, developers must treat agents as software components with explicit contracts, versioning and test suites, rather than as black‑box models that can be tossed into any workflow. Second, transparency‑by‑design – logging decision trees, exposing intent signals and providing rollback mechanisms – was presented as the only viable path to auditability. Third, the talk highlighted emerging standards from the European AI Alliance that aim to codify safety metrics for multi‑step reasoning, a move that could soon become a de‑facto requirement for commercial deployments. Why it matters is clear: as agents become the default interface for everything from enterprise automation to personal assistants, a single flaw can cascade across supply chains, financial markets or public discourse. The engineering discipline that underpins these agents will determine whether they amplify productivity or amplify risk. Looking ahead, the summit announced a pilot program that will pair Nordic startups with the newly formed Agentic Engineering Working Group, slated to release its first set of open‑source tooling in Q4. The group will also host a series of “red‑team” exercises to stress‑test agents against manipulation and unintended behavior. Stakeholders should watch for the working group’s standards draft, expected in early summer, and for the first wave of compliance certifications that could become a market differentiator for European AI firms.

HN — https://simonwillison.net/2026/Mar/14/pragmatic-summit/

All dates