Effective Altruism News
Effective Altruism News
- Most technical AI safety work that I read seems to miss the mark, failing to make any progress on the hard part of the problem. I think this is a common sentiment, but there's less agreement about what exactly the hard part is? Characterizing this more clearly might save a lot of time and better target the search for solutions.
- Check out some of the ways you can win prizes, get swag, donate to charity, and ask your burning questions
- Friendship breakups are never easy, but few are as messy and expensive as the collapse of Elon Musk and Sam Altman’s once thriving tech bromance. On Thursday, closing arguments wrapped up in Musk’s lawsuit against OpenAI, leaving a jury to deliberate next week whether Altman and other executives “stole a charity” (as one of Musk’s […]...
- Deployment-time spread is the most plausible near-term route to consistent adversarial misalignment
- Risk reports commonly use pre-deployment alignment assessments to measure misalignment risk from an internally deployed AI. However, an AI that genuinely starts out with largely benign motivations can develop widespread dangerous motivations during deployment. I think this is the most plausible route to consistent adversarial misalignment in the near future.
- We have developed some relatively general methods for mechanistic estimation competitive with sampling by studying problems that are expressible as expectations of random products. This includes several different estimation problems, such as random halfspace intersections, random #3-SAT and random permanents.
- Risk reports commonly use pre-deployment alignment assessments to measure misalignment risk from an internally deployed AI. However, an AI that genuinely starts out with largely benign motivations can develop widespread dangerous motivations during deployment. I think this is the most plausible route to consistent adversarial misalignment in the near future.
- This series of posts outlines how I think about catastrophic biological risks. My goal here is to share my worldview in a straightforward and compressed form rather than trying to persuade a skeptical audience, although I do share some of my reasoning. Part I describes my views on the sources.
- Our mission is to protect humanity against biological catastrophes, including those that could lead to human extinction or cause similarly bad outcomes. This series of posts outlines how I think about these most extreme types of risks. My goal here is to share my worldview in a straightforward and compressed.
- We have developed some relatively general methods for mechanistic estimation competitive with sampling by studying problems that are expressible as expectations of random products. This includes several different estimation problems, such as random halfspace intersections, random #3-SAT and random permanents.
- AlphaGo is still the cleanest worked example of the primitives of intelligence: search, learning from experience, and self-play.
- Are those knee deep in blood any different from the rest?
- This review synthesizes veterinary science, animal welfare, and neuroscience research to argue that for animals in barren, confined environments, pain isn’t just unrelieved — it’s amplified. The post How Barren Environments Amplify Pain In Captive Animals appeared first on Faunalytics.
- Summary: AI safety is constrained on talent in many ways, but the reasons behind the constraints vary between types of talent. This post is based on all posts and documents I could find from the past ~ 3 years related to hiring needs and talent pipelines, which I have listed in this document. Technical research talent - we have strong talent pipelines delivering young researchers to the...
- Transformer Weekly: US-China talks, AI executive order, and Anthropic’s $900b valuation...
- An insidious pattern among smart people is feeling that because something is familiar and obvious, you are impervious to ignoring or forgetting it. In challenging times, I have often heard these clichés and reflexively shrugged them off. “Oh, I should dust myself off and pick myself up? What a lazy aphorism. What a patronising throwaway line. They must think I’m some kind of idiot.
- or, how to give presentations
- Last year, nearly 130 million pigs were raised for meat in the US, but they didn’t come out of nowhere; they had parents. Or as pork producers call them, “breeder pigs.” Since the 1970s, producers have been keeping most of the breeding females — known as sows — in tiny enclosures called gestation crates. It’s […]...
- "Sometimes the AI just makes stuff up" is a problem I don't really expect to go away. In the nearterm, AI is going to keep occasionally hallucinating, or misinterpreting information. Eventually, AI will be powerful enough we need to be worried if it's presenting misleading information on purpose.
- Tl;dr: Convergent abstraction hypothesis posits abstractions are often convergent in the sense of convergent evolution: different cognitive systems converge on the same abstraction, when facing similar selection pressures and learning in similar environments. It is a less ambitious alternative to 'natural abstractions hypotheses' and, in my view, more likely to be true.
- Video | Abhijit Banerjee says teaching children, not curriculum, is key to faster global education progress J-PAL's co-founder Abhijit Banerjee says teaching children, not curriculum, is key to faster global education progress, at the Yashraj Bharati Samman, 2026. spriyabalasubr… Fri, 05/15/2026 - 02:44...
- Anti-poverty program is effective even in one of the world's toughest settings Northwestern University economist and J-PAL affiliate Dean Karlan highlighted that the Graduation approach delivered results even in one of the world's most challenging environments, Somalia noting that the results fall in the upper end of the spectrum for what the program typically delivers, and that the biggest,...
- Advocates push TVET to tackle youth unemployment J-PAL affiliate Monica Lambon-Quayefio, Associate Professor at the Department of Economics, University of Ghana, said unlocking the potential of TVET required a deliberate, well-resourced, and inclusive ecosystem to prepare the youth for the modern economy during the webinar on the theme "Youth employment" organized by the World Bank Ghana with...
- Advocates push TVET to tackle youtb unemployment J-PAL affiliate Monica Lambon-Quayefio, Associate Professor at the Department of Economics, University of Ghana, said unlocking the potential of TVET required a deliberate, well-resourced, and inclusive ecosystem to prepare the youth for the modern economy during the webinar on the theme "Youth employment" organized by the World Bank Ghana with...
- It's been one year since Mercy For Animals called on Biggby Coffee and Bluestone Lane to drop the upcharge on plant-based milk. We will continue to call them out for their unjust policy! . The post Biggby Coffee and Bluestone Lane Profit at Animals’ Expense appeared first on Mercy For Animals.
- Summary: This is a summary of a paper published by the alignment team at UK AISI. Read the full paper here. AI research agents may help solve ASI alignment, for example via the following plan: Build agents that can do empirical alignment work (e.g.~writing code, running experiments, designing evaluations and red teaming) and confirm they are not scheming.
- GiveWell’s research doesn’t end once we’ve made a grant. We evaluate a subset of completed grants, comparing what we thought would happen to what actually took place, then try to use what we learn to improve our future funding decisions.
- GiveWell’s research doesn’t end once we’ve made a grant. We evaluate a subset of completed grants, comparing what we thought would happen to what actually took place, then try to use what we learn to improve our future funding decisions.
- In factory farms around the world, individual animal care is impossible. To manage thousands of farmed animals at once, workers use industrial marking paint on fur or skin, applying it with a brush, sprayer, or roller to categorize animals such as cows, pigs, goats, and sheep. Why Are Animals Spray-Painted and What Does It Represent? […]. The post Why Are Farmed Animals Spray Painted?
- 1) The safe-to-dangerous shift is a fundamental problem for eval realism. Suppose we have a capable and potentially scheming model, and before we deploy it, we want some evidence that it won’t do anything catastrophically dangerous once we deploy it. A common approach is to use black-box alignment evaluations.
- The Cyber Resilience Corps, was listed as a resource for CI Fortify, a new initiative launched by the Cybersecurity and Infrastructure Security Agency (CISA), demonstrating the strong role that volunteers play in hardening the defenses of critical infrastructure in local communities. The post Cyber Resilience Corps Listed as Key Resource in CISA’s “CI Fortify” Initiative appeared first on CLTC.
- The trial of the year draws to a close
- The post Is Anyone Still Charging Extra for Alt Milk in 2026? appeared first on Mercy For Animals.
- 1) The safe-to-dangerous shift is a fundamental problem for eval realism. Suppose we have a capable and potentially scheming model, and before we deploy it, we want some evidence that it won’t do anything catastrophically dangerous once we deploy it. A common approach is to use black-box alignment evaluations.
- Having empathy for others doesn't require weird metaphysics!
- On 14 May 2026 Pugwash held a side event during the Treaty on the Non-Proliferation of Nuclear Weapons Review Conference … More...
- Are companies the most durable cash transfers? View this email in your browser Hello! Our favourite links this month include: A major threat to animal welfare legislation in the US. The case for starting an export company instead of working in aid.
- AI incidents are scaling fast, and coordinated global governance is lagging behind. This report proposes addressing this challenge through the development of internationally-distributed incident management infrastructure. Our recommendations aim to enable governments, multilateral bodies, and frontier AI companies to jointly detect, prepare for, and respond to AI incidents across jurisdictions.
- We asked attendees at EA Global about effective altruism. Here is what Calum said. Find an upcoming conference at 👉 effectivealtruism.org/ea-global #EffectiveAltruism #EAVoxPop #EAGlobal...
- Short of time? Read the key takeaways. Value to others beats personal interest. A common mistake creators make is assuming that what fascinates them will fascinate their audience. If you value impact, we recommend focusing instead on what your specific audience finds useful, actionable, or genuinely relevant to their lives. Your impact depends on comparison, not just quality.
- The Transparency Edition Plus a new opening, updates on our farm program, R&D updates, and more View this email in your browser Hi there,. We hope you’re having a good May! We don’t have any major project updates to share this month, so we’re instead focusing our highlight article on the various ways FWI aims to be a transparent...
- the space between
- TL;DR: We estimate how often Qwen 3 4B exhibits rare harmful behaviors with 30× fewer rollouts than naive sampling, using a new method that interpolates between the model and a less-safe variant in logit space. Authors: Francisco Pernice (MIT), Santiago Aranguri (Goodfire). Introduction.
- Please help Andre. He's struggling. 🆘 Plus: community weekend recap, movie nights, and one (1) puppy with an agenda 🐶 ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ...
- Anthropic are now actively using the approach to alignment often called “ Alignment Pretraining” or “Safety Pretraining” — using Stochastic Gradient Descent on a large body of natural or synthetic documents showing the AI assistant doing the right thing in morally challenging situations.
- TL;DR: vLLM-Lens is a vLLM plugin for top-down interpretability techniques such as probes, steering, and activation oracles. We benchmarked it as 8–44× faster than existing alternatives for single-GPU use, though we note a planned version of nnsight closes this gap.
- I have tried and failed to write a longer post since 2024, so here goes a short one with less detail. Discourse has primarily focused on models' ability to develop new exploits against important software from scratch. That capability is impressive, but the tech industry has been dealing with people regularly finding 0-day exploits for important pieces of software for more than twenty years.
- TL;DR: Voters are now surprisingly open to talking about existential risk from AI. This seems to have changed in the last 6 months. When campaigning for AI safety-friendly politicians (e.g., Alex Bores), we should talk more about AI in general, and about AI risk in particular. This is currently actionable for the CA-11 and NY-12 Democratic primaries.
- Imagine you’re looking for a personal trainer. You open one trainer’s webpage and read their testimonials: “I had an experience tied for the most intense experiences of my life”; “They do it all with fun, care, and a sense of humour.” You notice that none of the testimonials mention improved body composition, fitness, or bloodwork. What would you think?.
- The post Open position: Senior Video Operations appeared first on 80,000 Hours.
- What might explain AI researcher pay, and why it matters
- I’m an EA who has been trying to find ways to make animal suffering more salient. I’ve been working on a feature-length documentary called ‘The Dying Trade’ for the last 5 years and I’ve just released it on YouTube.
- Cross-posted from The Counterfactual by the Forum Team. Subtitle: A concrete strategy for deploying the largest wave of philanthropic capital in history. . The OpenAI Foundation holds $180 billion in equity. Anthropic’s co-founders have pledged to donate 80% of their wealth. When the time comes to spend all this money, what should we actually do with it?. Here’s my best guess.
- Last month, Anthropic announced Mythos Preview, the most powerful cyberweapon in history, capable of finding and exploiting zero-day vulnerabilities in every major operating system and web browser. Meanwhile, many frontier AI company employees increasingly expect full automation of AI R&D in the next year or two, followed by the rapid automation of thousands of other important tasks and jobs.
- Teenage panic attacks are not uncommon. Teenagers are going through a crucial time of learning how to manage emotions and deal with stress, and this can be a tough challenge at times. Teenage panic attacks can occur just one or a few times, but in some cases they can develop into panic disorder (chronic, repeated panic attacks).
- Over the years, I’ve written two op-eds for The New York Times about quantum computing, at the NYT editors’ invitation: I’ve also visited the NYT office and helped NYT reporters with numerous stories about quantum computing and beyond. In the wake of Cade Metz’s infamous NYT hatchet job against Scott Alexander and the rationalist community, […]...
- After the AI super PAC endorsed her and two other Democrats, Rep. Val Hoyle went back and forth on whether she was happy with their support
- MIRI CEO Malo Bourgon at the Buckley Institute at Yale: Humans didn't wipe out 10,000+ species because we were evil. We did it because our goals weren't aligned with theirs. A superintelligence relates to us the same way. Not hostile. Just indifferent, and far more capable.
- On the dangers of being self-enamored
- Tom Davidson explains how AI could enable a small group to seize power, why he puts the risk of an AI-enabled coup at 10% in the next 30 years, and what democracies must do to prevent it. The conversation covers robot armies, the mechanics of takeover, democratic backsliding, the AI race, and the steps companies and governments should take to maintain a balance of power.
- Meat is the flesh of tortured innocent animals who did not want to die
- Disclaimer: I’m not vegan. I’m not even vegetarian. I eat meat all the time. I’ve been a firm critic of efforts to objectively quantify the difference in suffering across very different species. That said, I cannot help but agree that eating meat is probably the morally worst thing I do, and I also have to agree that eating different kinds of meat are different levels of bad.
- What looks like public education about farming is often industry PR in disguise. This blog breaks down how agriculture front groups manufacture public trust in Canada, and how advocates can counter these efforts. The post Agriculture Front Groups In Canada And The Public Trust Agenda appeared first on Faunalytics.
- Today, beneath the gilded ceilings of the House of Lords, one King delivered his speech to the nation, while we, no less crowned (and rather better armoured), listened from our rocky throne, antennae poised, claws crossed - only to find, once again, that we magnificent 10-legged creatures had been entirely overlooked.
- The strange path to global monopoly
- EA Forum Digest #291 Global development takes the spotlight this week Hello!. It’s In Development Highlight week on the EA Forum! The authors and Editor in Chief from the new global development magazine are on the Forum all week, ready to answer your questions. Start by reading their articles:
- Alicorn writes things sometimes
- Should you be worried about the hantavirus outbreak? Should you be afraid? Should you be panicking? Should you start freaking out? If you’ve been following the coverage of the hantavirus outbreak aboard the cruise ship MV Hondius, these are the questions you’ve seen posed in headlines. And a small tip from inside the media: If […]...
- I say this with love
- I had a conversation with someone who claimed offhandedly that AI will dramatically raise agricultural productivity (via agritech advancements) in low-income countries and trigger growth as a result. My instinct was to respond that we've already had substantial advancements in agricultural technology, and yet it hasn't resulted in the magnitude of yield growth, let alone economic growth, you'd...
- Red Button, Blue Button. On April 24th, 2026, Tim Urban put forth the following poll on Twitter/X: Everyone in the world has to take a private vote by pressing a red or blue button. If more than 50% of people press the blue button, everyone survives. If less than 50% of people press the blue button, only people who pressed the red button survive. Which button would you press?.
- Estimating the resources CAISI needs to deliver on American AI readiness
- Today's model specs are written for current and near-future versions of LLMs, and AI labs typically treat them as provisional. But what if the AI behaviors we set now stick around and end up governing far more capable future models by default?
- Gen Z are a bunch of cowards…or are they risking it all on crypto? The editors of The New Critic report on their generation’s Risk-geist.
- Caveat [5/14/26]. See the comments: the results are more prompt-sensitive than I'd thought. Overview. When asked about how they would give away money, or about how to have a moral career, the leading LLMs typically give answers in an EA spirit, and informed by thinking from people and organizations in the EA community.
- (An LLM Whisperer placed a strong request that I put this 2024 story somewhere not on Twitter, so it could be scraped for AI datasets besides Grok's. I perhaps do not fully understand or agree with the reasoning behind this request, but it costs me little to fulfill and so I shall. -- Yudkowsky). And another day came when the Ships of Humanity, going from star to star, found Sapience.
- Looking over my favourite posts, I notice that many of them are making specific versions of a more general claim, which is essentially: don’t confuse selective processes for predictive processes. Here, I’m going to try to make that more general claim, rehash some examples in light of it, and end with a few ambient confusions I think this framework can help with, for the reader to ponder.
- And we're hiring
- Explaining, for those out of the loop, what is coming and how we know
- Kroger's "Fresh for Everyone" slogan stops at the cage door. Unmask the truth behind their broken promise and help end cage cruelty for good. The post Kroger’s Cage-Free Egg Policy: Unmasking the Truth Behind the Broken Pledge appeared first on Mercy For Animals.
- An examination of video footage from an Australian rodeo found that calves experience fear and stress while confined in the chute — before the calf-roping event even begins. The post Rodeo Calves Experience Fear While In The Chute appeared first on Faunalytics.
- The EU's AI Act and Code of Practice requires providers of the most advanced AI models to meet the ‘state of the art’ (SOTA) in safety and security. In a new policy memo, we argue that SOTA is best understood as a process-driven concept, advanced by the broader expert ecosystem.
- This is a crosspost of the full text of Money for nothing: the roles of evidence in GiveDirectly’s journey to $1 billion delivered from In Development, made for the EA Forum's In Development Highlight Week. GiveDirectly will be taking part in the discussion thread, but the author, Paul Niehaus, may not see your comments here.
- “I would volunteer and work at a bunch of nonprofits, but it just never felt good enough. Then when I found effective altruism… it just blew my mind.” -Kearney Capuano, Program Associate at Coefficient Giving See more impact stories at 👉 effectivealtruism.org/stories #EffectiveAltruism #EffectiveAltruismStories...
- for those whose eyes evolved to see
- A disease that was once a death sentence is increasingly treatable
- Local shoppers pressure one of the nation’s largest grocers after failing to fulfill their 2025 commitment LOS ANGELES — Kroger promised customers it would go 100% cage-free. Instead, the nation’s number one supermarket chain failed to deliver, leaving millions of hens confined in cages across its supply chain, raising serious concerns about corporate accountability and […].
- For the past decade, the fight to make it legal and feasible to build housing at scale in California felt Sisyphean. California YIMBY and our allies pushed against exclusionary land use policies, and a political class content to blame the…. The post On the Race for California Governor: An Abundance of <span class="dewidow">Pro-Housing Candidates</span> appeared first on California YIMBY.
- The Availability Problem: Imagine you have cancer, or chronic pain, or a progressive degenerative disease of some sort. You have exhausted the traditional treatment options available to you, and none of them have worked. However, there are treatments that are still undergoing clinical trials which might help you.
- New York lawmakers are advancing legislation that could make the state the first on the East Coast to preemptively ban octopus factory farming, a practice scientists and advocates warn would pose significant animal welfare and environmental concerns. This week, a key Assembly bill advanced out of committee with a favorable vote, marking a major step […].
Loading...