Welcome to The Nonlinear Library, where we use Text-to-Speech software to convert the best writing from the Rationalist and EA communities into audio. This is: Future Matters #8: Bing Chat, AI labs on safety, and pausing Future Matters, published by Pablo on March 21, 2023 on The Effective Altruism Forum.

Future Matters is a newsletter about longtermism and existential risk. Each month we collect and summarize relevant research and news from the community, and feature a conversation with a prominent researcher. You can also subscribe on Substack, listen on your favorite podcast platform, and follow on Twitter. Future Matters is also available in Spanish.

A message to our readers

This issue marks one year since we started Future Matters. We’re taking this opportunity to reflect on the project and decide where to take it from here. We’ll soon share our thoughts about the future of the newsletter in a separate post, and will invite input from readers. In the meantime, we will be pausing new issues of Future Matters. Thank you for your support and readership over the last year!

Featured research

All things Bing

Microsoft recently announced a significant partnership with OpenAI [see FM#7] and launched a beta version of a chatbot integrated with the Bing search engine. Reports of strange behavior quickly emerged. Kevin Roose, a technology columnist for the New York Times, had a disturbing conversation in which Bing Chat declared its love for him and described violent fantasies. Evan Hubinger collects some of the most egregious examples in Bing Chat is blatantly, aggressively misaligned. In one instance, Bing Chat finds a user’s tweets about the chatbot and threatens to exact revenge. In the LessWrong comments, Gwern speculates on why Bing Chat exhibits such different behavior from ChatGPT, despite apparently being based on a closely related model. (Bing Chat was subsequently revealed to have been based on GPT-4.)

Holden Karnofsky asks What does Bing Chat tell us about AI risk? His answer is that it is not the sort of misaligned AI system we should be particularly worried about. When Bing Chat talks about plans to blackmail people or commit acts of violence, this isn’t evidence that it has developed malign, dangerous goals. Instead, it’s best understood as Bing acting out stories and characters it has read before. This whole affair, however, is evidence of companies racing to deploy ever more powerful models in a bid to capture market share, with very little understanding of how they work and how they might fail. Most paths to AI catastrophe involve two elements: a powerful and dangerously misaligned AI system, and an AI company that builds and deploys it anyway. The Bing Chat affair doesn’t reveal much about the first element, but it is a concerning reminder of how plausible the second is.

Robert Long asks What to think when a language model tells you it’s sentient. When trying to infer what’s going on in other humans’ minds, we generally take their self-reports (e.g. saying “I am in pain”) as good evidence of their internal states. However, we shouldn’t take Bing Chat’s attestations (e.g. “I feel scared”) at face value; we have no good reason to think that they are a reliable guide to Bing’s inner mental life. LLMs are a bit like parrots: if a parrot says “I am sentient”, this isn’t good evidence that it is sentient. But nor is it good evidence that it isn’t; in fact, we have lots of other evidence that parrots are sentient. Whether current or future AI systems are sentient is a valid and important question, and Long is hopeful that we can make real progress on developing reliable techniques for getting evidence on these matters.

Long was interviewed on AI consciousness, along with Nick Bostrom and David Chalmers, for Kevin Collier’s article, What is consciousness? ChatGPT and Advanced AI might define our answer.

How the major AI labs are thinking about safety

In the last few weeks, we got more information about how the lead...