AI: A Guide for Thinking Humans
The LLM Reasoning Debate Heats Up
Three recent papers examine the robustness of reasoning and problem-solving in large language models
Oct 21
•
Melanie Mitchell
September 2024
Podcast on "The Nature of Intelligence"
I never thought I would be a podcast host, but…Abha Eli Phoboo, the director of communications at the Santa Fe Institute, recently relaunched SFI’s…
Sep 21
•
Melanie Mitchell
August 2024
The Turing Test and Our Shifting Conceptions of Intelligence
Has the famous Turing Test been passed?
Aug 15
•
Melanie Mitchell
On the “ARC-AGI” $1 Million Reasoning Challenge
In this post I’m going to go into the weeds, describing how some people are trying to win a big $$$ prize for solving a still-wide-open AI challenge…
Aug 6
•
Melanie Mitchell
May 2024
Stress-Testing Large Language Models’ Analogical Reasoning Abilities
Hello all.
May 21
•
Melanie Mitchell
Evaluating Large Language Models Using “Counterfactual Tasks”
“[O]ne thing is clear: LLMs are not human.
May 13
•
Melanie Mitchell
"AI now beats humans at basic tasks": Really?
Two weeks ago, Nature, one of the world’s most prestigious journals, had this jarring headline:
May 2
•
Melanie Mitchell
January 2024
An “AI Breakthrough” on Systematic Generalization in Language?
Here’s a fun puzzle for you. I’ll give you six words in an alien language: saa, guu, ree, fii, hoo, and muo. Figure 1 gives a diagram…
Jan 7
•
Melanie Mitchell
September 2023
Can Large Language Models Reason?
What should we believe about the reasoning abilities of today’s large language models? As the headlines above illustrate, there’s a debate raging over…
Sep 10, 2023
•
Melanie Mitchell
June 2023
Did GPT-4 Hire And Then Lie To a Task Rabbit Worker to Solve a CAPTCHA?
A Little Fact Checking Is In Order
Jun 12, 2023
•
Melanie Mitchell
May 2023
On Evaluating Understanding and Generalization in the ARC Domain
In a previous post I wrote about the Abstraction and Reasoning Corpus (ARC), an idealized domain created by François Chollet for evaluating abstraction…
May 15, 2023
•
Melanie Mitchell
April 2023
Do half of AI researchers believe that there's a 10% chance AI will kill us all?
Fact-checking a widespread claim
Apr 23, 2023
•
Melanie Mitchell