A paper that trained a GPT-2-like model on a synthetic grade-school math dataset: "We use a synthetic setting to demonstrate that language models can learn to solve grade-school math problems through true generalization, rather than relying on data contamination or template memorization."
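
To make the setup concrete, here is a minimal, hypothetical sketch of what a synthetic grade-school math generator might look like: randomized entities and quantities yield problems the model cannot solve by template memorization alone. This is an illustrative assumption, not the paper's actual data pipeline; all names and the problem structure are invented for this sketch.

```python
# Hypothetical sketch of a synthetic grade-school math generator.
# NOT the paper's actual pipeline; structure and names are assumptions.
import random

NAMES = ["Ava", "Ben", "Cora", "Dan"]
ITEMS = ["apples", "pens", "books", "coins"]

def make_problem(rng: random.Random) -> tuple[str, str]:
    """Sample a two-step add/subtract word problem and its answer."""
    name = rng.choice(NAMES)
    item = rng.choice(ITEMS)
    start = rng.randint(10, 99)
    gained = rng.randint(1, 20)
    lost = rng.randint(1, 9)
    question = (
        f"{name} has {start} {item}. {name} gets {gained} more, "
        f"then gives away {lost}. How many {item} does {name} have now?"
    )
    answer = str(start + gained - lost)
    return question, answer

if __name__ == "__main__":
    rng = random.Random(0)
    for _ in range(3):
        q, a = make_problem(rng)
        print(q, "->", a)
```

Because quantities and entities are sampled rather than drawn from web text, every training example is novel by construction, which is what lets a synthetic setting rule out data contamination as the source of performance.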