Abstract Logical Reasoning Test

1 day ago

Alibaba Qwen 3.5 Small Models: 0.8B & 2B Benchmarks and Edge Tests

Alibaba's Qwen 3.5 Small models, available in 0.8B and 2B sizes, run offline on phones and laptops, with mixed reliability on hard tasks.

Manila Bulletin

DepEd NCR conducts first unified science high school admissions test for Grade 7

The Department of Education–National Capital Region (DepEd NCR) on Saturday, March 7, conducted the first Unified Science High School Admissions Test ...

Live Science on MSN

'Proof by intimidation': AI is confidently solving 'impossible' math problems. But can it ...

AI could soon spew out hundreds of mathematical proofs that look "right" but contain hidden flaws, or proofs so complex we can't verify them. How will we know if they're right?

VentureBeat

Databricks' OfficeQA uncovers disconnect: AI agents ace abstract tests but stall at 45% on ...

There is no shortage of AI benchmarks in the market today, with popular options like Humanity's Last Exam (HLE), ARC-AGI-2 and GDPval, among numerous others. AI agents excel at solving abstract math ...

Nature

‘Tiny’ AI model beats massive LLMs at logic test

A small-scale artificial-intelligence model that learns from only a limited pool of data is exciting researchers for its potential to boost reasoning abilities. The model, known as Tiny Recursive ...

AOL

Only Logical Thinkers Can Score 23/25 On This Tricky Cognitive Ability Test

Ready for a quick brain workout? This quiz has 25 questions that are here to test how well you notice patterns, think logically, and connect the dots. It might seem easy at first, but once you start ...

GitHub

abstract-reasoning

A prompt-level hack for deeper LLM thinking, which applies abstract reasoning principles to direct LLMs to look at paradoxes and edge cases from different angles.

marktechpost

AbstRaL: Teaching LLMs Abstract Reasoning via Reinforcement to Boost Robustness on GSM ...

Recent research indicates that LLMs, particularly smaller ones, frequently struggle with robust reasoning. They tend to perform well on familiar questions but falter when those same problems are ...

Frontiers

Senior high school students’ competence in logical operation and logical reasoning

The research suggests that the framework of logical operations and inference patterns remains unfinished even in adulthood. While various logical models exist beyond the classical true-or-false ...

GEN

Brain Regions Essential for Logical Thinking and Problem Solving in Humans Identified

Using two newly developed types of reasoning tests, a team of researchers at UCL and UCLH has identified key brain regions that are essential for logical thinking and problem-solving. The results will ...
