For half a century, artificial intelligence research has attempted to reproduce the human qualities of abstraction and reasoning - creating computer systems that can learn new concepts from a minimal set of examples, in settings where humans find this easy. While specific neural networks are able to solve an impressive range of problems, broad generalisation to situations outside their training data has proved elusive. In this work, we look at several novel approaches for solving the Abstraction & Reasoning Corpus (ARC). This is a dataset of abstract visual reasoning tasks introduced to test algorithms on broad generalization. Despite three international competitions with $100,000 in prizes, the best algorithms still fail to solve a majority of ARC tasks. The best solvers today rely on complex hand-crafted rules, without using machine learning at all. We revisit whether recent advances in neural networks allow progress on this task, or whether an entirely different class of models are required. First, we adapt the DreamCoder neurosymbolic reasoning solver to ARC. DreamCoder automatically writes programs in a bespoke domain-specific language to perform reasoning, using a neural network to mimic human intuition. We present the Perceptual Abstraction and Reasoning Language (PeARL) language, which allows DreamCoder to solve ARC tasks, and propose a new recognition model that allows us to significantly improve on the previous best implementation. We also propose a new encoding and augmentation scheme that allows large language models (LLMs) to solve ARC tasks, and find that the largest models can solve some ARC tasks. LLMs are able to solve a different group of problems to state-of-the-art solvers, and provide an interesting way to complement other approaches. We perform an ensemble analysis, combining systems to achieve better results than any system alone and analysing individual strengths. However, it is sobering to see that approaches based on neural networks still lag behind existing hand-crafted solvers, and we suggest avenues for future improvements. Our findings with the ensemble model may indicate that a diversity of methods might be necessary to solve problems in ARC. Humans likely employ diverse strategies to solve ARC. Studies involving human participants to identify the strategies they employ to solve ARC could provide valuable insights for future AI approaches. Finally, we publish the arckit Python library to make future research on ARC easier.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11561310PMC
http://dx.doi.org/10.1038/s41598-024-73582-7DOI Listing

Publication Analysis

Top Keywords

solve arc
20
neural networks
16
abstraction reasoning
16
arc tasks
16
arc
10
solve
9
llms solve
8
reasoning
7
neural
5
tasks
5

Similar Publications

Study on numerical simulation of groundwater flow field and slope stability in multi-aquifer open pit mine.

Sci Rep

December 2024

Liaoning Institute of Technology and Equipment for Mineral Resources Development and Utilisation in Higher Educational Institutions, Liaoning Technical University, Fuxin, 123000, Liaoning, China.

Water is one of the most important influences on slope stability in open pit mines. In order to solve the problem of slope stability analysis in multi-aquifer open pit mines, the open pit mine in Block I of Thar Coalfield in Pakistan with multiple aquifers was taken as the research background. The groundwater flow field at different excavation phases was analyzed by numerical simulation method.

View Article and Find Full Text PDF

Fluid-Structure Interaction Analysis of Trapezoidal and Arc-Shaped Membranes Mimicking the Organ of Corti.

Int J Numer Method Biomed Eng

January 2025

Department of Mechanical Science and Bioengineering, Graduate School of Engineering Science, Osaka University, Osaka, Japan.

In a previous study [H. Shintaku et al., Sensors and Actuators A: Physical 158 (2010): 183-192], an artificially developed auditory sensor device showed a frequency selectivity in the range from 6.

View Article and Find Full Text PDF

Objective: Proton spot-scanning arc therapy (ARC) is an emerging modality that can improve the high-dose conformity to targets compared with standard intensity-modulated proton therapy (IMPT). However, the efficient treatment delivery of ARC is challenging due to the required frequent energy changes during the continuous gantry rotation. This work proposes a novel method that delivers a multiple IMPT (multi-IMPT) plan that is equivalent to ARC in terms of biologically effective dose (BED).

View Article and Find Full Text PDF

Importance: In poor-prognosis children's cancers, new therapies may carry fresh hope for patients and parents. However, there is an absolute requirement for any new therapy to be properly evaluated to fulfill scientific, regulatory, and reimbursement requirements. Randomized clinical trials (RCTs) are considered the gold standard, but no consensus exists on how and when they should be deployed to best meet the needs of all stakeholders.

View Article and Find Full Text PDF

Comparative study on corrosion characteristics of conductive concrete in red soil environment.

PLoS One

December 2024

School of Electrical and Automation Engineering, East China Jiaotong University, Nanchang, Jiangxi province, China.

In order to solve the corrosion problem of grounding materials in highly corrosive red soil environments, conductive concrete was proposed as a new type of grounding material. The corrosion resistance of conductive concrete was tested and compared to select a suitable preparation scheme with excellent corrosion resistance. A series of conductive concrete samples were made using different conductive materials such as graphite, stainless steel fiber (SSF), and ordinary silicate concrete.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!