Home
Research Papers
Datasets
State of the Art
Pricing
Sign In
Sign Up
Reasoning | State-of-the-Art
Reasoning
Explore cutting-edge benchmarks and research
181
Benchmarks
51
Tasks
302
Papers
Multi-Label Classification
Multi-Label Classification
10
benchmarks
10
papers
Missing Labels
0
benchmarks
9
papers
Extreme Multi-Label Classification
0
benchmarks
8
papers
Medical Code Prediction
7
benchmarks
10
papers
Hierarchical Multi-label Classification
16
benchmarks
8
papers
Video-based Generative Performance Benchmarking
Video-based Generative Performance Benchmarking (Consistency)
1
benchmarks
9
papers
Video-based Generative Performance Benchmarking (Contextual Understanding)
1
benchmarks
9
papers
Video-based Generative Performance Benchmarking (Correctness of Information)
1
benchmarks
9
papers
Video-based Generative Performance Benchmarking (Detail Orientation))
1
benchmarks
9
papers
Video-based Generative Performance Benchmarking (Temporal Understanding)
1
benchmarks
9
papers
Natural Language Inference
Answer Generation
2
benchmarks
10
papers
Visual Entailment
3
benchmarks
9
papers
Cross-Lingual Natural Language Inference
4
benchmarks
10
papers
Natural Language Inference
31
benchmarks
10
papers
Video Question Answering
Video Question Answering
20
benchmarks
10
papers
Zero-Shot Video Question Answer
13
benchmarks
10
papers
Few-shot Video Question Answering
0
benchmarks
1
papers
Autonomous Navigation
Sequential Place Recognition
0
benchmarks
5
papers
Autonomous Flight (Dense Forest)
1
benchmarks
1
papers
Decision Making Under Uncertainty
Uncertainty Visualization
0
benchmarks
5
papers
Decision Making Under Uncertainty
0
benchmarks
7
papers
Decision Making
Imitation Learning
0
benchmarks
10
papers
Decision Making
1
benchmarks
10
papers
Mathematical Proofs
Automated Theorem Proving
10
benchmarks
9
papers
Mathematical Proofs
0
benchmarks
8
papers
Multi-Label Learning
Multi-Label Learning
1
benchmarks
7
papers
Missing Labels
0
benchmarks
9
papers
Visual Reasoning
Visual Commonsense Reasoning
7
benchmarks
10
papers
Visual Reasoning
12
benchmarks
10
papers
General Reinforcement Learning
Offline RL
2
benchmarks
10
papers
Model-based Reinforcement Learning
0
benchmarks
10
papers
Commonsense Reasoning for RL
Commonsense Reasoning for RL
1
benchmarks
1
papers
Identify Odd Metapor
Identify Odd Metapor
1
benchmarks
2
papers
Human Judgment Classification
Human Judgment Classification
1
benchmarks
2
papers
Human Judgment Correlation
Human Judgment Correlation
2
benchmarks
3
papers
Anachronisms
Anachronisms
0
benchmarks
3
papers
Theory of Mind Modeling
Theory of Mind Modeling
0
benchmarks
5
papers
Analogical Similarity
Analogical Similarity
1
benchmarks
4
papers
Abstract Argumentation
Abstract Argumentation
0
benchmarks
4
papers
Pre-election ratings estimation
Pre-election ratings estimation
0
benchmarks
1
papers
Geometry Problem Solving
Geometry Problem Solving
0
benchmarks
8
papers
Odd One Out
Odd One Out
1
benchmarks
9
papers
Causal Identification
Causal Identification
0
benchmarks
10
papers
Generative Visual Question Answering
Video-based Generative Performance Benchmarking
1
benchmarks
9
papers
Discrete Choice Models
Discrete Choice Models
0
benchmarks
10
papers
Natural Language Visual Grounding
Natural Language Visual Grounding
0
benchmarks
10
papers
Multimodal Reasoning
Multimodal Reasoning
3
benchmarks
10
papers
Systematic Generalization
Systematic Generalization
0
benchmarks
9
papers
Mathematical Question Answering
Math Word Problem Solving
12
benchmarks
10
papers
Math Word Problem Solving
Math Word Problem Solving
12
benchmarks
10
papers
Arithmetic Reasoning
Arithmetic Reasoning
1
benchmarks
10
papers