[NeurIPS 2024] Official code for HourVideo: 1-Hour Video Language Understanding
Updated Jul 12, 2025 - Jupyter Notebook
[CVPR-2024] The First High Definition (HD) Event based Visual Object Tracking Benchmark Dataset
The MAMA-MIA Dataset: A Multi-Center Breast Cancer DCE-MRI Public Dataset with Expert Segmentations
[Pattern Recognition 2025] A large-scale benchmark dataset for color-event based visual tracking
[IJCV-2026, arXiv:2408.09764] Event Stream based Human Action Recognition: A High-Definition Benchmark Dataset and Algorithms
This repository contains a gym environment that can be used for developing solvers for robotic 3D bin packing problems.
A High-Quality and Large-Scale Dataset for English-Vietnamese Speech Translation (INTERSPEECH 2022)
Dataset package for facile training and testing of machine learning/AI algorithms that predict drug response in cancer model systems.
AdvSV stands as the first dataset developed specifically for evaluating Speaker Verification (SV) systems against adversarial attacks. It aims to benchmark the robustness of ASV models in the face of such attacks and offers vital resources for researchers to explore the characteristics of adversarial and replay attacks in this domain.
The SWAN-SF dataset is now fully preprocessed, optimized, and ready for binary classification tasks. Our team is excited to release the enhanced version of the SWAN-SF dataset across all five partitions.
[FAccT '25] Characterizing Bias: Benchmarking LLMs in Simplified versus Traditional Chinese
Documentation for preparing and formatting LARRY datasets for ML applications with PyTorch / PyTorch Lightning
Code for LEMMA-RCA website
Collaborating to improve population dynamics models through benchmark dataset validation
Open synthetic benchmark dataset for dental clinical note extraction and summarization with ICD-10-CM diagnoses and structured tooth-level annotations.
Code repository for PhD dissertation "Statistical Extensions of Multi-Task Learning with Semiparametric Methods and Task Diagnostics"
Libyan Restaurants: A Benchmark Dataset for Sentiment Analysis in the Libyan Arabic Dialect
Open topology-grounded benchmark for datacenter RCA, hidden-target localization, and counterfactual remediation validation.