<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Benchmark Datasets | Vedaant Jain</title><link>https://vedaantjain.netlify.app/tags/benchmark-datasets/</link><atom:link href="https://vedaantjain.netlify.app/tags/benchmark-datasets/index.xml" rel="self" type="application/rss+xml"/><description>Benchmark Datasets</description><generator>Hugo Blox Builder (https://hugoblox.com)</generator><language>en-us</language><lastBuildDate>Fri, 07 Jun 2024 00:00:00 +0000</lastBuildDate><image><url>https://vedaantjain.netlify.app/media/icon_hu68170e94a17a2a43d6dcb45cf0e8e589_3079_512x512_fill_lanczos_center_3.png</url><title>Benchmark Datasets</title><link>https://vedaantjain.netlify.app/tags/benchmark-datasets/</link></image><item><title>HumorDB</title><link>https://vedaantjain.netlify.app/publication/preprint/</link><pubDate>Fri, 07 Jun 2024 00:00:00 +0000</pubDate><guid>https://vedaantjain.netlify.app/publication/preprint/</guid><description>&lt;p>This work introduces HumorDB, a novel dataset designed to advance visual humor understanding in AI systems. The dataset consists of carefully curated image pairs with contrasting humor ratings, emphasizing subtle visual cues that trigger humor while mitigating potential biases.&lt;/p>
&lt;p>Key features of HumorDB include:&lt;/p>
&lt;ul>
&lt;li>Evaluation through multiple tasks: binary classification, range regression, and pairwise comparison&lt;/li>
&lt;li>Focus on capturing the subjective nature of humor perception&lt;/li>
&lt;li>Potential as a zero-shot benchmark for large multimodal models&lt;/li>
&lt;/ul>
&lt;p>Our initial experiments reveal that while vision-only models struggle with humor detection, vision-language models, particularly those leveraging large language models, show promising results. This work contributes to pushing the boundaries of AI&amp;rsquo;s ability to comprehend nuanced human communication, specifically in the domain of visual humor.&lt;/p></description></item></channel></rss>