<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Natural Language Processing | Vedaant Jain</title><link>https://vedaantjain.netlify.app/tags/natural-language-processing/</link><atom:link href="https://vedaantjain.netlify.app/tags/natural-language-processing/index.xml" rel="self" type="application/rss+xml"/><description>Natural Language Processing</description><generator>Hugo Blox Builder (https://hugoblox.com)</generator><language>en-us</language><lastBuildDate>Fri, 10 May 2024 00:00:00 +0000</lastBuildDate><image><url>https://vedaantjain.netlify.app/media/icon_hu68170e94a17a2a43d6dcb45cf0e8e589_3079_512x512_fill_lanczos_center_3.png</url><title>Natural Language Processing</title><link>https://vedaantjain.netlify.app/tags/natural-language-processing/</link></image><item><title>Curriculum Learning for Embodied Planning with LLMs</title><link>https://vedaantjain.netlify.app/project/embodiedcurriculum/</link><pubDate>Fri, 10 May 2024 00:00:00 +0000</pubDate><guid>https://vedaantjain.netlify.app/project/embodiedcurriculum/</guid><description>&lt;p>This project explores the application of Curriculum Learning to improve the performance of GPT-2 models in Embodied Natural Language Processing tasks using the ALFWorld dataset. We developed curricula for both Action Modeling and Reinforcement Learning stages, demonstrating significant improvements in task success rates and action efficiency.&lt;/p>
&lt;p>Authors: Bohan Liu, Vedaant Jain, Aarohi Gupta&lt;/p>
&lt;p>Key aspects of this research include:&lt;/p>
&lt;ul>
&lt;li>Developing difficulty scoring mechanisms for task demonstrations&lt;/li>
&lt;li>Creating &amp;ldquo;Easy&amp;rdquo; and &amp;ldquo;Hard&amp;rdquo; curriculum sets to structure model training&lt;/li>
&lt;li>Investigating the impact of curricula on model generalization across task types&lt;/li>
&lt;li>Exploring the potential of few-shot learning with large language models&lt;/li>
&lt;li>Demonstrating the effectiveness of a two-stage &amp;ldquo;easy-then-hard&amp;rdquo; curriculum in Reinforcement Learning&lt;/li>
&lt;/ul>
&lt;p>Our results show that carefully designed curricula can enhance model performance, improve generalization to unseen tasks, and increase learning efficiency in embodied AI environments.&lt;/p></description></item><item><title>LLMs Mimic Reddit</title><link>https://vedaantjain.netlify.app/project/redditsimulations/</link><pubDate>Fri, 10 May 2024 00:00:00 +0000</pubDate><guid>https://vedaantjain.netlify.app/project/redditsimulations/</guid><description>&lt;p>This project explores the potential of Large Language Models (LLMs) to accurately simulate user behavior in Reddit communities. We investigate if LLMs can effectively mimic the communication patterns of specific users when provided with their comment history as context, focusing on the r/science subreddit.&lt;/p>
&lt;p>Authors: Vedaant Jain*, Yoshee Jain∗, Ishq Gupta, Aditi Shrivastava, Koustuv Saha, Eshwar Chandrasekharan&lt;/p>
&lt;p>Key aspects of this research include:&lt;/p>
&lt;ul>
&lt;li>Developing prompting strategies for comment prediction and masked fill-in-the-blank tasks&lt;/li>
&lt;li>Evaluating LLM performance on style similarity (formality, syntax) and content similarity (semantics, emotions)&lt;/li>
&lt;li>Analyzing the accuracy of LLMs in replicating user-specific communication nuances&lt;/li>
&lt;li>Exploring the potential applications in automated moderation and prosocial behavior promotion&lt;/li>
&lt;/ul></description></item><item><title>Multi-Modal Information Extraction from Academic Resumes</title><link>https://vedaantjain.netlify.app/project/resumeextraction/</link><pubDate>Wed, 10 May 2023 00:00:00 +0000</pubDate><guid>https://vedaantjain.netlify.app/project/resumeextraction/</guid><description>&lt;p>This project addresses the challenge of extracting structured information from academic resumes, which often span multiple pages and contain complex, domain-specific content. We developed a novel approach combining document layout analysis and sequence tagging to accurately segment and extract key information from various resume sections.&lt;/p>
&lt;p>Key aspects of this research include:&lt;/p>
&lt;ul>
&lt;li>Utilizing Document-Image-Transformer (DiT) for title detection and resume sectioning&lt;/li>
&lt;li>Implementing BERT-based sequence tagging models for information extraction from specific sections (education, employment, publications)&lt;/li>
&lt;li>Creating a labeled dataset of 30+ academic resumes (250+ pages) for model training and evaluation&lt;/li>
&lt;/ul></description></item></channel></rss>