LLMs Mimic Reddit

May 10, 2024 · 1 min read

This project explores the potential of Large Language Models (LLMs) to accurately simulate user behavior in Reddit communities. We investigate if LLMs can effectively mimic the communication patterns of specific users when provided with their comment history as context, focusing on the r/science subreddit.

Authors: Vedaant Jain*, Yoshee Jain∗, Ishq Gupta, Aditi Shrivastava, Koustuv Saha, Eshwar Chandrasekharan

Key aspects of this research include:

  • Developing prompting strategies for comment prediction and masked fill-in-the-blank tasks
  • Evaluating LLM performance on style similarity (formality, syntax) and content similarity (semantics, emotions)
  • Analyzing the accuracy of LLMs in replicating user-specific communication nuances
  • Exploring the potential applications in automated moderation and prosocial behavior promotion