top | item 41136393

Using Llama3.1 405B to generate political synthetic data

5 points| gregschoeninger | 1 year ago |oxen.ai

3 comments

order

gregschoeninger|1 year ago

We thought it'd be interesting to see what political biases Llama 3.1 405B has by generating a bunch of "spam" or "ham" messages with it. We started with 5 hand crafted messages and let the LLM take it from there ending up with over 1k.

Full process was documented here:

https://www.oxen.ai/blog/create-your-own-synthetic-data-with...

Next up we are going to train a classifier on the outputs, as well as do some classical NLP (named entities, keywords, sentiment, etc) on it to see what we find.

Mainly a fun side project, but could have some interesting implications assuming candidates are using LLMs in the upcoming elections.

sthoward|1 year ago

This has the legitimate possibility to be the most telling spam or ham (and funniest) yet. Very interested in the state by state breakdowns.

mydpy|1 year ago

Is anyone using Oxen.ai? This is the first use of it I've seen.