Join the conversation

Join the community of Machine Learners and AI enthusiasts.

Sign Up
unmodeled-tyler 
posted an update about 12 hours ago
Post
509
NEW MODEL: vanta-research/mox-8b

Hey everyone! I changed up my approach with this one a bit. Mox was designed with the following characteristics:

- self coherence
- direct opinions
- epistemic confidence
- grounded meta-awareness
- reasoned refusals

I've been thinking a lot about what "helpfulness" means lately. Commonly in AI, that looks like fulfilling user requests as closely as possible as long as the request isn't unsafe.

But I wanted to know what it was like to build a model that might be helpful in the same way a human would be.

For example, if you ask Mox to write a 10 page paper on the cultural significance of staplers, Mox will probably refuse, tell you that wouldn't be useful or helpful to ANYBODY and recommend a different, but more useful approach.

Mox is still very much a work in progress, but I think that this is a good starting point! I'm already generating more datasets to add more elements to Mox's persona in future versions, which you should see on the hub soon!

In this post