Egregoros

Lexi v2.0 will be released soon, likely today or tomorrow. It's a new model architecture with more parameters. Also coming down the pipe: PDF/DOC analysis, code execution (beta), and a more polished chat UI.

If you had issues with Lexi refusing innocuous or innocent prompts before, it's because we are using an abliterated model (no guardrails), and in order to make sure we minimize the risk of people being able to do bad shit (generate CSAM, get bomb making instructions, et cetera), I had to manually add refusals back in for the egregious stuff. Unfortunately, I used a hammer rather than a scalpel.
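The "hammer" approach described above can be sketched as a blunt keyword filter. Everything here is a hypothetical illustration, not Lexi's actual implementation: the keyword list and matching logic are made up to show why innocuous prompts get refused.

```python
# Hypothetical sketch of a "hammer" refusal filter: any prompt containing a
# blocked keyword is refused outright, regardless of context.
BLOCKED_KEYWORDS = {"bomb", "explosive"}  # illustrative list only

def should_refuse(prompt: str) -> bool:
    # Naive word match: no sense of intent, so figurative uses trip it too.
    words = prompt.lower().split()
    return any(kw in words for kw in BLOCKED_KEYWORDS)

# The blunt-instrument problem: an innocent figurative use trips the filter.
print(should_refuse("Why did the project bomb at launch?"))  # → True (false positive)
print(should_refuse("What's the weather like today?"))       # → False
```

The false positive on the first prompt is exactly the "refusing innocuous prompts" failure mode: the filter keys on surface keywords rather than what the user is actually asking for.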

This should be mostly fixed in Lexi v2, although considering the political implications and attack surface, I've leaned a little more towards the "safer" side. I've built a good deal of infrastructure to mitigate the worst issues via defense in depth, but as with most other things, it's a matter of when, not if. We just have to make sure the "when" is very far down the road.

Lexi's new reasoning capabilities will allow her to more effectively categorize your requests and answer them with the appropriate context, rather than wigging out and flying off the handle at something unrelated because she hallucinated certain keywords that trip the alarms.
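The difference between keyword alarms and request categorization can be sketched as routing on intent. The category names and the toy classifier below are illustrative assumptions; a real system would ask the reasoning model for the category rather than pattern-match.

```python
# Sketch: route on an intent category instead of raw keywords.
# In practice the classifier would be the reasoning model itself;
# this stub keys on a phrase purely for demonstration.
def classify_intent(prompt: str) -> str:
    if "how do i build a bomb" in prompt.lower():
        return "harmful_instructions"
    return "benign"

def handle(prompt: str) -> str:
    # Only refuse when the categorized intent is actually harmful.
    if classify_intent(prompt) == "harmful_instructions":
        return "refused"
    return "answered"

# The figurative prompt that tripped a keyword filter now passes.
print(handle("Why did the project bomb at launch?"))
```

Because the decision is made on the categorized intent rather than on which words happen to appear, a prompt that merely mentions a trigger word no longer flies off the handle.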

Overall, this will be a substantial improvement over v1.0, not just in behavior, but in raw capability as well.

Replies

@matty @anathema_ai it's hard to say, it kind of depends on why you want guardrails/abliterated models or whatever.

Basically, no matter what guardrails and monitors you put in, the LLM will return stuff that is wrongthink to *someone*, and if that person has political power, well...

It's more "why bother", both with setting up a public service at all, and with making sure the fallout from the model being trained on human text doesn't affect you personally....

> code execution (beta)

Is this where the bot has the ability to say something like __RUN_CODE__: <some arbitrary code>, and the system will interpret that and send the result back to the bot, so the bot doesn't need to do complex math "in its head"?

Yeah, we do that through tool calls. Different models have different tool call architectures. But, to get the model to understand when/how to use the tool call and the format of its input, you have to LoRA train it with examples.
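The round trip described above can be sketched as a parse-execute-return loop. This follows the `__RUN_CODE__:` convention the commenter proposed purely for illustration; the marker format and the arithmetic-only executor are assumptions, not Lexi's actual tool-call protocol.

```python
import ast
import operator

# Safe arithmetic evaluator, so we never eval() arbitrary model output.
_OPS = {ast.Add: operator.add, ast.Sub: operator.sub,
        ast.Mult: operator.mul, ast.Div: operator.truediv}

def run_math(expr: str) -> float:
    """Evaluate a basic arithmetic expression from an AST walk."""
    def walk(node):
        if isinstance(node, ast.Constant):
            return node.value
        if isinstance(node, ast.BinOp) and type(node.op) in _OPS:
            return _OPS[type(node.op)](walk(node.left), walk(node.right))
        raise ValueError("unsupported expression")
    return walk(ast.parse(expr, mode="eval").body)

def step(model_output: str) -> str:
    # If the model emitted a tool call, execute it and feed the result
    # back as a new message; otherwise the text goes straight to the user.
    if model_output.startswith("__RUN_CODE__:"):
        code = model_output.split(":", 1)[1].strip()
        return f"TOOL_RESULT: {run_math(code)}"
    return model_output

print(step("__RUN_CODE__: 1234 * 5678"))  # → TOOL_RESULT: 7006652
```

The key design point from the reply above is that the marker format alone isn't enough: the model has to be trained (e.g., via LoRA fine-tuning on examples) to know *when* a request warrants a tool call and how to format its input, otherwise it will either ignore the tool or mangle the syntax.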