Noryen

Started:
Completed

Tech Stack:

GolangNextJSTypeScriptPostgreSQLSupabase
Noryen

When I was building Biometryx, I ran into a problem I didn’t expect.

In a health related context AI can become dangerous fast.

The model can:

I wouldn’t know unless I manually tested every edge case. And even then - I wouldn’t catch falty output before my users.

It was not scalable. And definitely not safe.

How is my AI actually behaving in production?

Most of the time we focus on prompts, UX, speed … But we rarely ask the question - how does my AI behave in real-world scenario?

Are we going to be digging through raw JSON logs? Debugging by copy-pasting prompts into ChatGPT? Or … do we simply cross fingers and hope for the best ?

I looked into existing solutions.

They were:

I just needed something simple:

So I built it.

AI isn’t deterministic. You can’t fully predict what it will say. And if you’re building in sensitive areas, “probably safe” is not good enough.

You can try Noryen here