Question 1

Do I have to bring my own API keys?

Accepted Answer

Yes for ongoing use, on every tier. New accounts get 7 platform-funded calls (Critique, Improve, Rewrite) to explore the workbench. Connect a Claude, OpenAI, or Google key when you're ready to keep going. Inference is billed by your provider, never by us.

Question 2

I have never used an API key. Can I still try this?

Accepted Answer

Yes. Your first 7 calls (Critique, Improve, Rewrite) run on us so you can try the workbench before setting anything up. Provider API keys take about two minutes after that. Anthropic, OpenAI, and Google all let you generate one for free, and you only pay for what your prompts use. We will walk you through it the first time you sign in.

Question 3

Does my team need to write code to use Prompt Assay?

Accepted Answer

No. The editor is a writing surface; anyone on your team who writes prompts can use it directly. Engineers wire the prompts into your app via the API or SDK, but PMs, content writers, and domain experts can author and version prompts in the editor without touching code.

Question 4

Which providers are supported?

Accepted Answer

Anthropic, OpenAI, and Google. Every workflow, including playground, critique, and evaluation, runs against all three.

Question 5

How is the critique different from asking a model what it thinks?

Accepted Answer

The critique scores on six fixed dimensions and returns structured, actionable notes. The radar chart and per-dimension scores give you something to act on, not a paragraph of prose.

Question 6

Can I bring prompts from another tool?

Accepted Answer

Yes. Import supports Anthropic JSON, OpenAI JSON, Markdown, and bundled archives. Export produces the same. If you are coming from another prompt tool, your library moves with you.

Question 7

Who owns the prompts and Skills I write here?

Accepted Answer

You do. Fully. We do not train on your content, we do not share it with anyone, and you can export everything · prompts, Skills, fragments, version history, evaluation suites, Behavioral Eval runs · at any time.

Question 8

Is there a free tier, really?

Accepted Answer

Yes. 250 AI calls a month on your keys, a single seat, full authoring features. No credit card to start.

Question 9

What is the difference between Solo and Team?

Accepted Answer

Solo is a single-seat workspace. Team adds a shared org library, roles, invitations, and seat-based billing starting at three seats. Upgrading is a click.

Question 10

Do I need SSO?

Accepted Answer

Only if your company requires it. SAML SSO is on the Enterprise tier. Everything else is email and password or OAuth through Google or GitHub.

Question 11

What happens to my data if I cancel?

Accepted Answer

You can export everything first. After cancellation, your content is retained according to the schedule in our data processing agreement, then deleted.

Question 12

Which features need a paid tier?

Accepted Answer

Free covers every AI instrument, full evaluation suites, Skills authoring, and the multi-provider Behavioral Eval, with a 250-call monthly cap and a single seat. The public REST API and TypeScript SDK start on Solo. Multi-seat workspaces with shared libraries and roles start on Team.

Question 13

How does the 250 monthly call cap on Free work?

Accepted Answer

Each AI instrument run counts as one call. Compare-models counts one call per model in the run. A 5-model compare burns 5 calls. Skills Behavioral Eval cells count too: a 3-model run with 4 trigger and 2 non-trigger probes is 18 inference calls plus 18 judge calls, so it adds up fast on Free. Solo and above are unlimited on your own keys.

Question 14

What does Convert do?

Accepted Answer

Convert is the AI pair's prompt-to-Skill bridge. Paste a prompt, click Convert, and in one shot the AI proposes a complete Agent Skill bundle (SKILL.md plus optional scripts and references). Preview before applying. BYOK-only, not part of the 7-call demo budget. Available on both the Prompts and Skills workbenches.

Question 15

What's a Skill, and how is this different from Anthropic's Skill Creator?

Accepted Answer

An Agent Skill is a folder with a SKILL.md plus optional scripts and references that Claude (and now OpenAI Codex, Cursor, VS Code, GitHub Copilot, and Gemini CLI) load on demand to specialize behavior. Anthropic's skill-creator plugin authors them inside Claude Code; that's the right tool if you only ship to Claude. Prompt Assay is the cross-provider workbench around the same SKILL.md format: the linter that catches secrets and footguns, the six-dimension critique scorecard, and the multi-provider Behavioral Eval that scores how reliably Claude, GPT, and Gemini each activate the skill on the cases that matter to you.

Question 16

What's the Behavioral Eval, and what do I need to run one?

Accepted Answer

An eval that takes your Skill, runs it against trigger probes (positive cases the skill should activate on) and non-trigger probes (negative cases it should stay dormant on) across the providers you pick, then has a judge model score activation and instruction adherence per cell. You need a BYOK key for every provider you want to test, the SKILL.md plus optional scripts and references already authored in the workbench, and three or four sample probes. Caps are uniform across every paid tier (5 models × 10 trigger × 6 non-trigger probes per run); the per-run cost preflight blocks anything over $5 unless you confirm.

Question 17

Can I share a Skill report publicly?

Accepted Answer

Yes. Save a Behavioral Eval run as a Skill Report at /share/skill-report/<id>. Defaults are conservative: noindex on robots, the SKILL.md body is hidden until the original author opts to publish it, and you can revoke the share from /settings/shares at any time. The same artifact backs a Shields.io README badge so the score on your repo stays in sync with the latest published run.

Question 18

Does the Skills Behavioral Eval also run on my BYOK keys?

Accepted Answer

Yes. Every cell · the inference call against each model and the judge call that scores activation · routes through your provider keys. We never proxy that traffic and we never aggregate it for our own use. Provider bills go to your provider account exactly the same way prompt critique and Compare-models do.

Ship prompts and Agent Skillsthat hold up in production.

Find the leaks before they ship.

Everyone moved upstream or downstream. We stayed where the craft lives.

Run one prompt across every provider. Let a judge call the winners.

Author Agent Skills. Run evals across every provider. Ship the badge.

A day in the life of your artifacts.

Your keys. Your bill. No markup on a single token.

One workbench. Two ways to work in it.

Ship production prompts and Skills by yourself · no team required.

Review prompts and Skills the way you review code.

Pull any version into production.

Trust, before testimonials.

No demo call. No card. Free tier ships every AI instrument.

Questions, answered plainly.

Open the workbench. Ship prompts and Skills that hold up.