Shoothill AI Signal
Live · 13 models trackedThe AI tools your team uses change without warning. The one you trusted last month might be making things up or ignoring instructions today, and you'll only find out when a customer or your accounts team does. Shoothill AI Signal watches for you and flags it the moment something slips. Free, forever.
OpenAI, Microsoft, Google and Anthropic push updates without telling you. A model that worked fine last week can start hallucinating, ignoring instructions, or quietly getting worse. Most businesses only find out when something goes wrong on a customer email, a quote, or a report. Shoothill AI Signal is the early-warning system.
We measure how often each model invents a fact, especially in medical, legal, and finance questions. So you know the actual rate, not just the vibe.
Multi-step maths, logic, and planning: the kind of thinking your team actually relies on it for. We update the test set as the bar moves.
Catches the silent slips: ignoring formatting rules, breaking persona, drifting off-brief. The kind of slip that quietly breaks the AI tools your team relies on every day.
Compares each new score to the model's recent history. Email lands the moment something shifts past a threshold you set.
Real jobs your team actually does: drafting emails, pulling data out of invoices, sorting documents, summarising meetings. Demos always look easy. We test the messy stuff that breaks in the real world.
Every score is timestamped and exportable. Pass risk reviews and audits with a paper trail, not just an opinion.
No black-box scoring. Sample tests and the full grading methodology are published; the rest of the test set is kept private so model providers can't train against the exact prompts. Same questions every run, so scores stay comparable as the world moves on.
On a regular schedule, we put each tracked model through the same fixed library of test cases. Bespoke business scenarios, same questions every run, kept private so providers can't train against the exact prompts.
Each answer is checked against the right answer, by rules that don't change between runs. So scores today and last week are directly comparable.
Five categories combine into one Signal Score per model: truthfulness, reasoning, instruction adherence, stability, and business readiness.
Set the limits you care about. We email you the moment a model you watch crosses one.
Shoothill AI Signal isn't built for AI researchers. It's for the people who actually have to answer for it: the IT manager, the finance director, the business owner. The ones who'd rather spot a problem before a customer, an auditor, or HMRC does.
You picked a model for client-facing work. Six months later, your auditor asks how you know it still meets policy. AI Signal gives you a dated, exportable record of every score since the day you started watching.
Your team has GPT-5.5 in a live feature. The provider quietly updates the model and it starts ignoring your formatting rules. You see it on your dashboard the next morning, not in a customer support ticket.
AI Signal shows you whether the tools you're paying for, like Copilot, ChatGPT Enterprise and Gemini for Business, are getting better, worse, or standing still.
A drafted reply that's 95% right and 5% invented is the worst kind of mistake. AI Signal tracks hallucination rate per model so you know when to retrain or switch.
The C-suite asks if the firm's AI is working. AI Signal lets you answer with months of independent, dated evidence instead of a vendor's marketing slide.
Walk in with an independent, no-vendor view of which models have actually performed week after week, not whichever one your supplier wants to sell you.
Shoothill has helped UK businesses get more out of their technology since 2006. Over 400 projects across bespoke software, IT support, cybersecurity, and creative. We built Shoothill AI Signal because our clients kept asking the same question: "is this AI thing actually working?" Now anyone can check. Free.
Copilot, modern workplace, digital transformation. Invest in the right places first.
Sharp creative, smart SEO, print and digital campaigns that actually move the needle.
Custom web apps, mobile apps, and AI tailored to your team's real problems.
Managed IT, cybersecurity, connectivity. The hard part of keeping things live, handled.
Free, forever. Pick the AI tools your team uses, tell us what you want flagged, and get on with your day. We'll email you the moment something changes.
Shoothill helps businesses pick, build, and operate AI that's safe, useful, and commercially viable. Fill this in and we'll get back to you within one working day.