How to Judge a Competition Fairly: Best Practices for Event Organizers
Learn proven strategies for fair competition judging — picking between a closed expert panel and audience voting, defining clear criteria, and minimizing bias.
Article Contents
Your competition's reputation lives or dies on one thing: do participants trust the results?
Nobody remembers who came second at a well-run event. But everyone remembers the competition where the judging felt rigged, where the criteria changed mid-way, or where one judge's cousin somehow won.

First, Decide Who's Actually Doing the Judging
Before any of the rest of this matters, settle the more fundamental question: should a closed panel of judges decide the result, or should the audience?
These are not interchangeable, even when they end up at the same winner. A panel produces a credible result; an audience vote produces a popular result. The two answer different questions, and getting clear on which question your event needs the answer to is the most important fairness decision you'll make.
Panel Judging is the right choice when there's meaningful prize money, career impact, or domain expertise needed to evaluate entries. A wine competition judged by a sommelier panel produces a different (and more defensible) result than the same competition judged by 200 audience members. Same goes for code quality at a hackathon final, surgical technique, or any judged event where credibility of the result has to hold up to challenge.
Audience Voting is the right choice when audience engagement is part of the show, when the entries are similar enough that anyone in the room can have a useful opinion, or when the prize is explicitly a popularity prize ("People's Choice"). Talent shows, cook-offs, watch parties, classroom favourites, internal company innovation days — all stronger with audience involvement than without.
A surprisingly common pattern at well-run events is to run both in parallel — a "Judges' Choice" award decided by the panel and a "People's Choice" award decided by audience vote. Same entries, two awards, two leaderboards. When they agree, you get a strong endorsement. When they split, you get a memorable moment with two acts on stage.
For the full breakdown, see panel judging vs audience voting. The fairness practices in the rest of this guide apply to both formats, but the specifics differ — most of what follows is written for panel judging, with a separate section near the end on what changes when the audience is voting.
Publish Your Criteria Before Anyone Submits
The most common judging disaster happens when organizers make up criteria as they go. We've seen hackathon judges suddenly decide "team diversity" matters after entries close. We've watched talent shows add "audience engagement" to scoring rubrics mid-performance. The result is always the same: angry participants who feel the rules shifted under them.
Lock your criteria down before the first submission. Then publish them everywhere.
Here's a sample rubric for a hackathon:
| Criterion | Weight | Description |
|---|---|---|
| Functionality | 30% | Core features work as intended |
| Innovation | 25% | Novel approach to the problem |
| Design | 20% | User interface and experience |
| Presentation | 15% | Clear communication of concept |
| Technical Merit | 10% | Code quality and scalability |
Each criterion needs clear scoring levels. Don't ask judges to pick a number between 1 and 10 for "innovation" — define what a 2 looks like versus an 8. Otherwise you'll get one judge who thinks everything deserves a 7 and another who never gives higher than 4.
Bias Creeps In Whether You Want It To
Three to five judges works best. A single judge's weird preferences get smoothed out across a panel.
Where possible, hide identifying information from judges. You'd be surprised how much institutional affiliation affects scoring — a judge from Stanford rates other Stanford entries higher without realizing it. Same goes for demographic factors, past interactions, and personal relationships.
Conflicts of interest need hard rules. A judge shouldn't evaluate entries from their employer, their drinking buddies, or their direct competitors. Write this policy down. Enforce it.
Before judging starts, run a calibration session. Have everyone score the same two practice entries, then compare. When Judge A gives an entry an 8 and Judge B gives it a 4, that's your chance to align interpretations. Skip calibration, and your final rankings will be partly determined by which entries happened to land on which judge's desk.

Pick Judges Who Know the Domain
Your judges need actual expertise. A coding competition judged by marketing executives produces embarrassing results. So does a cooking competition judged by people who've never worked in a kitchen.
Look for varied perspectives within the domain. Five senior developers from the same company will score differently than a panel mixing frontend specialists, backend architects, and product managers. Homogeneous panels produce homogeneous blind spots.
Make sure your judges have time. Rushed judging produces inconsistent scores. A judge evaluating 40 entries in three hours will start rubber-stamping by entry 15.
Weighted Scoring Makes Your Priorities Explicit
Not every criterion matters equally. The World Food Championships weights taste at 50%, execution at 35%, and appearance at 15%. That weighting tells contestants exactly what to optimize for.
Decide weights before competition day. Communicate them to participants in advance. And never — under any circumstances — adjust weights after judging begins. We've seen organizers tweak weights to push a preferred entry up the rankings. Participants notice. Word spreads.
Use software to calculate weighted totals. Manual math introduces errors, and errors that happen to favor certain contestants look suspicious even when they're innocent mistakes.
Decide Tie-Breakers Before You Need Them
Ties happen more than you'd expect, especially when judges cluster around the same scores. Making up tie-breaking rules after a tie occurs invites accusations of favoritism.
Pick your method in advance: the entry with the higher score in the most important criterion wins, earlier submission time wins, or a head-to-head review by a subset of judges. Document it in your competition rules. When the tie happens, you just execute the procedure.
Tell Participants Everything
Trust comes from transparency. Before judging, share your criteria, weights, judge qualifications, scoring methodology, and timeline. During judging, post progress updates if the event spans multiple days. After judging, give participants their scores — at minimum, their own results, ideally with judge feedback.
The organizers who hide scores create suspicion. The organizers who publish everything build reputations that attract better participants next year.
What Changes When the Audience Is Voting
Most of the practices above are written for a panel of judges. When the audience is the voter, the fairness levers shift. Here's what to do differently.
Criteria still matter, but keep them short. A panel of five judges scoring on six criteria is fine. An audience of 200 people on phones, scoring twelve entries on six criteria, gets bored at criterion three. For audience voting, one to three criteria is the sweet spot. If you need more dimensions, run a panel.
Calibration moves to the host, not the voters. You can't run a calibration session with 200 anonymous audience members. Instead, communicate the scale clearly from the stage: "A 10 is the best you've ever seen at a school talent show. Most acts should be in the 5–8 range." A 30-second guidance from the host before voting opens sets the frame.
Bias becomes structural, not personal. With a panel, you mitigate bias by hiding identifying information and enforcing conflict-of-interest rules. With audience voting, individual bias still happens — performer relatives vote-stuff for their kid — but the effect is bounded by group size. The bigger your audience, the less any single bad-faith voter matters. For audiences under 30 people, audience voting is too vulnerable to small-group bias; consider a panel instead.
Per-device duplicate prevention replaces per-judge accountability. Score Judge's audience voting caps each device to one ballot per competition, which stops casual double-voting but isn't unbreakable. For high-stakes events, this is why you want a panel; for casual events, the trade-off is fine.
The leaderboard is part of the format. With a panel, you usually hold results until judging is complete. With audience voting, the live leaderboard updating during voting is often part of the show — the audience is voting precisely because they want to see the result land. Pick whether to show it live or hold the reveal based on whether you want running tension or a single big moment.
Use Audience Ranking when the question is "which is best", not "how good is each". Score Judge offers two audience modes — Audience Scoring (numeric scores per criterion) and Audience Ranking (drag-to-rank, Eurovision-style point distribution). Ranking forces voters to make a clear preference order rather than giving everything a 7, which produces a more decisive winner. For talent shows and "best of" polls, it's usually the better default.
For the practical setup, see how to set up audience voting at live events.
Score Judge Handles the Mechanics
Running judging with spreadsheets creates opportunities for errors. Copying scores from paper forms to Excel, calculating weighted averages by hand, tracking which judge scored which entry — it's tedious and error-prone.
Score Judge lets multiple judges score from their own devices while results aggregate automatically. You set up your rubric once. Every judge uses the same framework. Weighted scoring and tie-breaking apply without manual calculation. Timestamps log every score for accountability.
Participants can watch the leaderboard update in real-time. That transparency removes the black-box feeling that makes people distrust results.
Running a competition soon? Give your participants the fair, transparent experience they deserve.
For use-case specific guidance, see judging software for talent shows, judging software for hackathons, and judging software for dance competitions.