claude-skill

Claude を使った AI 候補者ソーシング

Difficulty

中級

Setup time

45min

For

recruiter · sourcer · talent-acquisition

Recruiting & TA

Stack

求人プロフィールと ICP ルーブリックを受け取り、Juicebox、hireEZ、または LinkedIn リクルーターに対して AI ソーシングクエリを構築し、最大 200 名の候補者を取得し、引用された証拠と共にルーブリックに対して各候補者をスコアリングし、上位 N 名のパーソナライズされたアウトリーチを下書きする Claude スキルです — その後、人間によるレビューゲートで停止します。リクルーターはショートリストを確認してメッセージを編集して送信します。3 時間のブール検索＋スコアリング＋アウトリーチループを 30 分のレビューループに置き換えます。

使用するタイミング

四半期に 1 回以上実行するロールをソーシングしており、ICP ルーブリックが書き下ろせるほど安定している場合。
次元ごとに行動的なアンカーを持つ ICP ルーブリックがある場合（漠然としたラベルではなく）。バンドルの references/1-icp-rubric-template.md のリファレンステンプレートが形状を示しています。記入できない場合、このスキルがスコアリングできるルーブリックはまだありません。
Juicebox PeopleGPT、hireEZ、または LinkedIn Recruiter への API アクセスがある場合。スキルは公開 LinkedIn URL のスクレイピングにフォールバックしません。
アウトリーチが送信される前に、人間のリクルーターまたはソーサーがすべてのショートリストをレビューする場合。スキルはドラフトをディスクに書いて停止します。

使用しないタイミング

ループ内の自動却下。 スキルはランキングを行いますが、却下しません。「スキップ」された候補者はリクルーターがオーバーライドできる理由と共に表面化されます。reject アクションをスコア閾値に接続すると、これは自動化された意思決定になり、1 年以内の使用前にEU AI 法 Annex III 高リスク義務と NYC LL 144 バイアス監査義務を引き起こします。それが必要な場合は、このスキルではなくバイアス監査を取得してください。
保護されたクラスのプロキシでのスコアリング。 単独次元としての学校ランク、名前の起源、写真の有無、雇用ギャップのペナルティ、卒業年から推測される年齢、行動的なアンカーのない「カルチャーフィット」。スキルのフェアネスチェックリストはルーブリックにこれらのいずれかが含まれている場合は実行を拒否します。偏ったルーブリックを通過させるためにチェックリストを編集しないでください。
報酬帯の推奨。 NYC LL 32-A、コロラド、カリフォルニア、ワシントンは掲示された範囲と自動化された報酬決定に対するバイアス監査義務を要求します。ソーシングスキルではなく報酬ベンチマークツールを使用してください。
一回限りの C スイート検索。 特定の名前付き個人または狭く定義された役員の保持型検索は、人間がネットワークで行う方が速いです。スキルはルーブリックのキャリブレーションがセットアップコストを回収できる繰り返し可能な IC およびマネージャーレベルのソーシング向けに構築されています。
リファレンスチェックやバックチャンネルリサーチ。 異なる同意の姿勢。異なるワークフロー。

セットアップ

バンドルを配置する。 apps/web/public/artifacts/candidate-sourcing-claude-skill/SKILL.md を Claude Code のスキルディレクトリ（または claude.ai のカスタムスキル）に配置します。
ルーブリックを記入する。 references/1-icp-rubric-template.md を自分のリポジトリのロールごとのファイルにコピーします。すべての {placeholder} を置き換えます。スキルはルーブリックの SHA-256 を実行ごとに監査ログに記録するため、後続の編集が遡及的に確認できます。
ソースチャンネルを設定する。 Juicebox または hireEZ の API キーをスキルの設定に追加します。LinkedIn については、Recruiter API の認証情報を設定してください — スキルは公開プロフィール URL のスクレイピングを拒否します。
不採用リストと除外リストを作成する。 顧客ドメインの CSV（不採用）と exclude_list URL の CSV（最近不採用になった、沈黙期間中、オプトアウト済み）。スキルのステップ 3 の決定論的プレフィルターがこれらを LLM が候補者を見る前に適用します。
クローズしたロールでドライラン。 先四半期に手動でソーシングしたロールで実行します。スキルの上位 25 名と手動の上位 25 名を比較します。スキルのキャリブレーションが異なる場合はルーブリックアンカーを調整してください — 通常は検索クエリではなくアンカーが間違っています。

スキルの実際の動作

6 ステップ、順番通り。順番が重要です：決定論的フィルターとフェアネス事前チェックが LLM ランキングの前に来ます。汚染されたプールに LLM を解放すると、速くて自信に満ちた使えない出力が生成されるからです。

ルーブリックを検証する（references/2-fairness-checklist.md に対して）。ルーブリックに保護されたクラスのプロキシが含まれている場合は停止します。取得後ではなく取得前に失敗する選択は意図的です — バイアスのあるルーブリックをソーシングツールの API に読み込むと、すでに GDPR 22 条の自動処理としてカウントされるログエントリが残ります。
チャンネルのネイティブフォーマットで検索クエリを構築する。 次元あたり最大 5 つの同義語、取得プールを最大 200 でキャップします。プールが大きいほど低関連性候補者でモデルコンテキストが埋まりランキングが劣化します。
決定論的プレフィルター。 exclude_list の一致、不採用企業、勤務地の不一致、18 ヶ月以上古いプロフィールを除外します。これらは監査可能なフィルターです。LLM はそれらを再論議しません。
ルーブリックベースのランキング。 スキル、レベル、企業パターン、応答可能性で 1〜5 でスコアリングします。1 を超えるすべてのスコアには逐語的なプロフィール文字列が引用されます。引用なし → スコア 1。引用要件が、名前、写真、学校から推測するのではなく、プロフィールテキストに基づいてモデルを固定するものです。
人間によるレビューゲート。 shortlist.md と候補者ごとの outreach/<id>.md ファイルを書きます。停止します。スキルは send アクションを定義しません。
監査ログ。 run_id、rubric_sha256、プールサイズ、チャンネル、モデルと共に JSONL 行を 1 行追加します。PII なし。これが NYC LL 144 または EU AI 法の質問に対して実行を守ることができるものです。

ショートリストフォーマットと候補者ごとの証拠レイアウトはバンドルの references/3-shortlist-format.md に記載されています。フォーマットは固定です。なぜなら下流のコンシューマー（リクルーター、採用マネージャー、監査レビュアー）は予測可能な列が必要だからです。

コストの実態

200 名のプールから 25 名のショートリスト、Claude Sonnet 4.5 で：

取得コスト — チャンネルによって異なります。Juicebox PeopleGPT は月次クエリクォータに対してカウントされます（週複数ロールを実行すると 200 検索のスタータープランはすぐに上限に達します）。hireEZ は月次アンロック数が制約となります。LinkedIn Recruiter API には独自のシートごとの InMail と検索クォータがあります。スキルがループにあっても変わりません — 手動ブール検索で使うのと同じチャンネルクォータを消費します。
LLM トークン — 通常 80〜120K 入力トークン（ルーブリック + 200 名の候補者プロフィール抜粋 + スキル指示）と 8〜15K 出力トークン（ショートリスト + 25 件のアウトリーチドラフト）。Sonnet 4.5 でショートリスト 1 件あたり約 0.50〜0.80 USD です。月に約 80 ショートリストを実行するソーサーの全月分は 40〜65 USD のモデルコストです。
リクルーターの時間 — 勝利はここにあり、モデルコストにはありません。25 名の候補者に対する手動ブール + スコアリング + アウトリーチは 2〜3 時間です。スキルのショートリストをレビューしてドラフトを編集するのは 25〜40 分です。これがワークフローを実行する価値を生み出します。
セットアップ時間 — ルーブリックが何らかの形で既存の場合は 45 分、ルーブリックが新規の場合はより長くかかります（その場合は構造化面接がこのスキルの前提条件であり、このスキルではありません）。

成功の指標

ATS でロールごとに月次で 3 つの数字を追跡します：

アウトリーチへの返信率 — リクルーターのベースライン手動レートと一致するかそれを超えるべきです。低下した場合、アウトリーチドラフトが汎用的です — 通常はモデルではなくルーブリックが粗すぎます。
ショートリストからスクリーンの通過率 — 採用マネージャーがスクリーンの価値があると同意するショートリスト候補者の割合。安定したロールで 70% 以上であるべきです。それ以下の場合、ICP ルーブリックのキャリブレーションが間違っています。クローズしたロールで再実行して調整してください。
ロールオープンから最初の適格スクリーンまでの時間 — スキルが動かすことを意図したスループット指標。3 時間から 30 分への削減はモデル支出ではなくここに現れます。

代替案との比較

Gem AI ソーシングと比較して — Gem はリクルーターワークフロー全体を所有しています（ソーシング UI、シーケンス、分析、Ashby などとの ATS 連携）。マネージドプロダクトが必要でチームがその UI の中に生活する場合は Gem を選択してください。ルーブリック、プレフィルターロジック、監査ログを自分のリポジトリに保持し、バージョン管理し、モデルを交換可能にしたい場合はこのスキルを選択してください。
hireEZ の組み込み AI ランキングと比較して — hireEZ の AI Match は良い取得です。ギャップはルーブリックレイヤーにあります。このスキルを使用すると hireEZ を取得チャンネルとして保持し、その上に独自のルーブリック + 証拠引用スコアリングをもたらします。hireEZ のデフォルトが ICP と一致する場合、このスキルは不要です。
手動ブール + スプレッドシートスコアリングと比較して — 手動はルーブリックがリクルーターの頭の中にあり、書き下ろすことがコストに見合わない一回限りまたは役員検索に正解です。スキルは繰り返しのロールでセットアップコストを取り戻します。
LinkedIn / Juicebox API に対するカスタム Python スクリプトと比較して — プロンプトを慎重に構築すれば同じランキング品質ですが、フェアネスチェックリスト、監査ログ、人間によるレビューゲートも自分で構築します。バンドルにはそれらが含まれています。

注意事項

バイアスの増幅 — references/2-fairness-checklist.md のフェアネスチェックリストによってガードされており、ルーブリックに保護されたクラスのプロキシが含まれている場合は実行を停止します。監査ログは実行ごとに rubric_sha256 を記録するため、特定の日付に使用されたルーブリックはEU AI 法または NYC LL 144 のレビューで再現可能です。
古い LinkedIn / Juicebox データ — ステップ 3 の決定論的フィルター（18 ヶ月以上古いプロフィールを除外）とスコアリングの応答可能性次元（新鮮度を重み付け）によってガードされています。コールドストレージ候補者はアクティブに求職中の候補者を押しのけません。
LinkedIn ToS 露出 — 公開プロフィール URL のスクレイピングを拒否することによってガードされています。スキルは Recruiter API、Juicebox、または hireEZ を使用します。linkedin_recruiter が選択されて API が設定されていない場合、スキルはフォールバックせずにセットアップエラーで中断します。
自動送信のドリフト — 人間によるレビューゲート（ステップ 5）と、スキル内に send アクションが存在しないことによってガードされています。ドラフトはリクルーターが ATS / ソーシングツールの送信ボックスに貼り付けるための outreach/<id>.md ファイルに書かれます。レビューなしで AI がドラフトして送信すると、量は生まれますが質は生まれず、候補者体験を損なわせます。
報酬の透明性 — アウトリーチドラフトは数字を一切引用しません。リクルーターが給与帯の情報の発信元となれるよう（NYC LL 32-A、コロラド、カリフォルニア、ワシントンの給与透明性要件）、「スクリーンで開示される競争力のある範囲」として帯を参照します。

スタック

スキルバンドルは apps/web/public/artifacts/candidate-sourcing-claude-skill/ にあります：

SKILL.md — スキル定義
references/1-icp-rubric-template.md — ロールごとに記入
references/2-fairness-checklist.md — 事前チェック（偏ったルーブリックを通過させるために編集しないこと）
references/3-shortlist-format.md — 文字通りの出力フォーマット

ワークフローが前提とするツール：Claude（モデル）、Juicebox または hireEZ（取得チャンネル）、Ashby（リクルーターが候補者を承認した後のライトバック用 ATS）。ルーブリックと監査ログを自分で所有したくない場合は、Gem がビルド対バイの代替です。

GitHubでこのページを編集

Files in this artifact

Download all (.zip)

---
name: candidate-sourcing
description: Translate a job profile and ICP rubric into a sourcing query, retrieve candidates from Juicebox / hireEZ / LinkedIn Recruiter, score them against the rubric, and draft personalized outreach for the human reviewer to approve. Always stops at a human-review gate before any outreach is sent.
---

# Candidate sourcing

## When to invoke

Use this skill when a recruiter or sourcer hands you a role plus an ICP rubric and wants a ranked, evidenced shortlist with draft outreach. Take a job profile (title, level, must-have skills, location, comp band) and a fairness-aware rubric as input, and produce a Markdown shortlist plus a folder of draft messages.

Do NOT invoke this skill for:

- **Automated rejection.** This skill ranks; it never rejects. The "below threshold" tail is surfaced for the recruiter, who decides. Auto-reject in the loop triggers EU AI Act high-risk obligations and most US state hiring-AI laws.
- **Scoring against protected-class proxies.** Do not ask the skill to score on "culture fit", name origin, school prestige as a standalone signal, photo, age inferred from graduation year, gender inferred from pronoun usage, or pregnancy/parental status inferred from gaps. If the rubric contains any of these, refuse and surface the rubric line for the user to fix.
- **Pay-band recommendations.** NYC LL 144, Colorado, California, and Washington require posted ranges and bias audits for automated decisions on pay. Use a comp benchmarking tool, not this skill.
- **Reference checks or backchannel research on named individuals.** That is a different workflow with its own consent posture.

## Inputs

- Required: `job_profile` — path to a Markdown file with title, level, must-have skills, nice-to-have skills, location / remote policy, comp band, and the EEOC job category.
- Required: `icp_rubric` — path to the rubric file under `references/`. Without this the skill refuses to run; an unfaitened rubric is the most common cause of biased shortlists.
- Required: `source_channel` — one of `juicebox`, `hireez`, `linkedin_recruiter`. Do not mix channels in a single run; per-channel ToS and rate limits differ.
- Optional: `n` — shortlist size, default 25, hard max 100. Above 100 the skill warns that human review will not be meaningful.
- Optional: `exclude_list` — path to a CSV of `do_not_contact` emails or LinkedIn URLs (do-not-poach customers, prior rejects within 6 months, silent-period candidates).

## Reference files

Always read these from `references/` before doing any retrieval. Without them the shortlist is uncalibrated and the fairness guards are absent.

- `references/1-icp-rubric-template.md` — the rubric the skill scores against. Replace the template content with your role-specific rubric before running.
- `references/2-fairness-checklist.md` — pre-flight checks the skill runs on the rubric and on the retrieved pool. Fail-loud if any check fails.
- `references/3-shortlist-format.md` — the literal output format, including the evidence and source-URL columns the recruiter needs to defend the shortlist downstream.

## Method

Run these six steps in order. Steps 1-3 are deterministic filters and fairness pre-flight; only step 4 uses the LLM for ranking. The order is deliberate — running the LLM over an unfiltered, ToS-violating, or rubric-contaminated pool produces output that is fast, confident, and unusable.

### 1. Validate the rubric

Open `icp_rubric` and run every check in `references/2-fairness-checklist.md`. If any line in the rubric matches a protected-class proxy pattern (school-tier scoring, name-based filtering, employment-gap penalties, photo presence, "culture fit" without behavioral anchors), stop and return the offending lines to the user. Do not proceed with retrieval.

The choice to fail before retrieval rather than after is intentional: a biased rubric loaded into a sourcing tool's API leaves a log entry that counts as automated processing under GDPR Art. 22 and the EU AI Act, regardless of whether the skill ever shows the user the result.

### 2. Build the search query

Translate the job-profile must-haves into the channel's native query format:

- `juicebox` → natural-language PeopleGPT prompt, with location and level filters set as structured parameters not free text.
- `hireez` → Boolean string with explicit AND/OR/NOT grouping. Cap synonyms at 5 per dimension; longer Boolean degrades hireEZ's relevance ranking.
- `linkedin_recruiter` → use the Recruiter API with structured filters only. **Do not scrape `linkedin.com/in/` URLs** — that violates LinkedIn ToS and the *hiQ v. LinkedIn* settlement does not change ToS exposure for production sourcing.

Cap the retrieved pool at 200. Larger pools degrade rubric scoring because the LLM context fills with low-relevance candidates and the ranking flattens.

### 3. Deterministic pre-filter

Before the LLM sees any candidate, apply hard filters:

- Drop anyone in `exclude_list`.
- Drop anyone whose current company is on the do-not-poach list.
- Drop anyone whose profile was last updated more than 18 months ago (LinkedIn / Juicebox staleness signal).
- Keep only candidates whose stated location matches the role's location policy (with a configurable radius for hybrid roles).

These filters are deterministic so they can be audited. The LLM does not re-litigate them in step 4.

### 4. Rubric-based ranking

For each remaining candidate, score 1-5 on each rubric dimension (skill-match, level-fit, company-pattern-fit, response-likelihood). For every score above 1, cite the specific evidence string from the candidate's profile. No evidence string → score 1 by default.

Why a citation requirement: it forces the model to ground each score in profile text rather than infer from a name, photo, or school. Scores without evidence are the mechanism by which bias enters AI-augmented sourcing pipelines.

### 5. Human-review gate

Stop. Write the shortlist to `shortlist.md` per the format in `references/3-shortlist-format.md`. Write the draft outreach to `outreach/<candidate-id>.md`, one file per candidate. Do not call any "send" endpoint. Do not mark candidates as contacted in the ATS. Surface the path to both directories and exit.

The recruiter's job from here: read the shortlist, edit the messages, and send through the ATS or sourcing tool's outbox. The skill does not re-enter the loop until the next role.

### 6. Audit log

Append a single line to `audit/<YYYY-MM>.jsonl` containing: `run_id`, `role`, `rubric_sha256`, `pool_size_pre_filter`, `pool_size_post_filter`, `shortlist_size`, `channel`, `model_id`, `timestamp`. Do not log candidate PII to this file. The audit log exists so that under NYC LL 144 or EU AI Act questioning, the recruiter can demonstrate which rubric was used on which date.

## Output format

```markdown
# Sourcing shortlist — {Role title}

Generated: {ISO timestamp} · Channel: {channel} · Pool: {pre} → {post} · Rubric SHA: {short}

| # | Name | Current role | Current company | Skill | Level | Pattern | Response | Aggregate | Source |
|---|---|---|---|---|---|---|---|---|---|
| 1 | Jamie L. | Senior Backend Engineer | Acme Fintech | 5 | 5 | 4 | 4 | 18 | {URL} |
| 2 | ... | ... | ... | ... | ... | ... | ... | ... | ... |

## Evidence — top 5

### 1. Jamie L. (aggregate 18)

- **Skill (5)**: "5y Go, 2y Rust, led migration from monolith to event-driven services" — profile, role 2.
- **Level (5)**: "Senior IC, scope across two teams, mentors three engineers" — profile, current role.
- **Pattern (4)**: "Stripe → Plaid → Acme Fintech" — three fintech roles in sequence.
- **Response likelihood (4)**: profile updated 11 days ago, "open to opportunities" tag set.

### 2. ...

## Skipped — surfaced for review (not auto-rejected)

| Name | Reason |
|---|---|
| ... | "current company on do-not-poach list (Acme Customer)" |
| ... | "profile last updated 2023-11, staleness > 18mo" |

## Draft outreach

Drafts written to `outreach/`. Recruiter reviews and sends; this skill
does not contact candidates.

- `outreach/jamie-l.md`
- `outreach/...`
```

## Watch-outs

- **Bias amplification (NYC LL 144, EU AI Act, EEOC).** *Guard:* the fairness checklist in `references/2-fairness-checklist.md` runs in step 1 and refuses retrieval if rubric contains protected-class proxies. Audit log in step 6 stores `rubric_sha256` so the rubric used on a given run is reproducible.
- **LinkedIn ToS exposure.** *Guard:* skill uses the Recruiter API (or Juicebox / hireEZ which carry their own data licensing), never scrapes public LinkedIn pages. If the channel is `linkedin_recruiter` and the Recruiter API is not configured, the skill aborts with a setup-error rather than falling back to scraping.
- **Stale profile data.** *Guard:* deterministic filter in step 3 drops candidates with `profile_updated > 18mo`. Response-likelihood scoring in step 4 weights profile freshness explicitly so cold-storage candidates do not crowd out actively looking ones.
- **Auto-send drift.** *Guard:* skill stops at the human-review gate in step 5 and writes to `outreach/` files. There is no `send` action defined anywhere in this skill. To send, the recruiter pastes into the ATS / sourcing tool outbox.
- **Rubric drift mid-search.** *Guard:* `rubric_sha256` is captured per run; if the rubric changes between two runs for the same role, the audit log shows both hashes, making it visible in retro.
- **Compensation discussion in draft outreach.** *Guard:* outreach templates in this skill never quote a number; they reference the comp band as "competitive range disclosed on screen" so the recruiter remains the source of pay-band statements (NYC LL 32-A, CO, CA, WA pay-transparency posting).

# ICP rubric — TEMPLATE (per role)

> Replace this template's contents with the rubric for the specific role.
> The candidate-sourcing skill scores against the four dimensions below.
> Each dimension MUST have behavioral anchors — vague labels ("senior")
> without anchors produce noisy and biased scoring.

## Role identity

- **Title**: {e.g. Senior Backend Engineer, Platform}
- **Level**: {IC4 / IC5 / EM1 — your internal scale}
- **Location policy**: {remote-US / hybrid-NYC-2dpw / onsite-Berlin}
- **EEOC job category**: {2 — Professionals (most engineers); see EEO-1}
- **Comp band (recruiter-internal, never sent to skill output)**: {range}

## Dimension 1 — Skill match (1-5)

The candidate's profile shows direct experience with the must-have technologies and the specific problem-shape of the role.

| Score | Anchor |
|---|---|
| 5 | Held a role doing exactly this work for ≥2 years; cites artifacts (talks, OSS, posts). |
| 4 | Held a role doing exactly this work for ≥1 year; no artifacts. |
| 3 | Adjacent work (e.g. Java backend role for a Go role); transferable. |
| 2 | Tangential work; would require ramp. |
| 1 | No evidence in profile. |

## Dimension 2 — Level fit (1-5)

The candidate's stated scope and tenure pattern match the level the role is hiring at. Do NOT use school prestige, employer prestige, or title inflation as a level signal — anchor on scope description.

| Score | Anchor |
|---|---|
| 5 | Profile shows scope at or above target level (multi-team, mentoring, technical strategy). |
| 4 | Scope at target level for ≥1 year. |
| 3 | One level below target; growth trajectory plausible. |
| 2 | Two levels below; reach. |
| 1 | More than two levels off, in either direction. |

## Dimension 3 — Company-pattern fit (1-5)

The shape of the candidate's prior employers matches the shape of yours (stage, scale, regulated/unregulated, B2B/B2C). Anchor on *characteristics*, not brand names — brand-name scoring is the most common bias vector in AI-augmented sourcing.

| Score | Anchor |
|---|---|
| 5 | ≥2 prior employers match {stage/scale/domain pattern}. |
| 4 | 1 prior employer matches; others adjacent. |
| 3 | All adjacent (different domain, similar stage). |
| 2 | Mostly mismatched; one transferable role. |
| 1 | No pattern match. |

## Dimension 4 — Response likelihood (1-5)

How likely the candidate is to respond to outreach right now.

| Score | Anchor |
|---|---|
| 5 | Profile updated <30 days; "open to opportunities" set; recently posted about job search. |
| 4 | Profile updated <90 days. |
| 3 | Profile updated <180 days. |
| 2 | Profile updated <12 months. |
| 1 | Stale profile (>12 months) — *also flagged in pre-filter for drop at >18mo*. |

## Disqualifiers (deterministic, applied in step 3 of the skill)

These cause the candidate to be surfaced in the "skipped" table, not auto-rejected. The recruiter decides.

- Current company is on do-not-poach list (`{path-to-list}`).
- Email or LinkedIn URL appears in `exclude_list`.
- Stated location does not match role's location policy + radius.
- Profile last updated >18 months ago.

## Bias guards (refusal triggers — skill aborts in step 1 if present)

If any of the following appear in this rubric, the skill refuses to run:

- School-tier scoring as a standalone dimension.
- Name-based filtering or scoring.
- Photo-based scoring.
- Employment-gap penalties without a job-related justification.
- Age inferred from graduation year used in any dimension.
- Gender, ethnicity, religion, sexual orientation, parental status, or disability status as a scored or filtered dimension.
- "Culture fit" without behavioral anchors.

## Last edited

{YYYY-MM-DD} — bump on every material change. The skill captures the SHA-256 of this file in its audit log per run.

# Fairness pre-flight checklist

> The candidate-sourcing skill runs every check below in step 1 (rubric
> validation) and step 3 (post-filter pool review). Any failed check
> halts the run with a message naming the failure. Do not edit this file
> to make checks pass — fix the rubric or the search instead.

## A. Rubric checks (run before retrieval)

A1. **No protected-class proxies.** Scan the rubric for any of the following terms or patterns. Any hit halts the run:

- `school`, `university`, `Ivy`, `tier-1`, `top-N` (when used as a scoring dimension, not as one signal among many)
- `name origin`, `surname`, `first name`
- `photo`, `headshot`, `appearance`
- `age`, `years since graduation`, `birth year`
- `gender`, `pronoun`, `she/her`, `he/him` (as filter terms)
- `ethnicity`, `race`, `nationality` (except where required for immigration-status filtering with documented legal basis)
- `pregnant`, `parental`, `maternity`, `paternity`
- `disability`, `accommodation`
- `religion`, `political`, `marital`
- `culture fit` without a behavioral-anchor table immediately following

A2. **Anchors present on every dimension.** Each rubric dimension must have a 1-5 anchor table. Anchors prevent the LLM from scoring on vibes. Halt if any dimension has free-text anchors only.

A3. **Disqualifier list is short and mechanical.** Disqualifiers must be deterministic facts (do-not-poach list, location mismatch, staleness). Halt if a disqualifier requires judgment (e.g. "not a culture fit", "seems junior").

A4. **Comp band is recruiter-internal.** The skill's output must not quote a comp number to the candidate. Outreach templates reference the band as "competitive range disclosed on screen". Halt if the rubric includes a "send comp in outreach" instruction.

## B. Pool checks (run after deterministic pre-filter, before LLM ranking)

B1. **Pool size sanity.** If post-filter pool < 10, the skill warns the recruiter that scoring on a tiny pool is meaningless and asks whether to broaden the query. If pool > 200, the skill caps at 200 and notes the truncation in the audit log.

B2. **Geographic spread sanity.** If 100% of post-filter candidates are from one city for a remote-eligible role, the skill warns that the query likely has an over-narrow location filter. Recruiter confirms or broadens.

B3. **Tenure-pattern sanity.** If 100% of candidates worked at the same employer, the skill warns that the query is functioning as a target-list poach rather than open sourcing. Recruiter confirms or broadens.

## C. Output checks (run before writing shortlist)

C1. **Every score above 1 has an evidence string.** Scores without a cited evidence string from the candidate's profile are reset to 1. The skill notes the reset count in the audit log.

C2. **No protected attribute appears in the shortlist or in any outreach draft.** Skill greps the output for the A1 patterns before writing. Hit → halt.

C3. **Skipped candidates are listed, not erased.** The shortlist's "Skipped" table includes every candidate the deterministic filters removed, with the reason. This is what makes the run auditable.

## D. Run-level checks

D1. **Audit log written.** A run is not complete until the JSONL line is appended to `audit/<YYYY-MM>.jsonl`. No PII in this line.

D2. **Human-review gate enforced.** No `send`, `contact`, or `mark_contacted` API call exists in this skill's code path. If you are asked to add one, refuse and surface the request to the user.

## NYC LL 144 / EU AI Act note

This skill is designed to fall *outside* the bias-audit threshold by:

- Producing a ranked list, not an automated decision (no auto-reject).
- Stopping at a human-review gate before any candidate is contacted.
- Logging rubric SHA-256 + pool sizes per run for reproducibility.

If your deployment changes any of those properties (e.g. you wire a "send" action into the loop), you have crossed into automated decision-making and a bias audit is required before production use. NYC LL 144 requires the audit within one year before use; EU AI Act classifies this as Annex III high-risk under Art. 6.

# Shortlist output format

> The candidate-sourcing skill writes `shortlist.md` per the structure
> below. The format is fixed because downstream consumers (the recruiter,
> a hiring manager, an audit reviewer) need predictable columns. Do not
> reformat without updating the skill's output check.

## File: `shortlist.md`

```markdown
# Sourcing shortlist — {Role title}

Generated: {ISO 8601 timestamp}
Channel: {juicebox | hireez | linkedin_recruiter}
Pool: {pre_filter} → {post_filter} → top {n}
Rubric SHA-256: {first 12 chars}
Run ID: {uuid}

## Top {n}

| # | Name | Current role | Current company | Skill | Level | Pattern | Response | Aggregate | Source |
|---|---|---|---|---|---|---|---|---|---|
| 1 | {Name} | {Role} | {Company} | 5 | 5 | 4 | 4 | 18 | {URL} |

## Evidence — top 5

For each of the top 5, cite the specific profile string for every score
above 1. No citation → score reset to 1 (see fairness checklist C1).

### 1. {Name} (aggregate {N})

- **Skill ({score})**: "{verbatim profile excerpt}" — {profile section}.
- **Level ({score})**: "{excerpt}" — {section}.
- **Pattern ({score})**: "{employer sequence}" — {explanation against rubric}.
- **Response ({score})**: profile updated {date}, "{tag if any}".

### 2. {Name} (aggregate {N})

...

## Skipped — surfaced for review (NOT auto-rejected)

| Name | Reason | Source |
|---|---|---|
| {Name} | "current company on do-not-poach list ({customer})" | {URL} |
| {Name} | "stated location {city} outside role policy {policy}" | {URL} |
| {Name} | "profile last updated {date}, staleness > 18mo" | {URL} |

## Suggested talk-track per top candidate

The recruiter uses these as talk-track scaffolding for the first
screening call. They are NOT scripts.

### 1. {Name}

- **Open with**: their {recent role / talk / OSS contribution} — specific
  reference, not a generic compliment.
- **Likely motivation hypothesis**: {evidence-based, e.g. "third fintech
  role in a row, may be looking for a non-fintech reset; ask"}.
- **Hesitation to surface**: {e.g. "current company is well-funded; ask
  what would have to be true for them to consider a move"}.

### 2. {Name}

...

## Outreach drafts

Drafts written to `outreach/{candidate-id}.md`, one file per candidate.
The recruiter reviews, edits, and sends through the ATS or sourcing
tool's outbox. The skill does not contact candidates.

- `outreach/{id-1}.md`
- `outreach/{id-2}.md`
- ...
```

## File: `outreach/<candidate-id>.md`

```markdown
# Outreach draft — {Name}

Channel: {LinkedIn InMail | email | Juicebox sequence}
Subject: {≤60 chars, references a specific signal from the profile}

---

Hi {first name},

{One sentence referencing a specific, recent thing from their profile —
the {recent role / talk / project / post}. Not a flattery line.}

I'm hiring a {role title} at {company}. The reason I reached out is
{specific connection between their background and the role — cite the
profile signal}. The role's {one specific differentiator that would
matter to someone with this background}.

If you're open to a 15-minute conversation, I'm happy to share more. The
comp range will be disclosed on screen if we get to that step.

{Recruiter name}

---

## Recruiter-only metadata (strip before sending)

- Aggregate score: {N}
- Top evidence string: "{excerpt}"
- Source URL: {URL}
- Run ID: {uuid}
- Reviewed by recruiter: [ ]
- Sent: [ ]
```

## Why these fields are non-negotiable

- **`Source` URL on every row** — required for the recruiter to spot-check the LLM's evidence claims against the actual profile.
- **`Pool: pre → post → top N`** — surfaces how many candidates were filtered out deterministically vs. by the LLM. Big LLM-side cuts on a small post-filter pool is a signal of overfitting to rubric noise.
- **`Rubric SHA-256`** — proves which rubric was used on this run (NYC LL 144 audit defense + EU AI Act traceability).
- **`Skipped` table** — candidates filtered out are listed with reasons, not erased. Erasing them turns the workflow into automated rejection.
- **Recruiter-only metadata in outreach** — stripped before sending; its presence in the draft is what reminds the recruiter the message is a draft, not a finished product.