n8n-flow

Candidate rediscovery for silver medalists with n8n

Difficulty

Fortgeschritten

Setup time

60min

For

recruiter · sourcer · talent-acquisition

Recruiting & TA

Stack

Ein n8n-Flow, der Greenhouse auf neu eröffnete Reqs überwacht, die früheren Kandidaten findet, die bei einem verwandten Req eine späte Interviewstufe erreicht haben und aus einem nicht-disqualifizierenden Grund abgelehnt wurden — die „Silbermedaillengewinner“ —, jeden einzelnen mit Claude erneut gegen das Rubric des neuen Reqs bewertet und eine gerankte Shortlist in einen Slack-Kanal postet. Er kontaktiert niemanden, fügt nie einen Kandidaten zu einer Pipeline hinzu und bewegt nie einen Kandidaten im ATS. Über jede Kontaktaufnahme entscheidet der Recruiter. Er verwandelt „wir haben letztes Frühjahr jemand anderen eingestellt, wer war noch mal der Zweitplatzierte?“ von einer 40-minütigen archäologischen Grabung in eine Slack-Nachricht, die in der Stunde landet, in der der Req eröffnet wird.

Wann zu verwenden

Sie arbeiten mit Greenhouse (oder einem anderen ATS mit einer Lese-API — die Intake-Nodes lassen sich austauschen), und Sie eröffnen genügend Reqs in wiederkehrenden Job-Familien, sodass die Finalisten des letzten Jahres die Shortlist dieses Jahres sind.
Sie lehnen Finalisten tatsächlich mit strukturierten Ablehnungsgründen ab. Das gesamte Sicherheitsmodell des Flows beruht darauf, „jemand anderen eingestellt“ von „die Background-Prüfung nicht bestanden“ zu unterscheiden. Wenn Ihr Team alle mit einem einzigen generischen Grund ablehnt, beheben Sie das zuerst; der Flow hat nichts, woran er gaten könnte.
Sie haben Feeder-Reqs, auf die Sie zeigen können. Der Flow rät nicht, welche vergangenen Reqs „verwandt“ sind — Sie listen die vergangenen Greenhouse-Job-IDs je Job-Familie in einer Konfigurationsdatei auf. Das macht den Match auditierbar statt zu einer Ähnlichkeits-Blackbox.
Ein Recruiter geht das Digest durch und entscheidet über die Kontaktaufnahme. Der Flow bringt zutage und rankt; ein Mensch screent erneut und kontaktiert.

Wann NICHT zu verwenden

Automatische Kontaktaufnahme im Loop. Der Flow rankt und postet in Slack; er mailt nie, fügt nie zu einer Sequenz hinzu, bewegt nie eine Stufe. Einen Outreach-Versand an das Digest zu verdrahten, verwandelt einen Wiederkontakt-Vorschlag in automatisierte Verarbeitung von Kandidatendaten — und einen Kandidaten nach Ablauf der ihm gegenüber offengelegten Aufbewahrungsfrist erneut zu kontaktieren, ist eine GDPR-Verletzung, kein Growth-Hack. Die Confirm first:-Zeile pro Kandidat im Digest existiert genau deshalb, damit ein Recruiter vor jeder Nachricht Einwilligung und Aktualität prüft.
Kein Aktualitätsfenster. GDPR verlangt, dass Sie Kandidatendaten nicht über die dem Kandidaten mitgeteilte Aufbewahrungsfrist hinaus halten oder erneut verarbeiten — üblicherweise 12–24 Monate für erfolglose Bewerber. Das recency_months-Gate des Flows entfernt jeden, der außerhalb des Fensters liegt. Es länger als Ihre angegebene Aufbewahrungsfrist zu setzen, um den Pool zu erweitern, ist die eine Bearbeitung, die diesen Flow in eine Haftung verwandelt.
Ablehnungsgründe, denen Sie nicht trauen können. Wenn „Position filled“ stillschweigend für „wir hatten Bedenken“ verwendet wird, kann die Deny-List Sie nicht schützen. Der Flow ist nur so sicher wie die Disziplin bei den Ablehnungsgründen, die dahintersteht.
Winzige oder einmalige Einstellungen. Ein Team, das drei unzusammenhängende Reqs pro Jahr eröffnet, ist schneller, wenn es sein eigenes Gedächtnis durchgeht, als ein Rubric und eine Feeder-Req-Liste zu verfassen. Das Setup zahlt sich aus, wenn eine Job-Familie wiederkehrt.
Vertrauliche oder Executive-Suchen. Andere Einwilligungshaltung, andere Audit-Kette. Diese gehören nicht in einen geteilten Slack-Kanal.

Setup

Importieren Sie den Flow. Legen Sie apps/web/public/artifacts/candidate-rediscovery-n8n/candidate-rediscovery-n8n.json in Ihre n8n-Instanz. Jede Node trägt notesInFlow: true, sodass die Notizen auf dem Canvas jede Entscheidung erklären.
Verdrahten Sie die Credentials. Drei: PLACEHOLDER_GREENHOUSE_CRED_ID (Harvest-API-Key, nur Lese-Scope — Jobs, Applications, Scorecards), PLACEHOLDER_ANTHROPIC_CRED_ID (Claude-API-Key), PLACEHOLDER_SLACK_CRED_ID (Slack-Bot-Token mit chat:write für #talent-rediscovery). Die _README.md des Bundles zeigt, wo jeder Wert liegt.
Verfassen Sie eine Konfigurationsdatei pro Job-Familie unter ${CONFIG_DIR}/<family>.json. Sie enthält die match_job_ids (die Feeder-Reqs), min_stage_reached (das Gate für die späte Stufe), die Allow- und Deny-Listen der Ablehnungsgründe, recency_months, fit_threshold, top_n und das Rubric. Das vollständige Format steht in _README.md. Keine Konfiguration für eine Familie → der Flow hält mit missing_config an, statt gegen Defaults zu bewerten.
Setzen Sie den Lookback. POLL_LOOKBACK_HOURS muss ≥ dem Schedule-Intervall sein (Standard 6h), sonst rutscht ein zwischen den Polls eröffneter Req durch. Beide werden zusammen abgestimmt.
Dry-Run auf einer Familie, für die Sie gerade eingestellt haben. Die Zweitplatzierten, an die Sie sich erinnern, sollten nahe der Spitze des Digests landen. Stimmen Sie min_stage_reached und die Rubric-Anker an Ihrem Gedächtnis ab, bevor Sie ihm bei einer frischen Familie vertrauen.
Aktivieren Sie den Trigger. Stellen Sie active: true erst um, nachdem ein Digest gekommen ist, auf das Sie tatsächlich handeln würden.

Was der Flow macht

Zwölf Nodes, in Reihenfolge. Die deterministischen Einwilligungs- und Fairness-Gates laufen vor dem Modell-Aufruf, denn ein LLM auf das vollständige Ablehnungsarchiv loszulassen, ist der Weg, jemanden erneut zu kontaktieren, der Sie gebeten hat, ihn nie zu kontaktieren.

Every 6 Hours — Schedule-Trigger. Greenhouse hat keinen zuverlässigen Job-Created-Webhook, also pollt der Flow.
Fetch New Open Reqs — GET /v1/jobs?status=open&created_after=… gegen Greenhouse Harvest. Das JSON-Array wird in ein Item pro neuem Req aufgeteilt.
Load Match Config — löst die Job-Familie des Reqs auf, lädt deren Konfiguration, hasht sie für das Audit-Log. Hält bei missing_config an.
Config Loaded? — IF-Gate; Reqs ohne Konfiguration stoppen hier.
Fetch Rejected Pool — GET /v1/applications?status=rejected&last_activity_after=…, paginiert. Ein Item pro abgelehnter Application.
Eligibility Filter — der Fünf-Gate-Boden: Feeder-Req-Match, späte Stufe erreicht, Ablehnungsgrund Allow/Deny (Deny gewinnt), Aktualitätsfenster, Do-not-contact-Unterdrückung. Alles andere wird verworfen, bevor irgendein Modell es sieht.
Fetch Scorecards — zieht die früheren Interview-Scorecards des Kandidaten, den Grounding-Text für den Re-Match.
Claude Re-Match — bewertet den früheren Kandidaten gegen das Rubric des neuen Reqs auf Sonnet 4.6, mit der expliziten Anweisung, die alte Ablehnungsentscheidung nicht zu übernehmen und nicht anhand von Proxys für geschützte Merkmale zu bewerten. Evidenz erforderlich: kein wörtliches Scorecard-Zitat → Fit 1.
Parse + Keep — erzwingt die Evidenzregel, markiert Keep, wenn Fit ≥ dem Konfigurations-Threshold ist.
Audit Append — eine pseudonyme JSONL-Zeile pro bewertetem Kandidaten (Kandidaten-ID + Link, kein Name, kein Scorecard-Text).
Build Digest — gruppiert nach Req, dedupliziert einen Kandidaten, der über zwei Feeder-Reqs gematcht hat (höherer Fit gewinnt), rankt, kürzt auf top_n.
Slack Digest — postet eine gerankte Shortlist pro Req in #talent-rediscovery, jeden Kandidaten mit einem einzeiligen Grund zum erneuten Auftauchen und einer Confirm first:-Notiz.

Kostenrealität

Anthropic-API-Token — jeder Kandidat sendet Scorecard-Text + Rubric (~4–5k Input-Token) und liefert ~300 Output-Token zurück. Bei der Sonnet-4.6-Listenpreisgestaltung landet das bei rund $0,015–0,03 pro bewertetem Kandidaten, sodass eine Familie, die 200 berechtigte Silbermedaillengewinner zieht, etwa $3–6 pro eröffnetem Req kostet (aus Token-Zählungen berechnet, nicht an Ihren Daten gemessen).
Greenhouse-Harvest-Aufrufe — nur lesend: ein Jobs-Poll, ein paginierter Applications-Pull, ein Scorecards-Fetch pro berechtigtem Kandidaten. Das bleibt für jede realistische Familiengröße innerhalb des dokumentierten Per-Key-Rate-Limits von Harvest.
n8n-Kosten — selbst gehostet ist im Container kostenlos. Der Starter-Plan von n8n Cloud deckt das Polling-Volumen ab; nur sehr hoher Req-Durchsatz braucht Pro.
Recruiter-Zeit — der Gewinn. Eine Silbermedaillengewinner-Liste über vergangene Reqs hinweg von Hand zu rekonstruieren, dauert den besseren Teil einer Stunde pro Req; das Digest landet gerankt, mit Einwilligungs-Flags und Re-Screen-Prompts vorbereitet, in den Minuten nach Eröffnung des Reqs.
Die Ökonomie hinter dem Gewinn. Veröffentlichte Recruiting-Benchmarks setzen die Cost-per-Hire über $4.500 und die Ersparnis eines wiederentdeckten Hires bei rund $2.000–3.000 an, wobei die Time-to-Fill bei Wiederentdeckungs-Hires um 20–30 Tage sinkt. Teams starten typischerweise bei einer Wiederentdeckungsrate von 5–15 % und zielen innerhalb eines Jahres auf 35–50 %; die Benchmark für die Einstellungsrate von Silbermedaillengewinnern liegt bei rund 8–15 %. Der Flow existiert, um das Erreichen dieser Zahlen zum Default zu machen, nicht zu einem Quartalsprojekt.

Erfolgsmetrik

Verfolgen Sie drei Zahlen pro Job-Familie pro Quartal:

Shortlist-zu-Screen-Rate — Anteil der Digest-Kandidaten, die ein Recruiter zu einem Re-Screen mitnimmt. Unter ~20 % bedeutet, dass das Rubric oder min_stage_reached zu locker ist; ziehen Sie die Anker an, bevor Sie den Pool erweitern.
Wiederentdeckungs-Einstellungsrate — Anteil der Einstellungen in der Familie, die aus dem Digest stammen. Die Benchmark von 8–15 % ist das Ziel; unter 5 % nach zwei Quartalen bedeutet, dass die Feeder-Req-Liste oder das Aktualitätsfenster zu eng ist.
Zeit von Req-Eröffnung bis zum ersten qualifizierten Slate — die Metrik für Candidate Experience und Hiring-Manager. Das Digest sollte das von Tagen auf denselben Tag verschieben.

vs Alternativen

vs Wiederentdeckung durch Gem oder hireEZ — das sind verwaltete Talent-CRM-Produkte mit eigenen Re-Engagement-Kampagnen und einem Candidate-Graph; wählen Sie sie, wenn Sie die Plattform wollen und das Budget es trägt. Wählen Sie den Flow, wenn Sie die Matching-Regeln, die Deny-List und das Audit-Log versionskontrolliert in Ihrem eigenen Repo wollen, auf die von Ihnen gewählten Feeder-Reqs zugeschnitten, mit dem Digest, das in Ihrem Stack landet.
vs Greenhouses eigene „Prospect-Pool“-Suche — die native Suche findet Kandidaten nach Keyword und Stufe, bewertet sie aber nicht erneut gegen das Rubric eines neuen Reqs mit zitierter Evidenz, und das Relevanz-Ranking ist eine Blackbox. Wählen Sie den Flow, wenn die reason_to_resurface- und Confirm first:-Zeilen pro Kandidat das sind, was den Recruiter zum Handeln bringt.
vs einem Recruiter, der das ATS manuell durchforstet — gleiche Qualität an einem guten Tag, aber der Recruiter vergisst das Aktualitätsfenster, überspringt unter Termindruck die Deny-List und macht es nur für die Reqs, an die er sich erinnert. Der Flow macht es für jeden wiederkehrenden Req, jedes Mal, mit nicht-optionalen Einwilligungs-Gates.

Fallstricke

Wiederkontakt über die Aufbewahrungsfrist hinaus. Schutz: das recency_months-Gate entfernt vor der Bewertung jeden, der außerhalb des offengelegten Aufbewahrungsfensters liegt, und das Audit-Log erfasst das verwendete Fenster. Setzen Sie es auf Ihre angegebene Aufbewahrungsfrist oder kürzer — niemals länger, um den Pool zu vergrößern.
Disqualifizierte Kandidaten, die wieder auftauchen. Schutz: die Deny-List der Ablehnungsgründe läuft vor dem Modell, und Deny gewinnt über Allow. Nicht bestandene Background-/Referenzprüfungen, Bedenken zum Verhalten, fehlende Arbeitserlaubnis und explizite Do-not-contact-Gründe können nie das Digest erreichen. Die Disziplin hängt von ehrlichen Ablehnungsgründen vorgelagert ab.
Bias-Übertragung aus alten Entscheidungen. Schutz: das Modell wird angewiesen, das frühere Ablehnungsurteil nicht zu übernehmen — ein Kandidat, der übergangen wurde, weil jemand anderes gewählt wurde, kann für einen neuen Req eine 5 sein — und nicht anhand von Name, Schule als eigenständigem Signal, Alter, Geschlecht oder Beschäftigungslücken zu bewerten. Der config_sha im Audit-Log macht die an jedem beliebigen Datum verwendeten Matching-Regeln unter einem AI-Screening-Bias-Review reproduzierbar.
Veralteter Kandidatenstatus. Schutz: die Confirm first:-Zeile pro Kandidat im Digest zwingt den Recruiter, vor der Kontaktaufnahme zu verifizieren, dass die Person noch in der Region, noch interessiert und noch passend ist; der Flow behauptet einen Match, keine aktuelle Tatsache. Anderswo aktive Kandidaten sind die Prüfung des Recruiters in Greenhouse, vermerkt in den bekannten Grenzen des Bundles.
Dünne Scorecards, die niedrig bewerten. Schutz: der Scorecard-Text ist das einzige Grounding, sodass ein Kandidat, der vor substanziellen Interviews abgelehnt wurde, per Design niedrig bewertet. Heben Sie min_stage_reached an, statt das Modell mit Lebensläufen zu füttern, die es nicht sehen kann.

Stack

Das Artefakt-Bundle liegt unter apps/web/public/artifacts/candidate-rediscovery-n8n/ und enthält:

candidate-rediscovery-n8n.json — der n8n-Flow-Export (jede Node konfiguriert, keine Stub-Parameter)
_README.md — Credential-Setup, Konfigurationsdatei-Format, die Einwilligungs- und Fairness-Gates, die Dry-Run-Prozedur

Tools, die der Workflow voraussetzt: Greenhouse (das ATS — wechseln Sie zu Ashby oder Lever, indem Sie die Intake-Nodes ersetzen), Claude (der Re-Match-Scorer), n8n (die Orchestrierung), Slack (die Entscheidungsoberfläche des Recruiters). Für das Triagieren frischer Inbounds gegen ein Rubric siehe den Inbound-Bewerber-Triage-Flow; für das Aufwärmen der Kandidaten, die dieser Flow zutage bringt, siehe die Candidate-Engagement-Sequenz und den Candidate-Sourcing-Claude-Skill.

Diese Seite auf GitHub bearbeiten

Files in this artifact

Download all (.zip)

# Candidate rediscovery (silver medalists) — n8n flow

This flow polls Greenhouse for newly-opened reqs, finds past candidates who reached a late stage on a related req and were rejected for a non-disqualifying reason ("silver medalists"), re-scores each against the new req's rubric with Claude (Sonnet 4.6 by default), and posts a ranked shortlist to Slack. It never contacts a candidate, never adds anyone to a pipeline, and never moves a candidate in Greenhouse. The recruiter decides every outreach.

This README covers import, credentials, the per-job-family config format, the consent and fairness gates, and the dry-run procedure.

## Import

1. Open n8n → Workflows → Import from file → pick `candidate-rediscovery-n8n.json`.
2. Set the workflow timezone (top of the canvas) to your team's working timezone for sane audit-log timestamps. The default is UTC.
3. Do not enable the workflow yet. Configure credentials and at least one job-family config, complete the dry-run, then flip to enabled.

## Credentials (three required)

### `PLACEHOLDER_GREENHOUSE_CRED_ID` — Greenhouse Harvest API key

- Greenhouse admin → Configure → Dev Center → API Credential Management → Create New API Key → type "Harvest". Grant only the read permissions the flow uses: `GET` on Jobs, Applications, and Scorecards. The flow never writes to Greenhouse.
- In n8n, create an HTTP Basic Auth credential. Username = the API token. Password = empty. (Harvest authenticates as base64 of `token:` with a trailing colon — n8n's Basic Auth credential does this for you.)
- Bind the credential to the three Greenhouse nodes: `Fetch New Open Reqs`, `Fetch Rejected Pool`, `Fetch Scorecards`.

### `PLACEHOLDER_ANTHROPIC_CRED_ID` — Anthropic API key

- console.anthropic.com → API Keys → Create Key. Restrict by IP if your n8n is behind a fixed egress.
- In n8n, create a credential of type "Anthropic API". Paste the key.
- Bind to the `Claude Re-Match` node. The model is set to `claude-sonnet-4-6` in the request body — change it there if you want to test other models.

### `PLACEHOLDER_SLACK_CRED_ID` — Slack bot token

- Create (or reuse) a Slack app with the `chat:write` scope. Install to the workspace. Invite the bot into `#talent-rediscovery`.
- In n8n, create a Slack credential with the bot token (`xoxb-…`).
- Bind to the `Slack Digest` node.

### Environment variables

- `CONFIG_DIR` — directory holding the per-job-family config files. Default `/data/rediscovery`.
- `AUDIT_DIR` — directory for the JSONL audit log. Default `/data/audit`.
- `POLL_LOOKBACK_HOURS` — how far back `Fetch New Open Reqs` looks for newly-opened reqs. Must be **≥** the schedule interval (default 6) or a req opened between polls will be missed. Default 6.

## Config file format (one per job family)

The flow expects one config file per job family at `${CONFIG_DIR}/<family>.json`. The family is resolved from the new req's `job_family` custom field, or — if that is absent — the slugified name of the req's first department. Missing config → the flow halts for that req with `missing_config` and leaves the req for manual sourcing.

The config is the only place the matching rules live. Copy this, replace every value, and save as `<family>.json`:

```json
{
"job_family": "backend-engineer",
"version": "2026-06-15",
"match_job_ids": [4012, 3987, 3654],
"recency_months": 18,
"min_stage_reached": ["Onsite", "Final Interview", "Reference Check", "Offer"],
"rejection_reasons_allow": [
"Position filled — strong candidate",
"Hired another candidate",
"Kept warm for future role",
"Timing — not ready to move"
],
"rejection_reasons_deny": [
"Failed background check",
"Not legally authorized to work",
"Conduct / values concern",
"Failed reference check",
"Withdrew — compensation mismatch",
"Do not contact"
],
"do_not_contact_tags": ["do-not-contact", "gdpr-erased", "opted-out"],
"fit_threshold": 4,
"top_n": 10,
"rubric": {
"role": "Senior Backend Engineer (Distributed Systems)",
"dimensions": {
"fit": {
"must_have": [
"Production Go or Rust (3y+)",
"Owned a distributed-system migration"
],
"anchors": {
"5": "Late-stage scorecards show owned, measurable distributed-system outcomes that map to this req's must-haves",
"4": "Strong scorecards on the core skill; one must-have unconfirmed",
"3": "Adjacent skills; would need a fresh screen on the core must-have",
"2": "Partial overlap; likely a stretch for this req",
"1": "No scorecard evidence the candidate matches this req"
}
}
}
}
}
```

- `match_job_ids` are the **feeder reqs** — the past Greenhouse job IDs whose late-stage rejects count as silver medalists for this family. Find them in the URL of each job in Greenhouse. This is what scopes "related req"; the flow does not guess relatedness.
- `min_stage_reached` is the late-stage gate. A candidate rejected at "Application Review" or "Phone Screen" is not a silver medalist — they never got a real read. Use your own stage names exactly as they appear in Greenhouse.
- `rejection_reasons_deny` is the safety floor and **deny wins over allow**. Any disqualifying reason — failed background/reference check, conduct, no work authorization, an explicit do-not-contact — must be listed here so the candidate is never re-surfaced.
- The config is hashed (SHA-256, first 16 hex chars) per run and the hash is written to the audit log and the Slack footer, so the exact rules used on a given date are reproducible.

## Consent and fairness gates (do not weaken to widen the pool)

Two layers protect the candidate, both **before** the LLM call:

1. **`Eligibility Filter`** drops any application that is not a feeder-req match, did not reach a late stage, carries a disqualifying or non-allow-listed rejection reason, falls outside the recency window, or whose candidate carries a do-not-contact / erasure / opt-out tag.
2. **`Claude Re-Match`** is instructed not to inherit the prior reject decision and not to score on protected-class proxies (name, school as a standalone signal, age, gender, employment gaps), and to cite verbatim scorecard evidence — no citation forces fit to 1.

The recency window exists because GDPR requires you not to hold or re-process candidate data beyond the retention period you told the candidate about — commonly 12–24 months for unsuccessful applicants. Set `recency_months` to your stated retention period or shorter; never longer. Candidates past the window are dropped, not re-contacted.

If you find yourself wanting to delete a deny-list reason or stretch the recency window to grow the shortlist, that is exactly the decision a recruiter — not the flow — should make case by case, in Greenhouse, with the candidate's consent status in view.

## Dry-run procedure

1. Author one config file for a family where you recently filled a role and remember the runner-up candidates.
2. Temporarily point `match_job_ids` at the feeder reqs and set the new-req trigger to fire manually: in n8n, click "Execute workflow" with `Fetch New Open Reqs` returning the already-open target req (or pin a sample job item).
3. Read the Slack digest. The runner-ups you remember should appear near the top. If a known strong silver medalist is missing, check, in order: were they within the recency window, did they reach a `min_stage_reached` stage, was their rejection reason allow-listed, do they carry a suppression tag.
4. If obvious mis-fits rank high, the rubric anchors are too loose or the scorecards are thin — look at the `evidence` line in the digest. Empty or paraphrased evidence means the model had little to work with (the candidate was rejected before substantive interviews); raise `min_stage_reached`.
5. Only switch the workflow `active: true` after a digest you would actually act on.

## First-run sanity check

After enabling, watch the first real digest:

1. Confirm the `Confirm first:` line on each candidate is specific (e.g. "still in-region; was a 2024 reject so re-screen on the new framework"). Generic lines mean the model is guessing — check it is on Sonnet 4.6.
2. Confirm the `config <sha>` in the Slack footer matches the file you authored. A mismatch means the wrong family file loaded.
3. Confirm `${AUDIT_DIR}/rediscovery-<YYYY-MM>.jsonl` exists and has one line per scored candidate. No file means you are operating without the audit trail that a GDPR / EEOC inquiry about automated re-contact would require.

## Known limits

- **Active-elsewhere check is the recruiter's, not the flow's.** The pool query returns rejected applications only, so it cannot tell whether a candidate is currently active on another open req. The recruiter sees that in Greenhouse before reaching out; the flow does not auto-suppress active candidates.
- **A candidate who matched via two feeder reqs is scored twice**, then de-duplicated in `Build Digest` (the higher fit wins). The duplicate scoring is a small, bounded token cost, not a correctness problem.
- **Scorecard text is the only grounding.** Greenhouse does not return parsed resume text via Harvest, so a candidate rejected before any substantive interview has thin scorecards and will score low even if their resume is a strong match. That is intended: re-surface people you actually evaluated, not your whole archive.
- **No dedupe table across runs.** If the same req stays open across two polls it will not re-fire (the `created_after` filter only catches newly-opened reqs), but re-opening a req would re-digest it. The audit log makes repeats visible; add a seen-reqs check in front of `Load Match Config` if your audit posture needs hard idempotency.

{
  "name": "Candidate rediscovery (silver medalists)",
  "nodes": [
    {
      "parameters": {
        "rule": {
          "interval": [
            {
              "field": "hours",
              "hoursInterval": 6
            }
          ]
        }
      },
      "id": "3b3b3b3b-0001-0000-0000-000000000001",
      "name": "Every 6 Hours",
      "type": "n8n-nodes-base.scheduleTrigger",
      "typeVersion": 1.2,
      "position": [240, 400],
      "notesInFlow": true,
      "notes": "Polls for newly-opened reqs every 6 hours. Greenhouse has no reliable job.created webhook, so this is a scheduled poll. The lookback window in the next node must be >= this interval so no req is missed. Tune both together."
    },
    {
      "parameters": {
        "method": "GET",
        "url": "https://harvest.greenhouse.io/v1/jobs",
        "authentication": "predefinedCredentialType",
        "nodeCredentialType": "httpBasicAuth",
        "sendHeaders": true,
        "headerParameters": {
          "parameters": [
            { "name": "Accept", "value": "application/json" }
          ]
        },
        "sendQuery": true,
        "queryParameters": {
          "parameters": [
            { "name": "status", "value": "open" },
            { "name": "created_after", "value": "={{ new Date(Date.now() - (($env.POLL_LOOKBACK_HOURS || 6) * 3600000)).toISOString() }}" },
            { "name": "per_page", "value": "100" }
          ]
        },
        "options": {
          "response": {
            "response": { "responseFormat": "json", "neverError": false }
          }
        }
      },
      "id": "3b3b3b3b-0001-0000-0000-000000000002",
      "name": "Fetch New Open Reqs",
      "type": "n8n-nodes-base.httpRequest",
      "typeVersion": 4.2,
      "position": [460, 400],
      "credentials": {
        "httpBasicAuth": {
          "id": "PLACEHOLDER_GREENHOUSE_CRED_ID",
          "name": "Greenhouse Harvest API (read scope)"
        }
      },
      "notesInFlow": true,
      "notes": "Greenhouse Harvest is Basic auth: username = API token, password = blank. `created_after` filters to reqs opened since the last poll. The JSON-array response is split into one item per req by n8n; downstream nodes run once per new req."
    },
    {
      "parameters": {
        "jsCode": "// Map each new req to its job-family config and load it from disk.\n// Config keys the rubric, the feeder-req list, the recency window, the\n// rejection-reason allow/deny lists, and the fit threshold. Halt (do not\n// fall back to defaults) if no config exists for the req's job family.\nconst fs = require('fs');\nconst path = require('path');\nconst crypto = require('crypto');\n\nconst CONFIG_DIR = $env.CONFIG_DIR || '/data/rediscovery';\nconst job = $json;\n\nfunction slugify(s) {\n  return String(s || '').toLowerCase().trim().replace(/[^a-z0-9]+/g, '-').replace(/^-+|-+$/g, '');\n}\n\n// Job family resolves from a `job_family` custom field if present, else the\n// first department name. This is the filename the recruiter authored.\nconst customFamily = (job.custom_fields && (job.custom_fields.job_family || job.custom_fields.Job_Family)) || '';\nconst deptName = (job.departments && job.departments[0] && job.departments[0].name) || '';\nconst family = slugify(customFamily) || slugify(deptName);\n\nif (!family) {\n  return [{ json: { status: 'halted', reason: 'no_job_family', job_id: job.id, job_name: job.name } }];\n}\n\nconst configPath = path.join(CONFIG_DIR, `${family}.json`);\nif (!fs.existsSync(configPath)) {\n  return [{ json: { status: 'halted', reason: 'missing_config', job_family: family, expected_path: configPath } }];\n}\n\nconst raw = fs.readFileSync(configPath, 'utf8');\nconst cfg = JSON.parse(raw);\nconst configSha = crypto.createHash('sha256').update(raw).digest('hex').slice(0, 16);\n\nconst recencyMonths = Number(cfg.recency_months) || 18;\nconst recencyCutoffIso = new Date(Date.now() - recencyMonths * 30 * 24 * 3600000).toISOString();\n\nreturn [{\n  json: {\n    status: 'config_loaded',\n    req_id: job.id,\n    req_title: job.name,\n    req_url: `https://app.greenhouse.io/sdash/${job.id}`,\n    job_family: family,\n    config_sha: configSha,\n    recency_months: recencyMonths,\n    recency_cutoff_iso: recencyCutoffIso,\n    match_job_ids: cfg.match_job_ids || [],\n    min_stage_reached: cfg.min_stage_reached || [],\n    rejection_reasons_allow: cfg.rejection_reasons_allow || [],\n    rejection_reasons_deny: cfg.rejection_reasons_deny || [],\n    do_not_contact_tags: cfg.do_not_contact_tags || ['do-not-contact', 'gdpr-erased', 'opted-out'],\n    fit_threshold: Number(cfg.fit_threshold) || 4,\n    top_n: Number(cfg.top_n) || 10,\n    rubric: cfg.rubric || {},\n  }\n}];"
      },
      "id": "3b3b3b3b-0001-0000-0000-000000000003",
      "name": "Load Match Config",
      "type": "n8n-nodes-base.code",
      "typeVersion": 2,
      "position": [680, 400],
      "notesInFlow": true,
      "notes": "One config file per job family at /data/rediscovery/<family>.json. No config -> halt (the req is left for manual sourcing). The config SHA is logged so the exact matching rules used on a given date are reproducible under audit."
    },
    {
      "parameters": {
        "conditions": {
          "options": { "caseSensitive": true, "typeValidation": "strict" },
          "conditions": [
            {
              "leftValue": "={{ $json.status }}",
              "rightValue": "config_loaded",
              "operator": { "type": "string", "operation": "equals" }
            }
          ],
          "combinator": "and"
        },
        "options": {}
      },
      "id": "3b3b3b3b-0001-0000-0000-000000000004",
      "name": "Config Loaded?",
      "type": "n8n-nodes-base.if",
      "typeVersion": 2,
      "position": [900, 400]
    },
    {
      "parameters": {
        "method": "GET",
        "url": "https://harvest.greenhouse.io/v1/applications",
        "authentication": "predefinedCredentialType",
        "nodeCredentialType": "httpBasicAuth",
        "sendHeaders": true,
        "headerParameters": {
          "parameters": [
            { "name": "Accept", "value": "application/json" }
          ]
        },
        "sendQuery": true,
        "queryParameters": {
          "parameters": [
            { "name": "status", "value": "rejected" },
            { "name": "last_activity_after", "value": "={{ $json.recency_cutoff_iso }}" },
            { "name": "per_page", "value": "500" }
          ]
        },
        "options": {
          "pagination": {
            "pagination": {
              "parameters": {
                "parameters": [
                  { "name": "page", "value": "={{ $pageCount + 1 }}" }
                ]
              },
              "paginationCompleteWhen": "responseEmpty",
              "type": "updateAParameterInEachRequest"
            }
          },
          "response": {
            "response": { "responseFormat": "json", "neverError": false }
          }
        }
      },
      "id": "3b3b3b3b-0001-0000-0000-000000000005",
      "name": "Fetch Rejected Pool",
      "type": "n8n-nodes-base.httpRequest",
      "typeVersion": 4.2,
      "position": [1140, 320],
      "credentials": {
        "httpBasicAuth": {
          "id": "PLACEHOLDER_GREENHOUSE_CRED_ID",
          "name": "Greenhouse Harvest API (read scope)"
        }
      },
      "notesInFlow": true,
      "notes": "Pulls rejected applications active within the recency window. Paginated (500/page). Returns one item per application. The deterministic filtering happens in the next node, not in the query, so the filter rules are auditable and version-controlled rather than buried in URL params."
    },
    {
      "parameters": {
        "jsCode": "// Deterministic eligibility filter. Runs once per rejected application.\n// Keeps an application only if it is a genuine silver medalist for THIS req:\n// - applied to one of the configured feeder reqs (match_job_ids)\n// - reached a late stage (min_stage_reached)\n// - rejection reason is in the allow-list AND not in the deny-list\n// - last activity within the recency window\n// - candidate carries no do-not-contact / erasure / opt-out tag\n// Drops everything else silently. No LLM has seen the record yet.\nconst app = $json;\nconst cfg = $('Load Match Config').item.json;\n\nfunction drop(reason) { return []; }\n\n// 1) Feeder-req match: the candidate must have applied to a configured past req.\nconst appJobIds = (app.jobs || []).map((j) => j.id);\nconst isFeeder = appJobIds.some((id) => cfg.match_job_ids.includes(id));\nif (!isFeeder) return drop('not_a_feeder_req');\n\n// 2) Late-stage gate: silver medalists reached a configured late stage.\nconst stage = (app.current_stage && app.current_stage.name) || '';\nif (cfg.min_stage_reached.length && !cfg.min_stage_reached.includes(stage)) {\n  return drop('did_not_reach_late_stage');\n}\n\n// 3) Rejection-reason gates. Deny wins over allow.\nconst reason = (app.rejection_reason && app.rejection_reason.name) || '';\nif (cfg.rejection_reasons_deny.includes(reason)) return drop('disqualifying_rejection_reason');\nif (cfg.rejection_reasons_allow.length && !cfg.rejection_reasons_allow.includes(reason)) {\n  return drop('rejection_reason_not_in_allow_list');\n}\n\n// 4) Recency: last activity must be within the window.\nconst lastActivity = app.last_activity_at || app.last_activity || null;\nif (lastActivity && new Date(lastActivity) < new Date(cfg.recency_cutoff_iso)) {\n  return drop('outside_recency_window');\n}\n\n// 5) Consent / suppression tags on the candidate.\nconst tags = (app.candidate_tags || (app.candidate && app.candidate.tags) || []).map((t) => String(t).toLowerCase());\nif (cfg.do_not_contact_tags.some((t) => tags.includes(String(t).toLowerCase()))) {\n  return drop('suppressed_do_not_contact');\n}\n\nconst candidateId = app.candidate_id || (app.candidate && app.candidate.id);\n\nreturn [{\n  json: {\n    status: 'eligible',\n    req_id: cfg.req_id,\n    req_title: cfg.req_title,\n    req_url: cfg.req_url,\n    job_family: cfg.job_family,\n    config_sha: cfg.config_sha,\n    fit_threshold: cfg.fit_threshold,\n    top_n: cfg.top_n,\n    rubric: cfg.rubric,\n    candidate_id: candidateId,\n    application_id: app.id,\n    prior_stage_reached: stage,\n    prior_rejection_reason: reason,\n    prior_req_ids: appJobIds,\n    last_activity_at: lastActivity,\n  }\n}];"
      },
      "id": "3b3b3b3b-0001-0000-0000-000000000006",
      "name": "Eligibility Filter",
      "type": "n8n-nodes-base.code",
      "typeVersion": 2,
      "position": [1360, 320],
      "notesInFlow": true,
      "notes": "Five deterministic gates run BEFORE any LLM call: feeder-req match, late-stage reached, rejection-reason allow/deny, recency window, do-not-contact suppression. A candidate failing any gate is dropped and never scored. This is the consent + fairness floor; do not move it after the model call."
    },
    {
      "parameters": {
        "method": "GET",
        "url": "=https://harvest.greenhouse.io/v1/applications/{{ $json.application_id }}/scorecards",
        "authentication": "predefinedCredentialType",
        "nodeCredentialType": "httpBasicAuth",
        "sendHeaders": true,
        "headerParameters": {
          "parameters": [
            { "name": "Accept", "value": "application/json" }
          ]
        },
        "options": {
          "response": {
            "response": { "responseFormat": "json", "neverError": true }
          }
        }
      },
      "id": "3b3b3b3b-0001-0000-0000-000000000007",
      "name": "Fetch Scorecards",
      "type": "n8n-nodes-base.httpRequest",
      "typeVersion": 4.2,
      "position": [1580, 320],
      "credentials": {
        "httpBasicAuth": {
          "id": "PLACEHOLDER_GREENHOUSE_CRED_ID",
          "name": "Greenhouse Harvest API (read scope)"
        }
      },
      "notesInFlow": true,
      "notes": "Pulls the prior interview scorecards for the candidate. These are the grounding text for the re-match score. `neverError: true` so a candidate with no scorecards (rejected early) does not break the run; the next node handles the empty case."
    },
    {
      "parameters": {
        "method": "POST",
        "url": "https://api.anthropic.com/v1/messages",
        "sendHeaders": true,
        "headerParameters": {
          "parameters": [
            { "name": "Content-Type", "value": "application/json" },
            { "name": "x-api-key", "value": "={{ $credentials.anthropicApi.apiKey }}" },
            { "name": "anthropic-version", "value": "2023-06-01" }
          ]
        },
        "sendBody": true,
        "specifyBody": "json",
        "jsonBody": "={\n  \"model\": \"claude-sonnet-4-6\",\n  \"max_tokens\": 900,\n  \"system\": \"You re-match a PAST candidate against a NEW open req. Score fit 1-5 against the rubric using only evidence in the supplied scorecard text. Cite a verbatim string as `evidence`. If you cannot cite verbatim evidence, fit is 1. Do NOT inherit the prior hire/no-hire decision: a candidate rejected because someone else was chosen can be a 5 for a new req. Do NOT score on name, school as a standalone signal, age, gender, or employment gaps. Return ONLY JSON: {\\\"fit\\\":{\\\"score\\\":N,\\\"evidence\\\":\\\"...\\\"},\\\"reason_to_resurface\\\":\\\"one sentence\\\",\\\"verify_before_outreach\\\":\\\"what a recruiter must confirm is still true\\\"}.\",\n  \"messages\": [\n    {\n      \"role\": \"user\",\n      \"content\": \"New req: {{ $('Eligibility Filter').item.json.req_title }}\\n\\nRubric:\\n{{ JSON.stringify($('Eligibility Filter').item.json.rubric) }}\\n\\nPrior stage reached: {{ $('Eligibility Filter').item.json.prior_stage_reached }}\\nPrior rejection reason: {{ $('Eligibility Filter').item.json.prior_rejection_reason }}\\n\\nPrior scorecards (JSON):\\n{{ JSON.stringify($json).slice(0, 12000) }}\"\n    }\n  ]\n}",
        "options": {
          "response": {
            "response": { "responseFormat": "json", "neverError": false }
          },
          "timeout": 60000
        }
      },
      "id": "3b3b3b3b-0001-0000-0000-000000000008",
      "name": "Claude Re-Match",
      "type": "n8n-nodes-base.httpRequest",
      "typeVersion": 4.2,
      "position": [1800, 240],
      "credentials": {
        "anthropicApi": {
          "id": "PLACEHOLDER_ANTHROPIC_CRED_ID",
          "name": "Anthropic API key"
        }
      },
      "notesInFlow": true,
      "notes": "Scores the past candidate against the NEW req's rubric, grounded in the prior scorecards. System prompt explicitly tells the model NOT to inherit the old reject decision and NOT to score on protected-class proxies. Evidence-required: no citation -> fit 1."
    },
    {
      "parameters": {
        "jsCode": "// Parse Claude's JSON, enforce the evidence rule, decide keep vs drop.\nconst input = $json;\nconst ctx = $('Eligibility Filter').item.json;\n\nlet parsed;\ntry {\n  const text = (input.content && input.content[0] && input.content[0].text) || '';\n  const m = text.match(/\\{[\\s\\S]*\\}/);\n  if (!m) throw new Error('no JSON object in response');\n  parsed = JSON.parse(m[0]);\n} catch (e) {\n  return [{ json: { status: 'scored', keep: false, error: 'unparseable_score', req_id: ctx.req_id, candidate_id: ctx.candidate_id } }];\n}\n\nconst rawScore = Number(parsed.fit && parsed.fit.score) || 1;\nconst evidence = (parsed.fit && parsed.fit.evidence) || '';\nconst fit = (rawScore > 1 && evidence.trim().length > 0) ? Math.min(5, Math.max(1, rawScore)) : 1;\nconst keep = fit >= ctx.fit_threshold;\n\nreturn [{\n  json: {\n    status: 'scored',\n    keep,\n    req_id: ctx.req_id,\n    req_title: ctx.req_title,\n    req_url: ctx.req_url,\n    job_family: ctx.job_family,\n    config_sha: ctx.config_sha,\n    top_n: ctx.top_n,\n    candidate_id: ctx.candidate_id,\n    application_id: ctx.application_id,\n    prior_stage_reached: ctx.prior_stage_reached,\n    prior_rejection_reason: ctx.prior_rejection_reason,\n    fit,\n    evidence: evidence.slice(0, 240),\n    reason_to_resurface: (parsed.reason_to_resurface || '').slice(0, 240),\n    verify_before_outreach: (parsed.verify_before_outreach || '').slice(0, 240),\n    scored_at: new Date().toISOString(),\n  }\n}];"
      },
      "id": "3b3b3b3b-0001-0000-0000-000000000009",
      "name": "Parse + Keep",
      "type": "n8n-nodes-base.code",
      "typeVersion": 2,
      "position": [2020, 240],
      "notesInFlow": true,
      "notes": "Parses the model output, enforces the evidence-required guarantee (empty evidence -> fit 1), and flags keep when fit >= the config threshold. Unparseable responses are kept in the stream as keep:false so they show up in the audit log rather than vanishing."
    },
    {
      "parameters": {
        "jsCode": "// Append one audit line per scored candidate. Pseudonymous: candidate_id +\n// the Greenhouse link only, no name / no scorecard text. This is the record\n// that a past candidate was machine-scored for re-contact consideration.\nconst fs = require('fs');\nconst path = require('path');\n\nconst AUDIT_DIR = $env.AUDIT_DIR || '/data/audit';\nfs.mkdirSync(AUDIT_DIR, { recursive: true });\n\nconst input = $json;\nconst yyyymm = new Date().toISOString().slice(0, 7);\nconst auditPath = path.join(AUDIT_DIR, `rediscovery-${yyyymm}.jsonl`);\n\nconst entry = {\n  ts: new Date().toISOString(),\n  req_id: input.req_id,\n  job_family: input.job_family,\n  config_sha: input.config_sha,\n  candidate_id: input.candidate_id,\n  application_id: input.application_id,\n  prior_stage_reached: input.prior_stage_reached,\n  prior_rejection_reason: input.prior_rejection_reason,\n  fit: input.fit,\n  kept: !!input.keep,\n  model: 'claude-sonnet-4-6',\n};\n\nfs.appendFileSync(auditPath, JSON.stringify(entry) + '\\n', 'utf8');\nreturn [{ json: input }];"
      },
      "id": "3b3b3b3b-0001-0000-0000-00000000000a",
      "name": "Audit Append",
      "type": "n8n-nodes-base.code",
      "typeVersion": 2,
      "position": [2240, 240],
      "notesInFlow": true,
      "notes": "One JSONL line per scored candidate (kept or not). No PII beyond the candidate_id reference. This log is what makes a GDPR / EEOC inquiry about automated re-contact decisions answerable. Retention should match the firm's hiring-records policy."
    },
    {
      "parameters": {
        "mode": "runOnceForAllItems",
        "jsCode": "// Aggregate all scored candidates, group by req, dedupe by candidate (keep\n// the highest fit), keep the top_n above threshold, and build one Slack\n// digest payload per req. Emits one item per req for the Slack node.\nconst all = $input.all().map((i) => i.json).filter((j) => j && j.keep);\n\nconst byReq = {};\nfor (const r of all) {\n  byReq[r.req_id] = byReq[r.req_id] || { req: r, candidates: {} };\n  const existing = byReq[r.req_id].candidates[r.candidate_id];\n  if (!existing || r.fit > existing.fit) {\n    byReq[r.req_id].candidates[r.candidate_id] = r;\n  }\n}\n\nconst out = [];\nfor (const reqId of Object.keys(byReq)) {\n  const group = byReq[reqId];\n  const ranked = Object.values(group.candidates).sort((a, b) => b.fit - a.fit).slice(0, group.req.top_n);\n  if (!ranked.length) continue;\n\n  const lines = ranked.map((c, idx) =>\n    `*${idx + 1}. fit ${c.fit}/5* — <https://app.greenhouse.io/people/${c.candidate_id}|candidate ${c.candidate_id}>\\n   _Reached:_ ${c.prior_stage_reached} · _Rejected:_ ${c.prior_rejection_reason}\\n   _Why:_ ${c.reason_to_resurface}\\n   _Confirm first:_ ${c.verify_before_outreach}`\n  ).join('\\n\\n');\n\n  out.push({\n    json: {\n      req_id: group.req.req_id,\n      req_title: group.req.req_title,\n      req_url: group.req.req_url,\n      config_sha: group.req.config_sha,\n      shortlist_count: ranked.length,\n      slack_text: `*Silver-medalist shortlist — ${group.req.req_title}*\\n${ranked.length} past candidate(s) re-matched. The recruiter decides outreach — nothing has been contacted or moved.\\n\\n${lines}\\n\\n_Matching rules: config \\`${group.req.config_sha}\\`. <${group.req.req_url}|Open the req>_`,\n    }\n  });\n}\n\nreturn out;"
      },
      "id": "3b3b3b3b-0001-0000-0000-00000000000b",
      "name": "Build Digest",
      "type": "n8n-nodes-base.code",
      "typeVersion": 2,
      "position": [2460, 240],
      "notesInFlow": true,
      "notes": "Runs once over every scored candidate. Dedupes a candidate who matched via two feeder reqs (keeps the higher fit), ranks, truncates to top_n, and builds one Slack digest per req. A req with zero kept candidates produces no message rather than a noisy empty digest."
    },
    {
      "parameters": {
        "method": "POST",
        "url": "https://slack.com/api/chat.postMessage",
        "authentication": "predefinedCredentialType",
        "nodeCredentialType": "slackApi",
        "sendHeaders": true,
        "headerParameters": {
          "parameters": [
            { "name": "Content-Type", "value": "application/json; charset=utf-8" }
          ]
        },
        "sendBody": true,
        "specifyBody": "json",
        "jsonBody": "={\n  \"channel\": \"#talent-rediscovery\",\n  \"text\": \"{{ $json.slack_text }}\"\n}",
        "options": {}
      },
      "id": "3b3b3b3b-0001-0000-0000-00000000000c",
      "name": "Slack Digest",
      "type": "n8n-nodes-base.httpRequest",
      "typeVersion": 4.2,
      "position": [2680, 240],
      "credentials": {
        "slackApi": {
          "id": "PLACEHOLDER_SLACK_CRED_ID",
          "name": "Slack bot token (chat:write)"
        }
      },
      "notesInFlow": true,
      "notes": "Posts one ranked digest per req to #talent-rediscovery. The message is a decision surface for the recruiter, not an action: no candidate is contacted, moved, or added to a pipeline by the flow."
    }
  ],
  "connections": {
    "Every 6 Hours": {
      "main": [[{ "node": "Fetch New Open Reqs", "type": "main", "index": 0 }]]
    },
    "Fetch New Open Reqs": {
      "main": [[{ "node": "Load Match Config", "type": "main", "index": 0 }]]
    },
    "Load Match Config": {
      "main": [[{ "node": "Config Loaded?", "type": "main", "index": 0 }]]
    },
    "Config Loaded?": {
      "main": [
        [{ "node": "Fetch Rejected Pool", "type": "main", "index": 0 }],
        []
      ]
    },
    "Fetch Rejected Pool": {
      "main": [[{ "node": "Eligibility Filter", "type": "main", "index": 0 }]]
    },
    "Eligibility Filter": {
      "main": [[{ "node": "Fetch Scorecards", "type": "main", "index": 0 }]]
    },
    "Fetch Scorecards": {
      "main": [[{ "node": "Claude Re-Match", "type": "main", "index": 0 }]]
    },
    "Claude Re-Match": {
      "main": [[{ "node": "Parse + Keep", "type": "main", "index": 0 }]]
    },
    "Parse + Keep": {
      "main": [[{ "node": "Audit Append", "type": "main", "index": 0 }]]
    },
    "Audit Append": {
      "main": [[{ "node": "Build Digest", "type": "main", "index": 0 }]]
    },
    "Build Digest": {
      "main": [[{ "node": "Slack Digest", "type": "main", "index": 0 }]]
    }
  },
  "settings": {
    "executionOrder": "v1",
    "timezone": "UTC",
    "saveExecutionProgress": true,
    "saveManualExecutions": true,
    "callerPolicy": "workflowsFromSameOwner"
  },
  "active": false,
  "versionId": "1"
}