n8n-flow

CSMs mit n8n bei Nutzungsrückgängen alarmieren

Difficulty

Fortgeschritten

Setup time

45-90 min

For

cs-ops

Customer Success

Stack

Das zuverlässigste Churn-Signal, das ein CS-Team hat, ist die Produktnutzung, die einbricht, und die häufigste Art, dieses Signal zu verpassen, ist, dass niemand die richtige Woche im Blick hat. Wenn ein QBR den Rückgang ans Licht bringt, ist der Account bereits seit zwei Monaten still. Dieser Workflow schließt diese Lücke mit dem kleinstmöglichen Mechanismus: ein wöchentlicher n8n-Flow, der die Anzahl aktiver Nutzer jedes Accounts aus Amplitude liest, die letzte Woche mit der Vorwoche vergleicht und eine Slack-Direktnachricht an den zuständigen CSM sendet, sobald der Rückgang einen Schwellenwert überschreitet, den der CS-Ops-Lead pro Account steuert. Er tut genau eine Sache —einen Nutzungsrückgang von Woche zu Woche sichtbar machen, solange er noch das Problem dieser Woche ist— und das ohne ein Dashboard, das niemand öffnet.

Das Artifact-Bundle liegt unter apps/web/public/artifacts/usage-drop-alert-n8n/. Der n8n-Export ist usage-drop-alert-n8n.json und der Credential-, Schema- und Verifizierungsleitfaden ist _README.md. Beide sind Pflichtlektüre, bevor der Schedule aktiviert wird, denn das Bundle wird mit Platzhalter-Credentials und zwei Postgres-Tabellen ausgeliefert, die vor dem ersten Lauf existieren müssen.

Wann Sie ihn einsetzen

Setzen Sie ihn ein, wenn Sie ein CS-Ops-Lead sind, der ein Account-Book betreut, das über das Stadium hinausgewachsen ist, ein Nutzungs-Dashboard mit bloßem Auge zu verfolgen —irgendwo jenseits von 50 Accounts pro CSM, wo niemand das gesamte Portfolio im Kopf behalten kann. Sie brauchen Amplitude (oder ein Product-Analytics-Tool, auf das der HTTP-Node umgepointet werden kann), das ein Signal aktiver Nutzer pro Account verfolgt, einen Slack-Workspace und einen Ort, um Schwellenwerte pro Account zu speichern. Der Flow ist die richtige Wahl, wenn die CS-Organisation der Produktnutzung als Frühindikator bereits vertraut, aber keinen Mechanismus hat, der das Signal vor dem nächsten QBR an einen Menschen pusht.

Er passt besonders gut als günstige erste Schicht unter einem schwereren Health-Modell. Wenn Sie bereits den zusammengesetzten Customer Health Score in n8n betreiben, ist dieser Flow die schnell reagierende Ergänzung: der zusammengesetzte Score wird jede Nacht neu berechnet und sagt Ihnen, wo ein Account steht, während dieser Alert wöchentlich auslöst und Ihnen sagt, was sich gerade bewegt hat. Viele Teams stellen zuerst den Alert auf, weil er einen Nachmittag Arbeit kostet und innerhalb von Tagen Vertrauen gewinnt, und steigen dann auf den zusammengesetzten Score um, sobald die CSMs auf die Pings reagieren.

Wann Sie ihn NICHT einsetzen

Lassen Sie es sein, wenn ein CSM das gesamte Portfolio von Hand lesen kann. Unter etwa 30 Accounts pro CSM fängt ein Mensch, der am Montagmorgen das Nutzungs-Dashboard scannt, dieselben Rückgänge mit mehr Kontext ab, und die Falsch-Positiv-Kosten eines automatisierten Schwellenwerts lohnen sich nicht. Der Flow verdient seinen Platz durch Volumen, nicht durch Cleverness.

Lassen Sie es sein, wenn Ihr Account-Level-Tagging in Amplitude unzuverlässig ist. Der gesamte Flow beruht darauf, dass gp:account_id (oder Ihre äquivalente Nutzereigenschaft) auf jedem Event konsistent gesetzt ist. Wenn Accounts inkonsistent getaggt sind —manche Events tragen die Eigenschaft, manche nicht— ist die wöchentliche Aktiv-Zahl bedeutungslos und der Alert löst auf einem Tagging-Artefakt aus, nicht auf einer Verhaltensänderung. Bringen Sie zuerst die Taxonomie in Ordnung; ein Alert auf schmutzigen Daten ist schlimmer als kein Alert, weil er die Autorität einer Zahl trägt.

Lassen Sie es sein, wenn der Rückgang, der Sie interessiert, auf Seat- oder Feature-Ebene statt auf Account-Ebene liegt. Dieser Flow überwacht ein Signal —wöchentliche eindeutige Aktive pro Account— und ein Rückgang von 40 % bei den Gesamt-Aktiven kann einen gesunden Account verbergen, der lediglich einen Power-User während einer Urlaubswoche verloren hat. Wenn Ihr Churn-Risiko in der Aufgabe eines bestimmten Features oder im Verstummen eines einzelnen namentlichen Champions liegt, brauchen Sie eine Feature- oder Nutzer-Kohorte, was ein anderer (schwererer) Flow ist. Und lassen Sie es sein, wenn das Team kein Playbook dafür hat, was zu tun ist, wenn ein Rückgangs-Alert auslöst; eine Benachrichtigung ohne definierte nächste Aktion trainiert Menschen darauf, sie zu verwerfen.

Setup

Das Setup ist Ende zu Ende in apps/web/public/artifacts/usage-drop-alert-n8n/_README.md dokumentiert. Die Kurzfassung: Importieren Sie das JSON in n8n unter Settings → Import From File, legen Sie die drei Platzhalter-Credentials an (Postgres, Amplitude Basic auth, Slack Bot-Token), erstellen Sie die zwei Postgres-Tabellen aus dem DDL im README (accounts_in_scope und usage_alert_history), seeden Sie einen Canary-Account und führen Sie die achtstufige Verifizierungssequenz aus, bevor Sie den Schedule aktivieren. Von einer sauberen n8n-Installation aus planen Sie 45 bis 90 Minuten ein —der Großteil davon entfällt darauf, zu bestätigen, dass die Amplitude-Segmentierungsabfrage zu Ihrer Event-Taxonomie passt und dass die Slack-App Nutzern in Ihrem Workspace DMs senden kann.

Die Tabelle accounts_in_scope ist der Ort, an dem die Policy pro Account lebt, und sie richtig zu konfigurieren, ist der Unterschied zwischen einem nützlichen Alert und einem stummgeschalteten Bot. Jede Zeile trägt drop_threshold_pct (den Prozentsatz des Rückgangs, der einen Alert auslöst) und min_baseline_events (die Untergrenze aktiver Nutzer, unter der der Account zu klein ist, um beurteilt zu werden). Enterprise-Accounts laufen oft mit einem strengeren Schwellenwert —ein Rückgang von 25 % bei einem Account mit 200 Seats ist einen Blick wert— während Self-Serve-Accounts mehr Rauschen tolerieren und mit 50 % laufen. Diese als Tabellenspalten statt als hartkodierte Konstanten zu halten bedeutet, dass das Nachjustieren ein einziges UPDATE ist, kein Redeploy.

Was der Flow tatsächlich tut

Der Cron löst montags um 09:00 in America/New_York aus (der Ausdruck ist 0 13 * * 1 —13:00 UTC— bestätigen Sie also, dass der Timezone des Workflows gesetzt ist). Der Montagmorgen ist Absicht: die Vorwoche ist vollständig abgeschlossen, es gibt also keinen Teilwochenvergleich, der jeden Montag als Rückgang lesen würde. Pull Accounts In Scope liest bis zu 500 aktive Accounts, bei denen eine CSM-Slack-id gesetzt ist; Accounts ohne Owner werden im SQL herausgefiltert, weil es niemanden zu benachrichtigen gibt. Batch Accounts (20/group) teilt sie in Gruppen, damit die parallelen Amplitude-Aufrufe unter dem Concurrency-Limit der Dashboard REST API bleiben, mit einer Wartezeit von einer Sekunde zwischen den Batches.

Amplitude — Weekly Actives (14d) ruft den Endpunkt /api/2/events/segmentation mit i=7 (wöchentliche Buckets) über ein Fenster von 14 Tagen auf, segmentiert nach der gp:account_id-Eigenschaft des Accounts. Das gibt zwei wöchentliche Punkte zurück: die letzte Woche und die Vorwoche. Compute WoW Drop ist die einzige echte Logik im Flow und trifft zwei Entscheidungen. Erstens der Rausch-Guard: liegt die Aktiv-Zahl der Vorwoche unter min_baseline_events, wird der Account als skipped_low_baseline markiert und alarmiert nie —ein Wechsel von vier Aktiven auf zwei ist ein Rückgang von 50 % und reines Rauschen. Zweitens der Schwellenwert: er berechnet (week_before - last_week) / week_before als Prozentsatz und markiert die Zeile nur dann als alert, wenn das den drop_threshold_pct des Accounts erreicht oder überschreitet, mit einem lesbaren Grund wie „wöchentliche Aktive fielen um 47 % (von 120 auf 64) gegenüber der Vorwoche”.

Crosses Threshold? routet Alert-Zeilen weiter; alles andere geht direkt zum Throttle. Lookup Recent Alert prüft dann usage_alert_history auf jeglichen Alert für diesen Account in den letzten 14 Tagen, und Outside Cooldown? unterdrückt die Wiederholung, falls einer existiert. Dies ist der zweite Guard gegen Ermüdung: ein anhaltender Rückgang würde den CSM sonst jeden Montag pingen, bis sich die Nutzung erholt, was ihn darauf trainiert, den Bot zu ignorieren. Mit dem Cooldown pingt ein echter Rückgang einmal, und der CSM übernimmt von da an das Follow-up.

Überlebende Zeilen treffen auf Slack — DM Owning CSM, das eine Block-Kit-Nachricht direkt an die Slack-user-id des CSM postet, mit Account-Name, Segment, Aktiv-Zahlen vorher/nachher, dem Prozentsatz des Rückgangs und dem Schwellenwert, der ausgelöst hat. Persist Alert (idempotent per week) schreibt den Alert in usage_alert_history mit einer ON CONFLICT-Klausel mit Schlüssel (account_id, date_trunc('week', alerted_at)), sodass ein wiederholter Lauf die bestehende Zeile aktualisiert, statt dem CSM zweimal eine DM zu senden, und stempelt last_alerted_at auf den Account für den Cooldown-Lesezugriff auf dem schnellen Pfad.

Kostenrealität

Dieser Flow ist im Betrieb nahezu kostenlos. Es gibt keinen LLM-Aufruf —der Vergleich ist Arithmetik in einem Code-Node, sodass die einzigen Kosten API-Kontingent und n8n-Ausführungszeit sind. Pro Account und Woche macht der Flow einen Amplitude-Segmentierungslesezugriff, höchstens einen Slack-Schreibvorgang und zwei oder drei Postgres-Abfragen. Amplitudes Dashboard REST API rechnet auf bezahlten Plänen nicht pro Aufruf ab; die Einschränkung ist ihr niedriges Concurrency-Limit, was genau der Grund ist, warum die Batch-Size 20 mit einem Throttle von einer Sekunde beträgt. Für 500 Accounts ist der gesamte Lauf in etwa drei bis sechs Minuten auf dem kleinen Executor von n8n Cloud abgeschlossen, dominiert von den serialisierten Amplitude-Lesezugriffen. Slacks chat.postMessage ist auf etwa eine Nachricht pro Sekunde pro Kanalkontext rate-limited, komfortabel unter dem, was ein wöchentliches Alert-Volumen benötigt.

Die echten Kosten sind menschlich, und es sind die Kosten, die Sie zu reduzieren versuchen: ein CS-Ops-Lead verbringt vielleicht eine Stunde pro Quartal damit, Schwellenwerte nachzujustieren, während sich die Segmente verschieben, gegenüber der Alternative, dass CSMs jeweils 20 bis 30 Minuten pro Woche mit dem Auge auf Dashboards starren (oder, häufiger, es nicht tun und es im QBR erfahren). In einem Team aus 10 CSMs sind das etwa 40 bis 50 Stunden manuelles Scannen pro Quartal, ersetzt durch eine Stunde Schwellenwert-Pflege —und das Scannen fing die Rückgänge ohnehin einen Monat zu spät ab.

Wie Erfolg aussieht

Beobachten Sie im ersten Quartal drei Zahlen. Erstens, die Aktionsrate auf Alerts —der Anteil der DMs, die innerhalb von fünf Werktagen zu einem protokollierten CSM-Touch führen (eine E-Mail, ein gebuchter Call, eine Notiz). Befragen oder instrumentieren Sie dies; zielen Sie bis zum Ende des ersten Monats auf über 60 %. Eine Aktionsrate unter 40 % bedeutet, dass der Schwellenwert zu locker ist und der Bot Wolf schreit —heben Sie drop_threshold_pct für die rauschenden Segmente an. Zweitens, die Lead Time bis zur Intervention —messen Sie für Accounts, die später gechurnt sind oder geschrumpft sind, um wie viele Tage der Nutzungsrückgangs-Alert dem ersten CSM-Outreach vorausging, verglichen mit der historischen Baseline „im QBR erfahren”. Der ganze Sinn ist, diese Zahl von etwa 60 Tagen auf unter 14 zu bewegen. Drittens, die Unterdrückungsrate —der Anteil der Accounts, die den Schwellenwert überschritten, aber durch den Cooldown zurückgehalten wurden. Eine gesunde Zahl ist niedrig und stabil; eine steigende Unterdrückungsrate bedeutet, dass eine Kohorte in anhaltendem Niedergang ist und der wöchentliche Alert nicht mehr das richtige Werkzeug ist —diese Accounts brauchen den zusammengesetzten Customer Health Score und ein Save-Play, nicht noch einen Ping.

Versus die Alternativen

Die Standard-Alternative ist Amplitudes eigenes Alerting —seine Anomaly- und Threshold-Monitore können einen Chart überwachen und an Slack oder E-Mail auslösen. Wenn Sie genau einen globalen Alert brauchen („gesamte wöchentliche Aktive sind gefallen”), nutzen Sie Amplitudes nativen Monitor; das ist weniger Arbeit, als n8n aufzusetzen. Der Grund, warum dieser Flow existiert, ist das Routing pro Account: Amplitudes Monitore alarmieren auf einem Chart, nicht auf einem Account-zu-CSM-Mapping, sodass ein Monitor auf Portfolio-Ebene dem zuständigen CSM nicht sagen kann, dass sein Account gefallen ist. Um aus Amplitude allein ein Routing pro-Account, pro-Owner herauszuholen, bauen Sie am Ende einen Monitor pro Account, was über eine Handvoll hinaus nicht skaliert. Dieser Flow hält die Account-zu-CSM-Karte und die Schwellenwerte pro Account in einer Tabelle, die Sie besitzen, und routet entsprechend.

Eine zweite Alternative sind die integrierten Nutzungs-Alerts Ihres CSP —Gainsight, Catalyst, ChurnZero, Vitally, Planhat und Totango liefern alle irgendeine Form von Nutzungsrückgangs-Trigger. Wenn Sie bereits einen CSP betreiben und die Produktnutzung dort hineinleiten, nutzen Sie den nativen Trigger —die Daten sind bereits dort und das Routing zum CSM ist bereits verdrahtet. Dieser Flow ist für das Team, das sein Product-Analytics in Amplitude hat, aber die Nutzung noch nicht in einem CSP zentralisiert hat, oder dessen CSP bei den Nutzungsdaten einen Sync-Zyklus hinter Amplitude liegt. Er ist die Brücke, die den Frühindikator liefert, bevor die Rollup des CSP aufholt.

Eine dritte Alternative ist ein DIY-Skript auf einem Cron —ein Python-Job, der die Amplitude-API und die Slack-API anspricht. Die erste Version ist schneller geschrieben, als den n8n-Flow zu verdrahten, aber er trägt die Last der Credential-Rotation im Code, hat keine Retry-Semantik out of the box und ist für den CS-Ops-Lead, der kein Ingenieur ist, unsichtbar. Die n8n-Version tauscht rohe Flexibilität gegen eine Credential-UI, integrierte Retries und einen visuellen Flow, den ein Nicht-Ingenieur lesen und nachjustieren kann. Wählen Sie DIY, wenn CS Ops einen festen Ingenieur hat; wählen Sie den n8n-Flow, wenn die Person, die die Schwellenwerte justiert, dieselbe ist, die die Alerts liest.

Worauf zu achten ist

Ein Tagging-Artefakt liest sich wie ein Nutzungs-Abgrund. Wenn sich die Produktinstrumentierung ändert —ein Event wird umbenannt, die account_id-Eigenschaft wird auf einer Oberfläche nicht mehr gesetzt— zeigt jeder Account auf dieser Oberfläche einen Abfall auf null, und der Bot sendet allen CSMs auf einmal eine DM. Guard: Fragen Sie vor der Aktivierung die Distinct-Anzahl der Accounts mit nicht-null gp:account_id der letzten zwei Wochen ab und bestätigen Sie, dass sie stabil ist; und behandeln Sie einen Anstieg des Alert-Volumens in derselben Woche über viele Accounts hinweg als Instrumentierungsvorfall, nicht als Churn-Welle —die Tabelle usage_alert_history macht diesen Anstieg auf einen Blick sichtbar.
Kleine Accounts erzeugen Phantom-Rückgänge. Ein Account mit vier wöchentlichen Aktiven, der auf zwei fällt, ist ein Rückgang von 50 % und bedeutet nichts. Guard: die min_baseline_events-Untergrenze in accounts_in_scope markiert jeden Account unter dem Vorwochen-Schwellenwert als skipped_low_baseline und alarmiert nie darauf. Setzen Sie die Untergrenze pro Segment —Self-Serve kann mit einer Untergrenze von 5 laufen, Enterprise braucht selten eine.
Anhaltende Rückgänge spammen den CSM zu. Ohne Unterdrückung würde ein Account, der fällt und unten bleibt, jeden Montag auslösen, bis er sich erholt. Guard: Lookup Recent Alert plus der 14-Tage-Cooldown in Outside Cooldown? stellt einen Alert pro Rückgangs-Ereignis sicher; der CSM übernimmt das Follow-up nach dem ersten Ping, und ein noch fallender Account taucht im zusammengesetzten Customer Health Score auf, nicht in einem wiederholten Alert.
Retries senden doppelte DMs. Ein Node-Ausfall mitten im Batch, der einen n8n-Retry auslöst, könnte die Slack-DM zweimal senden. Guard: usage_alert_history hat einen Unique-Index auf (account_id, date_trunc('week', alerted_at)) und Persist Alert nutzt ON CONFLICT ... DO UPDATE, sodass der zweite Versuch die bestehende Wochenzeile aktualisiert, statt eine neue einzufügen —und weil der Slack-Versand dem Persist vorausgeht, fängt der Cooldown-Lesezugriff beim Retry ihn ab.
Die DM kommt an und nichts passiert. Ein Alert ohne definierten nächsten Schritt ist Rauschen mit einem Zeitstempel. Guard: dies ist ein Prozess-Guard, kein Code-Guard —koppeln Sie das Rollout mit einem einzeiligen Playbook („Nutzungsrückgangs-DM → Account in Ihrem CSP prüfen → einen Touch innerhalb von fünf Werktagen protokollieren”) und verfolgen Sie die obige Aktionsrate. Ist die Aktionsrate niedrig, ist die Korrektur das Playbook oder der Schwellenwert, nicht mehr Alerts.

Stack

n8n —Orchestrierung, der wöchentliche Schedule, Retries, Credential-Verwaltung und ein visueller Flow, den ein CS-Ops-Lead ohne Ingenieur nachjustieren kann
Amplitude —die Produktnutzungs-Quelle; wöchentliche eindeutige Aktive pro Account über den Dashboard-REST-Endpunkt events/segmentation
Slack —der Zustellkanal; eine Block-Kit-DM an die user-id des zuständigen CSM (auf einen geteilten Kanal umpointbar)
Postgres —accounts_in_scope für Schwellenwerte pro Account und CSM-Routing, usage_alert_history für den Cooldown und den Idempotenz-Schlüssel

Diese Seite auf GitHub bearbeiten

Files in this artifact

Download all (.zip)

# Usage-drop alert for CSMs — n8n flow

## What this flow does

This flow runs every Monday at 09:00 in `America/New_York` and checks every active account for a week-over-week drop in product usage. For each account it pulls two weekly buckets of unique active users from Amplitude (last week and the week before), computes the percentage drop, and compares it against a per-account threshold stored in Postgres. Accounts whose drop crosses the threshold — and that are not inside a 14-day cooldown from a prior alert — trigger a Slack direct message to the owning CSM naming the account, the before/after active-user counts, and the percentage drop. Every alert is logged to a history table so a sustained dip pings the CSM once, not every week.

The flow is deliberately small: one external read (Amplitude), one decision (threshold), one suppression check (cooldown), one notification (Slack), one write (history). It is the leading-indicator companion to a full composite health score, not a replacement for one.

## Import

In n8n: open **Settings → Import From File → select `usage-drop-alert-n8n.json`**. After import, open the workflow and confirm the timezone in **Workflow Settings** is `America/New_York` (it ships set, but reconfirm — the schedule trigger and the cron's `13:00 UTC` expression both assume it). Activate the workflow only after credentials are wired and the verification run below has passed.

## Credentials

Two placeholder credentials are referenced by name in the export. Create each in n8n under **Credentials → New** and map the matching `PLACEHOLDER_*_CRED_ID` reference on first open. (Postgres is the third — it backs the state tables and is also referenced by name.)

### `PLACEHOLDER_POSTGRES_CRED_ID` — Postgres — usage-alert-state

Used by three nodes: `Pull Accounts In Scope`, `Lookup Recent Alert`, and `Persist Alert (idempotent per week)`. Point this at a Postgres database you control. Required tables:

```sql
CREATE TABLE accounts_in_scope (
account_id text PRIMARY KEY,
account_name text NOT NULL,
amplitude_project_id text,
segment text,
active boolean NOT NULL DEFAULT true,
drop_threshold_pct int NOT NULL DEFAULT 40, -- per-account % drop that triggers an alert
min_baseline_events int NOT NULL DEFAULT 10, -- floor below which the account is too small to judge
csm_slack_user_id text, -- Slack user id of the owning CSM (e.g. U0123ABCD)
last_alerted_at timestamptz
);

CREATE TABLE usage_alert_history (
account_id text NOT NULL,
alerted_at timestamptz NOT NULL DEFAULT now(),
week_before int,
last_week int,
drop_pct int,
threshold int,
reason text
);
-- Idempotence key: one row per account per week, so retries do not double-log or double-DM.
CREATE UNIQUE INDEX usage_alert_history_week_uniq
ON usage_alert_history (account_id, date_trunc('week', alerted_at));
```

### `PLACEHOLDER_AMPLITUDE_CRED_ID` — Amplitude — API key:secret (Basic)

Amplitude's Dashboard REST API uses HTTP Basic auth where the username is the project **API Key** and the password is the project **Secret Key**. Find both in Amplitude under **Settings → Projects → [your project] → General**. In n8n create a **Basic Auth** credential: username = API Key, password = Secret Key. The flow calls the `/api/2/events/segmentation` endpoint, which needs no extra scope beyond a valid key pair. Note the endpoint returns event-segmentation series; the node's query segments on a `gp:account_id` user property — rename that to whatever account identifier your Amplitude taxonomy uses, and replace the `_active` event with your own activity event if you do not track a synthetic `_active` event.

### `PLACEHOLDER_SLACK_CRED_ID` — Slack — bot token

In your Slack workspace under **api.slack.com/apps**, create an app with a bot user and the scopes `chat:write` and `im:write` (the latter is required to open a DM channel with a user). Install the app to the workspace and copy the bot token (`xoxb-...`). Store it as a header credential with header name `Authorization` and prefix value `Bearer `. Because the flow DMs the CSM by Slack user id, each CSM must have **"Allow users in your workspace to send you direct messages"** enabled and the app must not be blocked. If your org restricts app DMs, point the `channel` field at a shared channel such as `#cs-usage-alerts` and tag the CSM in the message text instead.

## First-run verification

Run the flow manually before activating the schedule. This sequence proves each branch without spamming CSMs.

1. **Seed one canary account.** Insert a single row into `accounts_in_scope` with a real `account_id` that exists in Amplitude, your own Slack user id in `csm_slack_user_id`, `drop_threshold_pct = 1` (so any drop fires), and `min_baseline_events = 1`.
2. **Run `Pull Accounts In Scope` in isolation.** Confirm the canary row comes back. If empty, check `active = true` and that `csm_slack_user_id` is non-null (the `WHERE` clause filters out null Slack ids).
3. **Run `Amplitude — Weekly Actives (14d)`.** Confirm a non-empty `data.series` array with at least two weekly values. A 400 usually means the `gp:account_id` property name or the event name does not match your taxonomy; a 401 means the Basic auth key/secret pair is wrong.
4. **Run `Compute WoW Drop`.** Confirm `week_before`, `last_week`, `drop_pct`, and `status` are populated. Temporarily hand-edit the canary's Amplitude data (or pin a fixture) so `last_week` is well below `week_before` and confirm `status` becomes `alert`. Then set `min_baseline_events` above `week_before` and confirm `status` becomes `skipped_low_baseline` — that proves the noise guard works.
5. **Check the cooldown path.** With `usage_alert_history` empty, `Outside Cooldown?` should route to the Slack node. Manually insert a row into `usage_alert_history` for the canary dated yesterday, re-run, and confirm `Outside Cooldown?` now routes to the throttle (suppressed). Delete the test row afterward.
6. **Fire one real DM.** With the cooldown clear, let the flow run end-to-end on the canary. Confirm you receive the Slack DM with the account name, the before/after counts, and the drop percentage, and that one row landed in `usage_alert_history`.
7. **Re-run the same day.** Confirm no second DM arrives and `usage_alert_history` still has exactly one row for the week (the `ON CONFLICT` clause is doing its job).
8. **Restore real thresholds.** Set `drop_threshold_pct` and `min_baseline_events` back to production values (40 and 10 are sensible defaults) before activating the schedule.

If any step fails, fix it before activating. A weekly cron that DMs CSMs about phantom drops will train them to mute the bot inside a month — the noise guard and the cooldown exist specifically to keep that from happening.

{
  "name": "Usage-drop alert for CSMs",
  "nodes": [
    {
      "parameters": {
        "rule": {
          "interval": [
            {
              "field": "cronExpression",
              "expression": "0 13 * * 1"
            }
          ]
        }
      },
      "id": "3e3e3e3e-0001-0000-0000-000000000001",
      "name": "Weekly Cron — Mon 9am ET",
      "type": "n8n-nodes-base.scheduleTrigger",
      "typeVersion": 1,
      "position": [240, 400],
      "notesInFlow": true,
      "notes": "Cron is 13:00 UTC = 09:00 America/New_York. Confirm the workflow timezone in Settings is America/New_York. Monday morning chosen so the prior week is fully closed before comparison."
    },
    {
      "parameters": {
        "operation": "executeQuery",
        "query": "SELECT\n  account_id,\n  account_name,\n  amplitude_project_id,\n  segment,\n  drop_threshold_pct,\n  min_baseline_events,\n  csm_slack_user_id,\n  last_alerted_at\nFROM accounts_in_scope\nWHERE active = true\n  AND csm_slack_user_id IS NOT NULL\nORDER BY account_id\nLIMIT 500;",
        "options": {}
      },
      "id": "3e3e3e3e-0001-0000-0000-000000000002",
      "name": "Pull Accounts In Scope",
      "type": "n8n-nodes-base.postgres",
      "typeVersion": 2.4,
      "position": [460, 400],
      "credentials": {
        "postgres": {
          "id": "PLACEHOLDER_POSTGRES_CRED_ID",
          "name": "Postgres — usage-alert-state"
        }
      },
      "notesInFlow": true,
      "notes": "accounts_in_scope holds per-account threshold, min baseline floor, and the owning CSM's Slack user id. Per-account threshold lets enterprise run tighter than self-serve."
    },
    {
      "parameters": {
        "batchSize": 20,
        "options": {}
      },
      "id": "3e3e3e3e-0001-0000-0000-000000000003",
      "name": "Batch Accounts (20/group)",
      "type": "n8n-nodes-base.splitInBatches",
      "typeVersion": 3,
      "position": [680, 400],
      "notesInFlow": true,
      "notes": "Batches keep parallel Amplitude calls under the rate cap and bound retry blast radius. Amplitude's Dashboard REST API caps concurrency low — 20/group with the downstream Wait keeps us safe."
    },
    {
      "parameters": {
        "method": "GET",
        "url": "=https://amplitude.com/api/2/events/segmentation?e={\"event_type\":\"_active\"}&start={{ $now.minus({days: 14}).toFormat('yyyyMMdd') }}&end={{ $now.minus({days: 1}).toFormat('yyyyMMdd') }}&i=7&m=uniques&s=[{\"prop\":\"gp:account_id\",\"op\":\"is\",\"values\":[\"{{ $json.account_id }}\"]}]",
        "authentication": "predefinedCredentialType",
        "nodeCredentialType": "httpBasicAuth",
        "sendQuery": false,
        "options": {
          "response": {
            "response": {
              "fullResponse": false
            }
          },
          "timeout": 20000,
          "retry": {
            "maxTries": 3,
            "waitBetweenTries": 3000
          }
        }
      },
      "id": "3e3e3e3e-0001-0000-0000-000000000004",
      "name": "Amplitude — Weekly Actives (14d)",
      "type": "n8n-nodes-base.httpRequest",
      "typeVersion": 4.2,
      "position": [900, 400],
      "credentials": {
        "httpBasicAuth": {
          "id": "PLACEHOLDER_AMPLITUDE_CRED_ID",
          "name": "Amplitude — API key:secret (Basic)"
        }
      },
      "notesInFlow": true,
      "notes": "Pulls 14 days of weekly-unique actives segmented by account_id user-property. i=7 buckets by week so the response carries two weekly points: last week and the week before. Adjust the event_type and the gp:account_id property name to match your Amplitude taxonomy."
    },
    {
      "parameters": {
        "jsCode": "// Compute week-over-week drop from Amplitude's two weekly buckets and decide if it crosses the per-account threshold.\n// Amplitude segmentation with i=7 returns series values: [week_before, last_week] (oldest first).\nconst account = $('Batch Accounts (20/group)').item.json;\nconst payload = $json;\n\n// Defensive extraction — Amplitude nests the series under data.series[0].\nconst series = payload?.data?.series?.[0] || [];\nconst weekBefore = Number(series[series.length - 2] ?? 0);\nconst lastWeek = Number(series[series.length - 1] ?? 0);\n\nconst threshold = Number(account.drop_threshold_pct ?? 40); // percent drop that triggers an alert\nconst minBaseline = Number(account.min_baseline_events ?? 10); // floor below which the account is too small to judge\n\nlet dropPct = 0;\nlet status = 'ok';\nlet reason = '';\n\nif (weekBefore < minBaseline) {\n  // Baseline too small — a swing from 2 to 1 active user is not a signal, it is noise.\n  status = 'skipped_low_baseline';\n  reason = `baseline ${weekBefore} actives below floor ${minBaseline}`;\n} else {\n  dropPct = Math.round(((weekBefore - lastWeek) / weekBefore) * 100);\n  if (dropPct >= threshold) {\n    status = 'alert';\n    reason = `weekly actives fell ${dropPct}% (from ${weekBefore} to ${lastWeek}) vs the prior week`;\n  } else if (dropPct > 0) {\n    status = 'ok';\n    reason = `down ${dropPct}% — under the ${threshold}% threshold`;\n  } else {\n    status = 'ok';\n    reason = `flat or up (${weekBefore} -> ${lastWeek})`;\n  }\n}\n\nreturn [{\n  json: {\n    account_id: account.account_id,\n    account_name: account.account_name,\n    csm_slack_user_id: account.csm_slack_user_id,\n    segment: account.segment,\n    last_alerted_at: account.last_alerted_at,\n    week_before: weekBefore,\n    last_week: lastWeek,\n    drop_pct: dropPct,\n    threshold,\n    status,\n    reason,\n    checked_at: new Date().toISOString(),\n  }\n}];"
      },
      "id": "3e3e3e3e-0001-0000-0000-000000000005",
      "name": "Compute WoW Drop",
      "type": "n8n-nodes-base.code",
      "typeVersion": 2,
      "position": [1120, 400],
      "notesInFlow": true,
      "notes": "min_baseline_events is the noise guard: an account with 4 actives last week dropping to 2 is a 50% drop but not a signal. Below the floor we mark skipped_low_baseline and never alert."
    },
    {
      "parameters": {
        "conditions": {
          "options": {
            "caseSensitive": true,
            "typeValidation": "strict"
          },
          "conditions": [
            {
              "id": "is-alert",
              "leftValue": "={{ $json.status }}",
              "rightValue": "alert",
              "operator": {
                "type": "string",
                "operation": "equals"
              }
            }
          ],
          "combinator": "and"
        },
        "options": {}
      },
      "id": "3e3e3e3e-0001-0000-0000-000000000006",
      "name": "Crosses Threshold?",
      "type": "n8n-nodes-base.if",
      "typeVersion": 2.2,
      "position": [1340, 400]
    },
    {
      "parameters": {
        "operation": "executeQuery",
        "query": "SELECT alerted_at\nFROM usage_alert_history\nWHERE account_id = $1\n  AND alerted_at > now() - interval '14 days'\nORDER BY alerted_at DESC\nLIMIT 1;",
        "options": {
          "queryReplacement": "={{ $json.account_id }}"
        }
      },
      "id": "3e3e3e3e-0001-0000-0000-000000000007",
      "name": "Lookup Recent Alert",
      "type": "n8n-nodes-base.postgres",
      "typeVersion": 2.4,
      "position": [1560, 300],
      "credentials": {
        "postgres": {
          "id": "PLACEHOLDER_POSTGRES_CRED_ID",
          "name": "Postgres — usage-alert-state"
        }
      },
      "notesInFlow": true,
      "notes": "Cooldown lookup: if this account was already alerted in the last 14 days, suppress the repeat so a sustained dip does not ping the CSM every Monday."
    },
    {
      "parameters": {
        "conditions": {
          "options": {
            "caseSensitive": true,
            "typeValidation": "loose"
          },
          "conditions": [
            {
              "id": "no-recent-alert",
              "leftValue": "={{ $json.alerted_at }}",
              "rightValue": "",
              "operator": {
                "type": "string",
                "operation": "empty"
              }
            }
          ],
          "combinator": "and"
        },
        "options": {}
      },
      "id": "3e3e3e3e-0001-0000-0000-000000000008",
      "name": "Outside Cooldown?",
      "type": "n8n-nodes-base.if",
      "typeVersion": 2.2,
      "position": [1780, 300]
    },
    {
      "parameters": {
        "method": "POST",
        "url": "https://slack.com/api/chat.postMessage",
        "authentication": "predefinedCredentialType",
        "nodeCredentialType": "httpHeaderAuth",
        "sendHeaders": true,
        "headerParameters": {
          "parameters": [
            { "name": "content-type", "value": "application/json" }
          ]
        },
        "sendBody": true,
        "specifyBody": "json",
        "jsonBody": "={\n  \"channel\": \"{{ $('Compute WoW Drop').item.json.csm_slack_user_id }}\",\n  \"text\": \"Usage drop on {{ $('Compute WoW Drop').item.json.account_name }}\",\n  \"blocks\": [\n    {\n      \"type\": \"section\",\n      \"text\": {\n        \"type\": \"mrkdwn\",\n        \"text\": \":chart_with_downwards_trend: *Usage drop — {{ $('Compute WoW Drop').item.json.account_name }}* ({{ $('Compute WoW Drop').item.json.segment }})\\n{{ $('Compute WoW Drop').item.json.reason }}.\"\n      }\n    },\n    {\n      \"type\": \"context\",\n      \"elements\": [\n        { \"type\": \"mrkdwn\", \"text\": \"Weekly actives: {{ $('Compute WoW Drop').item.json.week_before }} -> {{ $('Compute WoW Drop').item.json.last_week }} | threshold {{ $('Compute WoW Drop').item.json.threshold }}% | account {{ $('Compute WoW Drop').item.json.account_id }}\" }\n      ]\n    }\n  ]\n}",
        "options": {
          "timeout": 15000,
          "retry": {
            "maxTries": 3,
            "waitBetweenTries": 2000
          }
        }
      },
      "id": "3e3e3e3e-0001-0000-0000-000000000009",
      "name": "Slack — DM Owning CSM",
      "type": "n8n-nodes-base.httpRequest",
      "typeVersion": 4.2,
      "position": [2000, 300],
      "credentials": {
        "httpHeaderAuth": {
          "id": "PLACEHOLDER_SLACK_CRED_ID",
          "name": "Slack — bot token"
        }
      },
      "notesInFlow": true,
      "notes": "channel set to the CSM's Slack user id sends a DM. The bot must have im:write and chat:write scopes and the user must allow DMs from apps. Swap the channel for a shared #cs-usage-alerts channel if you prefer a team feed."
    },
    {
      "parameters": {
        "operation": "executeQuery",
        "query": "INSERT INTO usage_alert_history (\n  account_id, alerted_at, week_before, last_week, drop_pct, threshold, reason\n) VALUES ($1, now(), $2, $3, $4, $5, $6)\nON CONFLICT (account_id, date_trunc('week', alerted_at)) DO UPDATE SET\n  week_before = EXCLUDED.week_before,\n  last_week = EXCLUDED.last_week,\n  drop_pct = EXCLUDED.drop_pct,\n  threshold = EXCLUDED.threshold,\n  reason = EXCLUDED.reason;\n\nUPDATE accounts_in_scope SET last_alerted_at = now() WHERE account_id = $1;",
        "options": {
          "queryReplacement": "={{ $('Compute WoW Drop').item.json.account_id }},{{ $('Compute WoW Drop').item.json.week_before }},{{ $('Compute WoW Drop').item.json.last_week }},{{ $('Compute WoW Drop').item.json.drop_pct }},{{ $('Compute WoW Drop').item.json.threshold }},{{ JSON.stringify($('Compute WoW Drop').item.json.reason) }}"
        }
      },
      "id": "3e3e3e3e-0001-0000-0000-00000000000a",
      "name": "Persist Alert (idempotent per week)",
      "type": "n8n-nodes-base.postgres",
      "typeVersion": 2.4,
      "position": [2220, 300],
      "credentials": {
        "postgres": {
          "id": "PLACEHOLDER_POSTGRES_CRED_ID",
          "name": "Postgres — usage-alert-state"
        }
      },
      "notesInFlow": true,
      "notes": "ON CONFLICT key (account_id, week) keeps a retried run from double-DMing the CSM and double-logging. last_alerted_at on accounts_in_scope is the fast-path cooldown read."
    },
    {
      "parameters": {
        "amount": 1,
        "unit": "seconds"
      },
      "id": "3e3e3e3e-0001-0000-0000-00000000000b",
      "name": "Throttle Between Batches",
      "type": "n8n-nodes-base.wait",
      "typeVersion": 1.1,
      "position": [1780, 500]
    }
  ],
  "connections": {
    "Weekly Cron — Mon 9am ET": {
      "main": [
        [{ "node": "Pull Accounts In Scope", "type": "main", "index": 0 }]
      ]
    },
    "Pull Accounts In Scope": {
      "main": [
        [{ "node": "Batch Accounts (20/group)", "type": "main", "index": 0 }]
      ]
    },
    "Batch Accounts (20/group)": {
      "main": [
        [{ "node": "Amplitude — Weekly Actives (14d)", "type": "main", "index": 0 }]
      ]
    },
    "Amplitude — Weekly Actives (14d)": {
      "main": [
        [{ "node": "Compute WoW Drop", "type": "main", "index": 0 }]
      ]
    },
    "Compute WoW Drop": {
      "main": [
        [{ "node": "Crosses Threshold?", "type": "main", "index": 0 }]
      ]
    },
    "Crosses Threshold?": {
      "main": [
        [{ "node": "Lookup Recent Alert", "type": "main", "index": 0 }],
        [{ "node": "Throttle Between Batches", "type": "main", "index": 0 }]
      ]
    },
    "Lookup Recent Alert": {
      "main": [
        [{ "node": "Outside Cooldown?", "type": "main", "index": 0 }]
      ]
    },
    "Outside Cooldown?": {
      "main": [
        [{ "node": "Slack — DM Owning CSM", "type": "main", "index": 0 }],
        [{ "node": "Throttle Between Batches", "type": "main", "index": 0 }]
      ]
    },
    "Slack — DM Owning CSM": {
      "main": [
        [{ "node": "Persist Alert (idempotent per week)", "type": "main", "index": 0 }]
      ]
    },
    "Persist Alert (idempotent per week)": {
      "main": [
        [{ "node": "Throttle Between Batches", "type": "main", "index": 0 }]
      ]
    },
    "Throttle Between Batches": {
      "main": [
        [{ "node": "Batch Accounts (20/group)", "type": "main", "index": 0 }]
      ]
    }
  },
  "active": false,
  "settings": {
    "executionOrder": "v1",
    "timezone": "America/New_York",
    "saveExecutionProgress": true,
    "saveManualExecutions": true,
    "callerPolicy": "workflowsFromSameOwner"
  },
  "versionId": "3e3e3e3e-0001-0000-0000-0000000000ff",
  "meta": {
    "templateCreatedBy": "ooligo",
    "instanceId": "ooligo-pilot"
  },
  "id": "usage-drop-alert-n8n",
  "tags": [
    { "name": "customer-success" },
    { "name": "cs-ops" },
    { "name": "alerting" }
  ]
}