Methodological proof of concept. This case is a thought-out architecture for the private-dentistry niche, especially focused on FZ-152 (patient PII in RF DB, GigaChat without identifiers). Haven't done it for external clients in this vertical yet. For the first engagement: −30% off list + 90 days extended warranty + full FZ-152 sign-off.

Ready to deliver · Healthcare · FZ-152

Voice agent for dentistry: pain triage + IDENT booking

A private dental clinic in St. Petersburg. The chief doctor personally responded to WhatsApp evenings and on weekends because 60% of inquiries were "acute pain" and the reception couldn't tell urgent from routine. A voice agent with a priority triage classifier and IDENT booking dropped reaction time from 47 minutes to 4.

Industry

Dentistry (private, NDA)

Stack

Vapi · GigaChat · IDENT · n8n

Timeline

~6 weeks

Outcome

47 min → 4 min reaction

Compliance

FZ-152

01 · Pain Point

Chief doctor responding on WhatsApp at 23:40

Private dentistry with two rooms. 60% of inquiries — "it hurts a lot, need it now". The rest — routine: cleaning, checkup, aesthetics. The cost of a triage error is high in both directions: missing acute pain means a lost patient and a medical issue; treating routine as urgent means a missed slot needed for an acute patient.

Daytime reception and WhatsApp at night and on weekends poorly distinguished one from the other. Scoring on "my tooth hurts" text gives nothing: the same phrase is written by someone who ate cold food yesterday and by someone with fever and a swollen cheek.

In the end, the chief doctor read every message and made the decision. That's 18 hours per week of personal time, half of which is evenings and weekends.

02 · Solution

Priority 1/2/3 triage classifier → direct booking to IDENT

The voice agent collects a structured intake by 8 items: pain character, duration, radiation, temperature, swelling, response to temperature/pressure, previous interventions, medications. The GigaChat classifier assigns priority 1 (immediately), 2 (within 24 hours), or 3 (routine booking). IDENT receives the booking directly via an n8n pipeline. Priority 1 is escalated in parallel to the chief doctor on Telegram.

01

Vapi inbound

Replacement number + WhatsApp bridge, voice or text by choice

02

Intake

GigaChat asks 8 structured questions about pain character

03

Triage P1/P2/P3

Classifier per medical prompt template + red-flag rules

04

n8n → IDENT

Booking into the right time slot by priority; conflict resolution

05

Escalation

P1 → Telegram to chief doctor; P2/P3 → SMS to patient with confirmation

Triage protocol assembled from 8 items

The prompt classifier doesn't analyze a single "it hurts" line — it leads a dialogue following a fixed template: pain character (sharp/dull/throbbing), duration (hours/days), radiation (does it radiate to jaw/temple/ear), body temperature, soft-tissue swelling, reaction to cold/hot/pressure, previous interventions in this area, medications taken. Output — structured JSON that feeds the priority classifier and the IDENT record.

Red flags override the LLM decision

On top of GigaChat run deterministic red-flag rules: temperature > 38°C + swelling = automatic priority 1, bypassing LLM; inability to open the mouth = priority 1; tooth trauma in the last 6 hours = priority 1. The LLM cannot "change its mind" on these cases — that's the insurance against hallucination.

FZ-152: PII on RF servers, no identifiers in LLM

Patient PII (full name, phone, medical data) is stored in PostgreSQL on Selectel in RF. GigaChat receives de-identified data: age group ("45-55"), gender, symptoms — no name, phone, passport data. The session_id ↔ patient mapping is stored locally and doesn't leave the clinic perimeter. Before launch, consent for processing was obtained via the integrated consent flow.

Direct booking to IDENT via n8n

The n8n pipeline takes the classified request and writes to the IDENT API: picks the room, doctor, nearest priority-appropriate slot. If no slot is available for priority 1 — escalation to chief doctor on Telegram with info for manual decision (reschedule a routine, open an additional window, or redirect to a duty clinic).

03 · Stack

FZ-152-compliant stack, no PII leaving the country

Vapi

Voice channel: replacement number + bridge to clinic's WhatsApp channel

GigaChat 2 Max

LLM intake and triage classifier; runs on de-identified data

IDENT API

Patient booking into the right room/slot, conflict checks

n8n (self-hosted)

Pipeline: classifier → IDENT → escalation, conflict resolution

Telegram Bot API

Priority 1 escalation to chief doctor with brief intake summary

PostgreSQL on Selectel RF

Patient PII, session ↔ patient mapping — inside RF perimeter

FZ-152 consent flow

PII processing consent before dialogue start; consent audit log

Detect-rules layer

Deterministic red flags override LLM decision on acute cases

VapiGigaChat 2 MaxIDENT APIn8nTelegramSelectel RFPostgreSQLFZ-152 layer

04 · Results

What changed in numbers

Emergency slot occupancy

41% 88%

correctly classified P1 lands in the right slot

P1 reaction time

47 min 4 min

from first contact to confirmed booking

Chief doctor — personally

18 hrs/wk 2 hrs/wk

only real P1 + review of ambiguous cases

+ operating day

+₽180k/wk

freed-up time = +1 working day/wk

The main win isn't the chief doctor's hours, it's medical quality: an acute patient gets into the chair in minutes, a routine one — into a convenient routine slot. No more "come in, we'll take a look" or "let me check and call you back within the hour".

Bonus we didn't bake into the KPI: the chief doctor now sees a dashboard distribution of P1/P2/P3 by day of week and hour — it became clear when extra slots are needed for emergencies and when for planning meetings and aesthetics.

05 · Where it fits

Where else the same methodology applies

Universal pattern — "structured intake → triage → booking with escalation". Fits anywhere incoming flow is mixed by urgency and the cost of sorting errors is high:

→ Multi-specialty clinics — triage between specialists based on symptoms
→ Veterinary clinics — acute animal pain vs routine vaccination
→ Equipment service centers — breakdown stop vs preventive maintenance; urgent dispatch
→ Legal consultations — burning deadlines vs standard procedural questions
→ B2B technical support — production-down vs feature request

What's reused on subsequent projects

Intake-prompt template with structured items and JSON-schema output
Detect-rules layer: deterministic rules override LLM on critical cases
FZ-152-compliant architecture: PII in RF, LLM gets de-identified data
Escalation pipeline to Telegram/Slack with brief summary and action buttons

Similar challenge in your business?

If your incoming flow is mixed by urgency and people make decisions manually — it can be automated

Medical and service scenarios with PII separately require FZ-152-compliant architecture — we know how to make do with LLM on de-identified data and store PII inside RF. Time to production — 5-7 weeks depending on CRM/medsystem integration complexity.

Discuss my project All cases

Готовы начать?

Аудит за 5 000 ₽ — с конкретным отчётом и сметой

Расскажу что внедрить в вашем бизнесе в первую очередь, какая будет окупаемость, и нужен ли вообще AI для вашей задачи (иногда — нет).

Записаться на аудит Написать в Telegram

Или просто напишите свой вопрос — отвечу в течение 2 часов