All work
122024SCAD

Smart Form Validator (Arabic Address & Name Parsing)

Challenge

Field-collected forms had inconsistent Arabic name and address formatting that broke downstream record linkage.

Approach

Light GPT-3.5 normaliser + transliteration model that standardises names, splits address components, and validates them against the national address registry.

Impact

  • Improved record-linkage match rate from 72% to 94%
  • Replaced 8 brittle regex rule sets with one model + small ruleset
  • Runs in-line on form submit, sub-200ms p95 latency
+22pp
match
200ms
latency
-8 systems
rules

Tech stack

GPT-3.5Azure FunctionsArabic Transliteration

Ask anything about Smart Form Validator (Arabic Address & Name Parsing)

AI scoped to this project · Llama 3.3 70B