Humans have been generally considered as experts for face identity processing (FIP). However, increasing evidence demonstrates considerable individual differences in behaviorally measured FIP ability - from individuals with profound impairments, to so-called "Super-Recognizers" (SRs). Despite their theoretical and practical implications, e.g. for law enforcement and the judicial system, the sources of FIP ability variation remain elusive. To date \textit{objective and reliable} human FIP ability assessment is critical but lacking, and our understanding of the brain-behavior mechanisms involved is limited. We explored the reliability of frequency-tagged neural correlates of high-level visual perception obtained across considerable time periods using fast periodic visual stimulation (FPVS) EEG in a unique cohort of participants: law enforcement practitioners with documented superior FIP ability. Adopting a deep-data approach, we provide an in-depth report of behavioral and neural data collected from 11 Berlin Police SRs. Specifically, we relate behavioral performance across eight challenging and professionally relevant behavioral FIP tests, to FPVS EEG data for three experiments measuring neural face categorization and face discrimination, measured twice with a 2.5 year re-test interval. Overall, we find medium-to-high test/re-test reliability of SRs' FPVS EEG responses –– both at the group and the individual level, and across experiments. Notably, this contrasts with previous reports of low-to-modest behavioral and neural re-test reliability among neurotypical civilian samples. This seminal report of neural reliability in law enforcement SRs represents a benchmark for future research and practice, in particular high-stakes applied scenarios relying on human (expert, witness) judgments of unfamiliar facial identity.