Open AccessArticle Transitioning from Laboratory-Developed Tests to a Single Commercial Reagent Kit in a National Newborn Screening Program: Impact on Analytical Performance and Harmonization 1 Biochemical Sciences, Synnovis, Guys & St Thomas’ NHSFT, London SE1 7EH, UK 2 GKT School of Medical Education, Kings College London, London WC2R 2LS, UK 3 Newcastle Metabolic and Newborn Screening, Royal Victoria Infirmary, Newcastle upon Tyne NE1 4LP, UK 4 Biochemical Genetics, St James’s University Hospital, Leeds LS9 7TF, UK 5 Newborn Screening, South Yorkshire and Bassetlaw Pathology, Sheffield Teaching Hospitals NHSFT, Sheffield S10 2TH, UK 6 Alder Hey Children’s NHS(FT), Liverpool L12 2AP, UK 7 Newborn Screening and Metabolic Biochemistry Department, North Bristol NHS Trust, Southmead Hospital, Bristol BS10 5NB, UK 8 Newborn Screening and Biochemical Genetics, Birmingham Women’s and Children’s Hospitals NHSFT, Birmingham B4 6NH, UK 9 Blood Sciences, Portsmouth Hospitals Trust, Portsmouth PO6 3LY, UK 10 Biochemical Genetics Unit, Cambridge University Hospitals NHS FT, Cambridge CB2 0QQ, UK add Show full affiliation list remove Hide full affiliation list * Author to whom correspondence should be addressed. Int. J. Neonatal Screen. 2026, 12(2), 41; https://doi.org/10.3390/ijns12020041 (registering DOI) Submission received: 1 May 2026 / Revised: 30 May 2026 / Accepted: 3 June 2026 / Published: 9 June 2026 Newborn screening in England is a national program with laboratories adhering to common screening algorithms. Until recently, screening for inherited metabolic disorders was provided by ten laboratories using laboratory-developed tests (LDTs) and three using commercial assays: harmonization of results proved challenging. Introduction of hereditary tyrosinemia type 1 screening meant LDTs required modification to include the measurement of succinylacetone, and subsequent re-validation. This provided an opportunity to implement a single commercial reagent kit in all laboratories. It was anticipated that this would improve analytical performance and harmonization. This study aimed to determine whether these goals were achieved. Verification across the 13 laboratories revealed that the commercial kit reduced inter-laboratory variation for all analytes demonstrating improved harmonization. However, this was achieved by applying instrument-specific correction factors to all analytes, the magnitude of which were significant, indicating a lack of standardization. Performance of succinylacetone was limited by instrument-dependent background interference from the methionine stable isotope label, underscoring the need to establish evidence-based screening cut-off values (COV) rather than adopting published thresholds. This study emphasizes the need for traceable reference materials to improve laboratory quality and the value of screening outcome data. In 2024, the English government approved the introduction of screening for hereditary tyrosinemia type 1 (HT1; OMIM #276700) and specified that screening should be performed using succinylacetone (SA) in a dried blood spot (DBS). Irrespective of sample matrix or methodology, measurement of SA is acknowledged to be challenging [ 11, 12, 13]. Extraction of SA from a DBS requires an acidic solution which means that methanol, which is used to extract amino acids and acylcarnitines in the LDTs, is unsuitable. Early methods [ 14] accommodated this by introducing a two-step process: extracting the amino acids and acylcarnitines from the DBS and removing the supernatant, then adding hydrazine to the residual DBS punch to extract and derivatize the SA. Further optimization of the methods enabled the simultaneous extraction of amino acids, acylcarnitines and SA through two different approaches. The first extracted and derivatized amino acids and acylcarnitines using butyl esterification and then extracted and derivatized SA from the residual DBS punch using hydrazine, before combining the extracts and measuring all analytes by FIA-MS/MS [ 15, 16]. The second approach extracted all analytes from the DBS in a single step via the sequential addition of methanol and hydrazine containing solutions [ 17]. In addition to these LDTs, at least four commercial reagent kits are now available. It is evident from previous reports [ 18, 19] that there are significant differences between methods used to measure SA, with LDTs showing more variability than commercial kits, irrespective of whether they are derivatized or non-derivatized methods. The Neobase™ 2 (NB2) method group displays the smallest inter-laboratory variation and has the highest number of users. Under-recovery is an issue for all methods: a proficiency sample enriched to a concentration of 50 µmol/L had a mean measured concentration of 23.4 µmol/L (approximate range 10 to 100) [ 18]. As expected, this absence of harmonization and/or standardization is reflected in the range of SA COVs used in screening programs throughout the world. A worldwide survey of NBS programs [ 19] found screening COVs ranged from 0.3 to 7.0 µmol/L (median 1.5, n = 31), whilst data from the Newborn Screening Quality Assurance Program (NSQAP) in 2022 reported the ranges in U.S and international laboratories were 0.4 to 6.1 µmol/L (median 2.0, n = 40) and 0.4 to 8.0 µmol/L (median 1.8, n = 112) respectively. The higher screening COVs tend to be associated with LDTs, irrespective of whether they are derivatized or non-derivatized, with less variation evident within a kit method group; for example, NB2 COVs ranged from 0.6 to 2.0 μmol/L ( n = 10), whereas LDTs ranged from 1.0 to 7.0 μmol/L ( n = 9). In the absence of harmonization, comparison of COVs has limited utility at best, and at worst could be misleading. This study aimed to verify analytical performance of NB2 across the 13 laboratories, compare population distributions obtained using NB2 with historic LDT data and determine whether the transition to a single commercial kit improved harmonization. 2. Materials and Methods Neobase™ 2 non-derivatized MSMS installation kit, non-derivatized assay solutions, succinylacetone solution, frozen controls (low, high), frozen internal standards, and multi-level DBS were obtained from Revvity (Turku, Finland). Third-party internal quality control (IQC) materials, MassCheck ପ୍ପ Amino Acids, and Acylcarnitines EXTENDED dried blood spot control levels I, II and III were obtained from Chromsystems (Munich, Germany). All materials were used as specified by the manufacturer. All 13 English NBS laboratories participated in this study and each laboratory verified the method on two instruments; 4 Waters TQD; 8 Waters Xevo TQD; 5 Waters TQ-S micro; 3 AB Sciex API 4500; 1 AB Sciex API 5000; 1 AB Sciex API 5500; 2 AB Sciex API 6500; and 2 Shimadzu 8050. Initial optimization of the NB2 method on each instrument was performed by Revvity Field Service Engineers. Mass positions, mass resolution and compound dependent parameters, e.g., cone voltage, collision energy, and dwell time, were optimized by direct infusion of a tune solution prepared from the frozen internal standards. MStune and acquisition files, autosampler configurations and inlet method files, and post-analytical data processing files were created for the following analytes and their corresponding stable isotope label (SIL): phenylalanine (PHE); tyrosine (TYR); leucine (LEU); methionine (MET); isovalerylcarnitine (C5); glutarylcarnitine (C5DC); octanoylcarnitine (C8); decanoylcarnitine (C10); and SA. All samples were prepared as instructed by the manufacturer. Each of the nine analytes was quantified from the ratio of analyte to internal standard (IS) signal response, multiplied by IS concentration. A default relative response factor (RRF) for each analyte was included in the data processing method. RRF are analyte specific multiplication factors applied to the final measured result for each analyte. Preliminary assessment of intra-batch imprecision and linearity was performed by replicate analysis of frozen kit controls ( n = 30, two levels) and multi-level DBS controls ( n = 4, six levels) respectively. Results for the nine analytes were compared with the manufacturer-assigned values. Subsequent optimization of the RRFs for each instrument was performed by each laboratory. Replicates ( n = 12) of low and high kit control materials were analyzed on each of three different days. The mean measured results for each material were compared with the manufacturer-assigned values for the nine analytes. The final RRF for each analyte was the mean of the RRF from the replicates of the low and high control material. Verification of the method on each instrument was performed by all laboratories according to a common protocol. Linearity was verified using the six DBS multi-level materials with acceptable performance defined as R 2 > 0.99 ( n = 6). The lower limit of quantitation (LLOQ) was not formally assessed due to the lack of matrix matched reference materials, the complexity associated with accurately preparing DBS samples and the absence of clinical impact at this level. Instead, replicates ( n = 12) of endogenous material (DBS multi-level 1) were analyzed and imprecision at this level was compared with standard acceptance criterion (coefficient of variation (CV) ≤ 20%, n = 5 from at least three different batches). Recovery was assessed by replicate analysis ( n = 3) of the six DBS multi-level materials, after the measured value had been corrected for endogenous concentration (back calculated). Carry-over was assessed by injection of an extracted blank (filter paper only) directly after injecting extracted kit high IQC material. Replicates of the blank ( n = 6) were injected followed by six alternate replicates of a high IQC and blank, followed by a further six blanks: BBBBBBCBCBCBCBCBCBBBBBB. Acceptable performance was defined as signal intensity for the blank ≤ 20% of the signal intensity in DBS multi-level L1. Intra-laboratory variation was assessed by replicate analysis ( n = 20) of three third party IQC materials, on each instrument. Inter-laboratory variation was assessed by replicate analysis ( n = 5) of three third-party IQC materials on each of five different days, on all instruments. Acceptable performance for inter-laboratory variation was defined as CV ≤ 15%, except at the LLOQ where CV ≤ 20% was accepted. Inter-lot variation was determined using the same 5 × 5 format but with the IS kit lot becoming the variable, rather than instrument or laboratory. The percentage relative difference between the mean of each lot of IS was determined rather than the mean bias to account for the fact that it is not known which lot represents the ‘true’ value. On-board stability was assessed by repeat measurement of residual DBS specimens ( n = 82) and IQC materials that had been left on the autosampler at 10 °C overnight. Samples were reconstituted with 100 µL of (a) 50% methanol and (b) 50% acetonitrile and mixed prior to re-injection. Stability was defined as no significant constant or proportional bias, and plate means within ±10%. The degree of interference from the MET SIL to endogenous SA was estimated by monitoring the signal intensity ratio of SA to MET SIL in extraction working solution on representative instruments, with extended longitudinal monitoring performed at laboratory 11. To evaluate whether the clinical COV currently specified in the national screening algorithm for the LDTs would remain applicable for NB2, each laboratory analyzed approximately n = 1000 residual NBS specimens on each instrument (final data set n = 22,049). Population data was reviewed by instrument, laboratory and collectively. Statistical analysis included determination of mean, median, and population centiles (P1, P10, P90, P99) for each analyte and comparison with historic data ( n = 131,128) obtained using the LDTs between October and December 2024. The median absolute deviation (MAD) was calculated to provide a robust measure of statistical dispersion around the median; however, emphasis was placed on the P90 as this was the centile closest to the clinical COV at which the data set size would provide a statistically meaningful confidence interval across all 26 instruments. The analytical COV is set approximately 20% below the screening COV. Samples which exceed the analytical COV are re-tested in duplicate and if the mean of the three results exceeds the screening COV, the baby is classified as screen-positive and referred for confirmatory testing. Inter-laboratory variability was assessed using the standard deviation (SD) of laboratory level MADs. The ratio of between-laboratory SDs of MADs (NB2/LDT) was used to assess relative variability with ratios 1.2. A review of the instrument model showed that the Waters TQ-S Micro was associated with the highest mean RRFs for PHE (1.13), TYR (1.14) and LEU (1.09) compared to the other models, although the significance of this finding is unclear. Laboratory 7 stood out as anomalous with all RRFs > 1.0 and a mean across both instruments of 1.16 (range 1.01–1.55). This pattern is quite striking: there is not a single RRF below 1.0 across the entire laboratory, which is unusual given that RRFs could logically be expected to be scattered either side of 1.0. Overall, 13% of RRFs were adjusting the measured result by more than 20%. The underlying principle of stable isotope dilution mass spectrometry (SID-MS) is that the presence of an isotopically enriched version of the analyte of interest will compensate for any variability due to extraction efficiency, ionization efficiency and matrix suppression/enhancement because it will behave identically to the native analyte. The magnitude and range of the RRFs reported here are at odds with this and appear more indicative of fundamental differences between instruments, particularly given the commonality of stable isotopes, reagents and other materials. Linearity was demonstrated to 1434, 1304, 1260, 842, 42, 20, 7, 7 and 183 µmol/L for PHE, LEU, TYR, MET, C8, C5, C5DC, C10 carnitines and SA respectively for most instruments, consistent with the manufacturer’s claims. The exceptions to this were: Sciex 4500 (3A, 4B) which went non-linear between 717 and 1434 µmol/L for PHE, and 652 and 1304 µmol/L for LEU; Sciex 4500 (3A) which went non-linear between 21 and 42 µmol/L for C8 carnitine; and Sciex 5000 (9B) which went non-linear between 717 and 1434 µmol/L for PHE, 21 and 42 µmol/L for C8, and 3.6 and 7.1 µmol/L for C10 carnitine. As these regions of non-linearity are well removed from the COVs typically used by NBS programs around the world, this is not problematic per se; however, it highlights the need for laboratories to establish linearity for each instrument to ensure the screening outcome data is meaningful. Importantly, it also emphasizes that SID-MS-based screening methods may be unsuitable for monitoring dietary therapy in patients with phenylketonuria. At a PHE concentration of 600 µmol/L, a key decision point in terms of patient management [ 22], instrument 3A and 4B would report concentrations of approximately 503 and 442 µmol/L respectively. The mean recovery from all instruments was 100 ± 20% for all analytes except for succinylacetone (53%, range 31–72). See Table S2 in Supplementary Data. Also notable was the consistent over-recovery of C10 (112%, range 97–130) on all but two instruments. Recovery was determined relative to the Revvity multi-level DBS materials which is an acknowledged limitation as these are not higher-order standards but have manufacturer-assigned values; however, given that the mean RRF for C10 was 1.22 ( Table S1), this points to a property of the kit materials, either the frozen controls or the C10 SIL, rather than an instrument-specific analytical issue, and underlines the absence of higher-order traceable reference materials against which to independently verify these values. Carry-over was negligible for all analytes on all instruments. Mean intra- and inter-laboratory variation is summarized in . Mean inter-lot variation across all laboratories for all analytes in each of the three levels of third-party IQC materials was ≤15% and directly comparable with the manufacturers’ claims. The exception to this was SA with 35.2, 18.0 and 25.3% CV for levels 1, 2 and 3 respectively. The mean bias (range) between lots was within ± 3% (−1.7 to 2.6). Results are summarized in Table S3. These data were used to establish an evidence-based process for verifying a new lot of IS prior to routine use and specifying the acceptance criteria. On-board stability was demonstrated for samples reconstituted in 100 µL of either 50% methanol or 50% acetonitrile. When reconstituted in 50% methanol, plate means were within ±10% for all analytes except SA (20.1%). When reconstituted in 50% acetonitrile, plate means were within ±10% for all analytes except C10 (+13%) and SA (34%). As it is acknowledged that endogenous SA is below the LLOQ, the nominal difference between plate means was not considered to be of clinical significance and importantly, at the higher concentrations seen in IQC materials, mean bias was within ±10% for all analytes, and the bias across the different analytes was random, not consistent, irrespective of which solvent was used for reconstitution. 3.2. Lower Limit of Quantitation and the Impact of Isotopic Interference on Succinylacetone Quantitation Replicate measurements of DBS multi-level 1 material on each instrument met the imprecision criteria for LLOQ for all analytes except for SA. SA failed the imprecision criteria on Waters TQD (10B, 12B), Waters Xevo TQD (1A, 1B, 2A, 8A, 13B) and Shimadzu 8050 (9A), and failed bias criteria on all instruments. Endogenous concentrations of SA in DBS specimens from healthy neonates are essentially undetectable by this methodology; the signal measured in DBS multi-level 1 reflects cross contribution from the MET SIL rather than true endogenous SA. The MET SIL is 2H 3-methionine (transition 153.1 > 107.1), which contains a naturally abundant +2 isotopic contribution (155.1 > 109.1) of approximately 4% due to 34S. This transition is identical to that of the stable SA derivative, 3-(5-methyl-1H-pyrazol-3-yl) propanoic acid (MPP), formed during reaction with hydrazine and measured as a surrogate marker for SA. Consequently, background signal attributable to the MET SIL contributes directly to the apparent endogenous SA concentration. At higher concentrations, SA can be quantified with acceptable precision despite the background contribution from the MET SIL. This was demonstrated using the third-party level two QC material (nominal concentration 2.2 µmol/L) where mean intra-laboratory imprecision was 11.6% and inter-laboratory variation was 9.6%. These findings indicate that the LLOQ for SA lies between the instrument specific background signal and that of the level two IQC material. Importantly, the magnitude of this background is instrument-specific, varying from approximately 0.2 µmol/L on a Waters Xevo TQD, to 0.7 µmol/L on a Sciex 6500. This signal does not reflect the theoretical 4% isotopic contribution expected from natural 34S abundance, as the manufacturer ‘de-tunes’ each MS to minimize the interference. De-tuning is achieved by direct infusion of a solution containing only the SILs into the MS. The MET SIL transition is optimized to achieve maximum signal intensity as normal, whereas the cone and collision energy (CE) for the SA SIL transition are adjusted to minimize the signal intensity recorded in the SA channel, i.e., that which is due to the MET SIL, whilst ensuring the signal from the SA SIL remains adequate. The sweet spot appears to be when the signal ratio of SA SIL: SA is >1000 which is typically achievable with a Waters Xevo TQD. However, on an older instrument such as a Waters TQD, this ratio is closer to 500, and on an AB Sciex API 6500, it is only around 50 because SA and MET show very similar CE vs intensity curves making it impossible to discriminate between them in this way (A–E and Figure S1). Longitudinal monitoring of the signal intensity ratio for MET SIL to apparent endogenous SA on two Waters Xevo TQDs (11A and B) demonstrated that the magnitude of the background interference remains stable for a given instrument despite expected fluctuations in instrument performance (). The only notable deviation occurred during a three-day period when the collision gas supply failed, during which the ratio gradually increased toward the theoretical 4% isotopic contribution. Regular monitoring of this ratio provides a pragmatic means of identifying instrument-specific limitations of the assay and differences in SA background interference between instruments, within and across laboratories. 3.3. Population Distributions and Evaluation of Screening Cut-Off Values Determining the COV for SA required careful consideration of the instrument-specific background interference from the MET SIL, which effectively defines the LLOQ for each instrument. Review of the literature highlighted the wide range of COVs in use internationally and the impossibility of making meaningful comparisons across methods that are neither harmonized nor standardized. Kuypers et al. [ 19] reported a median SA COV of 1.5 µmol/L (range 0.3–7.0) across 31 screening programs; for those using NB2 specifically, COVs ranged from 0.3 to 2.0 µmol/L. Given the instrument-dependent background interference reported here (mean 0.22 µmol/L, range 0.05–0.62), adoption of the lower end of this range would provide little or no discrimination from background signal in the normal population, underscoring the inadequacy of adopting published COVs without reference to local analytical performance. The UK population data were therefore central to establishing the COV; however, it should be noted that standard confidence interval methods for population centiles could not be applied to SA as the data reflected the instrument specific interference, rather than the endogenous concentration of SA in healthy neonates. Across 26 instruments, the P99 for SA ranged from 0.21 to 0.99 µmol/L ( and ). Most instruments achieved P99 values 109.10) and the +2 ( 34S) isotope of the methionine SIL (155.1 > 109.10) during direct infusion of internal standard mixture on a Waters Xevo TQD (red) and a Sciex 6500 (blue). The dotted line indicates the optimised collision energy for each instrument. On the Xevo TQD, maximum signal intensity of the two transitions occurs at different collision energies, enabling discrimination of the interference by collision energy optimisation. On the Sciex 6500, maximum intensity for both transitions occurs at very similar collision energies, precluding effective discrimination. Table S3: Inter-lot variation for each analyte across all thirteen laboratories for the three third party IQC materials. Lot A and B refer to internal standard kit lots 759382 and 766157 respectively. Figure S2: Comparison of succinylacetone population data by instrument type, and for all 13 laboratories. The dashed line and the shaded band represent the P90 and the MAD around the P90 for all laboratories. The solid line represents the clinical cut-off value. Concentration in µmol/L. Author Contributions Conceptualization, R.S.C.; methodology, R.S.C.; software, S.C.W.; validation, R.S.C., Z.J.B., R.B., H.K., S.D., T.G., S.L.H., R.M., D.P., K.W. and T.H.Y.W.; formal analysis, R.S.C. and S.C.W.; investigation, Z.J.B., M.A., H.B., K.C., P.C., N.C., S.H., O.P., T.R., J.S. and N.T.; resources, J.R.B.; data curation, R.S.C. and S.C.W.; writing—original draft preparation, R.S.C.; writing—review and editing, All; visualization, R.S.C. and S.C.W.; project administration, J.R.B. All authors have read and agreed to the published version of the manuscript. Funding This research received no external funding. Article processing charges were kindly paid by UK Newborn Screening Laboratory Network. Institutional Review Board Statement The study was conducted in accordance with the Declaration of Helsinki, and approved by the NHS Population Screening Research, Innovation and Development Advisory Committee (RIDAC) (protocol code ANNB-2425-007, approved on 23 April 2025). Informed Consent Statement Patient consent was waived. All specimens were anonymized. The data that support the findings of this study are available from NHS England. Restrictions apply to the availability of these data. Data are available from the corresponding author (RSC) with the permission of NHS England. Acknowledgments We would like to thank all staff in the English NBS laboratories who provide the routine screening service daily, in particular Rachelle Garstone, Philippa Goddard Fiona Jenkinson, Laura Wainwright, Catherine Brewis, Deborah Finnerty, Caroline Griffith, Vriti Hansraj, Sophie Khaihra, Lynette Shakespeare and Krystal Rawstron. We would also like to thank the NHSE Hereditary Tyrosinemia Type 1 Project team for their support and encouragement, and the team at Revvity for their technical support during the implementation. Conflicts of Interest James R. Bonham is the Editorial Board Member of the International Journal of Neonatal Screening. To ensure editorial independence, James R. Bonham had no involvement in the peer-review process or editorial decision for this manuscript. Other authors declare no conflicts of interest. Abbreviations The following abbreviations are used in this manuscript: COV Cut-off value CV Coefficient of variation C10 Decanoylcarnitine DBS Dried blood spot CE European conformity FIA-MS/MS Flow injection analysis tandem mass spectrometry GA1 Glutaric aciduria type 1 C5DC Glutarylcarnitine HT1 Hereditary tyrosinemia type 1 IQC Internal quality control IS Internal standard IVDR In Vitro Diagnostic Regulation (EU) 2017/746 IVA Isovaleric acidemia C5 Isovalerylcarnitine LDT Laboratory-developed test LEU Leucine LLOQ Lower limit of quantitation MSUD Maple syrup urine disease MS Mass spectrometer MAD Median absolute deviation MCADD Medium-chain Acyl-CoA dehydrogenase deficiency MET Methionine MHRA Medicines & Healthcare products Regulatory Agency MPP 3-(5-methyl-1H-pyrazol-3-yl) propanoic acid NB2 Neobase™ 2 NBS Newborn screening NSQAP Newborn Screening Quality Assurance Program C8 Octanoylcarnitine PHE Phenylalanine PKU Phenylketonuria PPV Positive predictive value HCU Pyridoxine unresponsive homocystinuria RRF Relative response factor SIL Stable isotope label SD Standard deviation SA Succinylacetone SID-MS Stable isotope dilution mass spectrometry TYR Tyrosine References George, R.S.; Moat, S.J. Effect of Dried Blood Spot Quality on Newborn Screening Analyte Concentrations and Recommendations for Minimum Acceptance Criteria for Sample Analysis. Clin. Chem. 2016, 62, 466–475. [ Google Scholar] [ CrossRef] Carling, R.S.; John, C.; Goddard, P.; Griffith, C.; Cowen, S.; Hopley, C.; Moat, S.J. Evaluation of a Common Internal Standard Material to Reduce Inter-Laboratory Variation and Ensure the Quality, Safety and Efficacy of Expanded Newborn Screening Results When Using Flow Injection Analysis Tandem Mass Spectrometry with Internal Calibration. Int. J. Neonatal Screen. 2020, 6, 92. [ Google Scholar] [ CrossRef] [ PubMed] Knowles, R.L.; Oerton, J.; Cheetham, C.; Butler, G.; Cavanagh, C.; Tetlow, L.; Dezateux, C. Newborn Screening for Primary Congenital Hypothyroidism: Estimating Test Performance at Different TSH Thresholds. J. Clin. Endocrinol. Metab. 2018, 103, 3720–3728. [ Google Scholar] [ CrossRef] [ PubMed] Carling, R.S.; Whyte, E.; John, C.; Garstone, R.; Goddard, P.; Greenfield, T.; Hogg, S.L.; Le Masurier, C.; Cowen, S.; Moat, S.J.; et al. Improving Harmonization and Standardization of Expanded Newborn Screening Results by Optimization of the Legacy Flow Injection Analysis Tandem Mass Spectrometry Methods and Application of a Standardized Calibration Approach. Clin. Chem. 2022, 68, 1075–1083. [ Google Scholar] [ CrossRef] Cantley, N.W.P.; Barski, R.; Kemp, H.; Hogg, S.L.; Wu, H.Y.T.; Bowron, A.; Collingwood, C.; Cundick, J.; Hart, C.; Shakespeare, L.; et al. Incidental Detection of Classical Galactosemia through Newborn Screening for Phenylketonuria: A 10-Year Retrospective Audit to Determine the Efficacy of This Approach. Int. J. Neonatal Screen. 2024, 10, 2. [ Google Scholar] [ CrossRef] Carling, R.S.; Hedgethorne, K.; Chakrapani, A.; Hall, P.L.; Flynn, N.; Greenfield, T.; Moat, S.J.; Ssali, J.; Shakespeare, L.; Taj, N.; et al. Retrospective Review of Positive Newborn Screening Results for Isovaleric Acidemia and Development of a Strategy to Improve the Efficacy of Newborn Screening in the UK. Int. J. Neonatal Screen. 2024, 10, 24. [ Google Scholar] [ CrossRef] Flynn, N.; Moat, S.J.; Hogg, S.L. A computer vision approach to the assessment of dried blood spot size and quality in newborn screening. Clin. Chim. Acta 2023, 547, 117418. [ Google Scholar] [ CrossRef] Pickens, C.A.; Sternberg, M.; Seeterlin, M.; De Jesús, V.R.; Morrissey, M.; Manning, A.; Bhakta, S.; Held, P.K.; Mei, J.; Cuthbert, C.; et al. Harmonizing Newborn Screening Laboratory Proficiency Test Results Using the CDC NSQAP Reference Materials. Int. J. Neonatal Screen. 2020, 6, 75. [ Google Scholar] [ CrossRef] Dorley, M.C.; Dizikes, G.J.; Pickens, C.A.; Cuthbert, C.; Basheeruddin, K.; Gulamali-Majid, F.; Hetterich, P.; Hietala, A.; Kelsey, A.; Klug, T.; et al. Harmonization of Newborn Screening Results for Pompe Disease and Mucopolysaccharidosis Type I. Int. J. Neonatal Screen. 2023, 9, 11. [ Google Scholar] [ CrossRef] la Marca, G.; Carling, R.S.; Moat, S.J.; Yahyaoui, R.; Ranieri, E.; Bonham, J.R.; Schielen, P.C.J.I. Current State and Innovations in Newborn Screening: Continuing to Do Good and Avoid Harm. Int. J. Neonatal Screen. 2023, 9, 15. [ Google Scholar] [ CrossRef] Bouva, M.J.; Kuypers, A.M.; Kemper, E.A.; Maase, R.E.; Bosch, A.M.; van Spronsen, F.J.; Heijboer, A.C.; Heiner-Fokkema, M.R.; Heil, S.G.; Boelen, A. Evaluation of the Performance of Newborn Screening for Tyrosinemia Type 1 in The Netherlands: Suggestions for Improvements Using Additional Biomarkers in Addition to Succinylacetone. Int. J. Neonatal Screen. 2025, 11, 35. [ Google Scholar] [ CrossRef] Laeremans, H.; Turner, C.; Andersson, T.; Angel Cocho De Juan, J.; Gerrard, A.; Heiner-Fokkema, M.R.; Herebian, D.; Janzen, N.; la Marca, G.; Rudebeck, M. Inter-laboratory analytical improvement of succinylacetone and nitisinone quantification from dried blood spot samples. JIMD Rep. 2020, 53, 90–102. [ Google Scholar] [ CrossRef] De Jesús, V.R.; Adam, B.W.; Mandel, D.; Cuthbert, C.D.; Matern, D. Succinylacetone as primary marker to detect tyrosinemia type I in newborns and its measurement by newborn screening programs. Mol. Genet. Metab. 2014, 113, 67–75. [ Google Scholar] [ CrossRef] Allard, P.; Grenier, A.; Korson, M.S.; Zytkovicz, T.H. Newborn screening for hepatorenal tyrosinemia by tandem mass spectrometry: Analysis of succinylacetone extracted from dried blood spots. Clin. Biochem. 2004, 37, 1010–1015. [ Google Scholar] [ CrossRef] Turgeon, C.; Magera, M.J.; Allard, P.; Tortorelli, S.; Gavrilov, D.; Oglesbee, D.; Raymond, K.; Rinaldo, P.; Matern, D. Combined Newborn Screening for Succinylacetone, Amino Acids, and Acylcarnitines in Dried Blood Spots. Clin. Chem. 2008, 54, 657–664. [ Google Scholar] [ CrossRef] Chace, D.H.; Lim, T.; Hansen, C.R.; De Jesus, V.R.; Hannon, W.H. Improved MS/MS analysis of succinylacetone extracted from dried blood spots when combined with amino acids and acylcarnitine butyl esters. Clin. Chim. Acta 2009, 407, 6–9. [ Google Scholar] [ CrossRef] [ PubMed] la Marca, G.; Malvagia, S.; Pasquini, E.; Innocenti, M.; Fernandez, M.R.; Donati, M.A.; Zammarchi, E. The inclusion of succinylacetone as marker for tyrosinemia type I in expanded newborn screening programs. Rapid Commun. Mass Spectrom. 2008, 22, 812–818. [ Google Scholar] [ CrossRef] Centers for Disease Control and Prevention. Newborn Screening Quality Assurance Program. In 2022 Annual Summary Report; Centers for Disease Control and Prevention: Atlanta, GA, USA, 2023; Volume 40. [ Google Scholar] Kuypers, A.M.; Bouva, M.J.; Loeber, J.G.; Boelen, A.; Dekkers, E.; Petritis, K.; Pickens, C.A.; The ISNS Representatives; van Spronsen, F.J.; Heiner-Fokkema, M.R. Evaluation of Neonatal Screening Programs for Tyrosinemia Type 1 Worldwide. Int. J. Neonatal Screen. 2024, 10, 82. [ Google Scholar] [ CrossRef] [ PubMed] Lee, B.; Heo, W.Y.; Kim, J.A.; Lee, H.S.; Hwang, N.; Park, H.D.; Sung, S.I.; Chang, Y.S.; Park, W.S.; Lee, S.Y. Comprehensive Evaluation of the NeoBase 2 Non-derivatized MSMS Assay and Exploration of Analytes with Significantly Different Concentrations Between Term and Preterm Neonates. Ann. Lab. Med. 2023, 43, 153–166. [ Google Scholar] [ CrossRef] Cicalini, I.; Valentinuzzi, S.; Pieragostino, D.; Consalvo, A.; Zucchelli, M.; Donzelli, S.; Ambrogi, D.; Brown, H.A.; Calton, L.J.; Stuppia, L.; et al. Analytical Evaluation of the Ideal Strategy for High-Throughput Flow Injection Analysis by Tandem Mass Spectrometry in Routine Newborn Screening. Metabolites 2021, 11, 473. [ Google Scholar] [ CrossRef] [ PubMed] van Wegberg, A.M.J.; MacDonald, A.; Ahring, K.; Bélanger-Quintana, A.; Beblo, S.; Blau, N.; Bosch, O.M.; Burlina, A.; Campistol, J.; Coskun, T.; et al. European guidelines on diagnosis and treatment of phenylketonuria: First revision. Mol. Gen. Metab. 2025, 145, 109125. [ Google Scholar] [ CrossRef] Marrington, R.; Sinclair, G.; MacKenzie, F. The missing piece: Who is responsible for ensuring clinical chemistry assays used in the UK are fit for purpose? Ann. Clin. Biochem. 2026, 63, 141–151. [ Google Scholar] [ CrossRef] Illustration of de-tuning showing the relationship between signal intensity and collision energy for succinylacetone and methionine obtained whilst infusing a solution of succinylacetone SIL and methionine SIL (61 and 100 µmol/L respectively) on ( A) Sciex 6500; ( B) Waters TQD; ( C) Sciex 4500; ( D) Waters TQ-S micro; ( E) Waters Xevo TQD. The signal intensity from 155.1 > 109.10 represents the contribution from the background signal attributable to the methionine SIL (+2 isotopic contribution due to 34S). The x-axis represents collision energy (eV). The primary y-axis (log scale) represents the signal intensity of the 160.1 > 109.1 and 155.1 > 109.1 transitions, represented in blue and orange respectively. The secondary y-axis represents the ratio of the signal intensities (160.1 > 109.1 to 155.1 > 109.1) which are shown in red. The dashed vertical green line indicate the optimized collision energy (eV) for each instrument. Illustration of de-tuning showing the relationship between signal intensity and collision energy for succinylacetone and methionine obtained whilst infusing a solution of succinylacetone SIL and methionine SIL (61 and 100 µmol/L respectively) on ( A) Sciex 6500; ( B) Waters TQD; ( C) Sciex 4500; ( D) Waters TQ-S micro; ( E) Waters Xevo TQD. The signal intensity from 155.1 > 109.10 represents the contribution from the background signal attributable to the methionine SIL (+2 isotopic contribution due to 34S). The x-axis represents collision energy (eV). The primary y-axis (log scale) represents the signal intensity of the 160.1 > 109.1 and 155.1 > 109.1 transitions, represented in blue and orange respectively. The secondary y-axis represents the ratio of the signal intensities (160.1 > 109.1 to 155.1 > 109.1) which are shown in red. The dashed vertical green line indicate the optimized collision energy (eV) for each instrument. Percentage contribution (y axis) from methionine stable isotope label to 3-(5-methyl-1H-pyrazol-3-yl) propanoic acid (MPP), the marker analyte for endogenous succinylacetone, over sequential injections (x axis) of DBS specimens on two Waters Xevo TQDs: ( A) Instrument 11A; ( B) Instrument 11B. Percentage contribution (y axis) from methionine stable isotope label to 3-(5-methyl-1H-pyrazol-3-yl) propanoic acid (MPP), the marker analyte for endogenous succinylacetone, over sequential injections (x axis) of DBS specimens on two Waters Xevo TQDs: ( A) Instrument 11A; ( B) Instrument 11B. Comparison of LDT (purple) and NB2 (green) population distributions for ( A) phenylalanine (PHE); ( B) tyrosine (TYR); ( C) leucine (LEU); ( D) methionine (MET); ( E) isovalerylcarnitine (C5); ( F) glutarylcarnitine (C5DC); ( G) octanoylcarnitine (C8); ( H) decanoylcarnitine (C10) by laboratory. The dashed purple and dotted green horizontal lines represent the overall 90th centiles for LDT and NB2 respectively; the horizontal red line denotes the screening cut-off value (COV). Note that TYR and C10 are secondary markers; hence, no COV is denoted. Comparison of LDT (purple) and NB2 (green) population distributions for ( A) phenylalanine (PHE); ( B) tyrosine (TYR); ( C) leucine (LEU); ( D) methionine (MET); ( E) isovalerylcarnitine (C5); ( F) glutarylcarnitine (C5DC); ( G) octanoylcarnitine (C8); ( H) decanoylcarnitine (C10) by laboratory. The dashed purple and dotted green horizontal lines represent the overall 90th centiles for LDT and NB2 respectively; the horizontal red line denotes the screening cut-off value (COV). Note that TYR and C10 are secondary markers; hence, no COV is denoted. NB2 population distribution for succinylacetone by laboratory. The dotted horizontal line and the shaded band indicate the P90 for the overall data set and the MAD around the P90 respectively. The solid horizontal line denotes the screening COV. NB2 population distribution for succinylacetone by laboratory. The dotted horizontal line and the shaded band indicate the P90 for the overall data set and the MAD around the P90 respectively. The solid horizontal line denotes the screening COV. All laboratory mean imprecision by analyte determined from replicate measurements ( n = 25) of the three third-party internal quality control (IQC) materials. All laboratory mean imprecision by analyte determined from replicate measurements ( n = 25) of the three third-party internal quality control (IQC) materials. Analyte All Lab Mean (µmol/L) Mean Intra-Lab Imprecision (%CV) All Lab SD Inter-Laboratory Imprecision (%CV) Phenylalanine 76 8.8 4.2 5.5 146 7.6 8.6 5.9 294 8.7 15.4 5.2 Tyrosine 70 8.2 3.7 5.3 258 7.1 12.5 4.9 523 7.8 22.6 4.3 Leucine 218 9.6 8.5 3.9 303 8.7 13.8 4.6 511 10.1 22.3 4.4 Methionine 23 9.8 0.9 3.8 56 9.0 2.5 4.5 178 10.0 6.8 3.8 C5 carnitine 0.2 10.5 0.02 10.2 0.6 8.9 0.1 10.1 2.3 9.7 0.2 8.8 C5DC carnitine 0.1 12.1 0.01 18.8 0.3 9.5 0.04 14.4 1.4 9.7 0.2 14.9 C8 carnitine 0.2 10.4 0.01 8.7 0.6 8.5 0.1 8.2 2.3 9.1 0.2 6.7 C10 carnitine 0.2 10.8 0.01 9.1 0.5 9.9 0.04 8.6 2.1 11.1 0.2 7.4 Succinylacetone 0.41 20.8 0.09 20.8 2.3 11.6 0.2 9.6 5.3 12.2 0.4 8.3 Comparator statistics for Neobase™ 2 (NB2) ( n = 23,791) vs. laboratory-developed tests (LDT) ( n = 139,444) for the existing analytes. Comparator statistics for Neobase™ 2 (NB2) ( n = 23,791) vs. laboratory-developed tests (LDT) ( n = 139,444) for the existing analytes. % Bias to LDT Ratio of Between-Laboratory Standard Deviations Analyte (µmol/L) Median P90 MAD P90 P99 Period Centered P99 Phenylalanine −15.2 −15.1 0.80 0.80 0.89 0.22 Tyrosine −1.4 −2.2 0.78 0.62 0.91 0.19 Leucine −12.2 −12.1 1.11 0.75 0.84 0.97 Methionine −30.0 −29.2 0.88 0.74 0.84 1.14 C5 carnitine −13.2 −14.6 0.47 0.85 0.94 0.67 C5DC carnitine −39.6 −40.3 0.81 0.48 0.40 0.66 C8 carnitine −28.4 −28.8 0.33 0.27 0.32 0.28 C10 carnitine −24.0 −24.0 0.43 0.30 0.51 0.34 Summary of population data obtained using the LDTs ( n = 139,444) and NB2 ( n = 23,791), the latter denoted by grey shading. Summary of population data obtained using the LDTs ( n = 139,444) and NB2 ( n = 23,791), the latter denoted by grey shading. Analyte (µmol/L) P1 P10 Median Mean MAD P90 95% CI P90 P99 Separation Margin % (p99) Phenylalanine 36.8 44.1 54.6 55.9 6.2 69.2 68.5–69.6 88.8 63.1 31.2 37.4 46.3