Magnetic resonance elastography combined with fibrosis-4 index for diagnosing at-risk metabolic dysfunction-associated steatohepatitis: a systematic review and meta-analysis of diagnostic test accuracy studies

Konstantinos Malandrisa, Anastasia Katsoulab, Tarek Nayfehc, Kalliopi Tsapad, Dimitra Tsapad, Georgios Kalopitase, Aris Liakosa, Thomas Karagiannisa, Eleni Theocharidoua, Emmanouil Sinakosf, Georgios Germanidise, Apostolos Tsapasa,g

Aristotle University of Thessaloniki, Greece; Georgetown University, Baltimore, Maryland, USA; Harris Manchester College, University of Oxford, UK

aClinical Research and Evidence-Based Medicine Unit, Second Medical Department, Aristotle University of Thessaloniki, Greece (Konstantinos Malandris, Aris Liakos, Thomas Karagiannis, Eleni Theocharidou, Apostolos Tsapas); bSecond Propaedeutic Medical Department, Aristotle University of Thessaloniki, Greece (Anastasia Katsoula); cUnion Memorial Hospital, Georgetown University, Baltimore, Maryland, USA (Tarek Nayfeh); dSchool of Medicine, Faculty of Health Sciences, Aristotle University of Thessaloniki, Greece (Kalliopi Tsapa, Dimitra Tsapa); eFirst Medical Department, Aristotle University of Thessaloniki, Greece (Georgios Kalopitas, Georgios Germanidis); fFourth Medical Department, Aristotle University of Thessaloniki, Greece (Emmanouil Sinakos); gHarris Manchester College, University of Oxford, UK (Apostolos Tsapas)

Correspondence to: Konstantinos Malandris, MD, MSc, Clinical Research and Evidence-Based Medicine Unit, Second Medical Department, Aristotle University of Thessaloniki, Konstantinoupoleos 49, 54642 Thessaloniki, Greece, e-mail: kostas_malandris@yahoo.gr
Received 24 June 2025; accepted 16 September 2025; published online 10 October 2025
DOI: https://doi.org/10.20524/aog.2025.1010
© 2025 Hellenic Society of Gastroenterology

Abstract

Background Patients with metabolic dysfunction-associated steatohepatitis (MASH; nonalcoholic fatty liver disease activity score ≥4) and significant fibrosis (≥F2; at-risk MASH) are at increased risk for disease progression. Magnetic resonance elastography (MRE) combined with the fibrosis-4 (MEFIB) index enables the noninvasive diagnosis of at-risk MASH and significant fibrosis. We assessed the performance of the MEFIB index for ruling in/out both target conditions.

Methods We analyzed studies up to February 2025 assessing the performance of MEFIB index for ruling in (MRE≥3.3 kPa plus FIB-4≥1.6) and out (MRE<3.3 kPa plus FIB-4<1.6) at-risk MASH or significant fibrosis, using liver biopsy as the reference standard. We calculated pooled diagnostic accuracy estimates using bivariate random-effects models.

Results We included 7 studies with 3356 participants. For ruling in at-risk MASH, the MEFIB index yielded a pooled specificity of 0.94 (95% confidence interval [CI] 0.74-0.99), and a positive likelihood ratio (LRp) of 5.3 (95%CI 1.8-15.7). For ruling out at-risk MASH, the MEFIB index had a pooled sensitivity of 0.77 (95%CI 0.62-0.88) and a negative likelihood ratio (LRn) of 0.34 (95%CI 0.23-0.52). For ruling in significant fibrosis, the MEFIB index achieved a summary specificity of 0.93 (95%CI 0.85-0.97) with LRp 8.2 (95%CI 4.5-14.9). For excluding significant fibrosis, the pooled sensitivity and LRn of the MEFIB index were 0.88 (95%CI 0.79-0.94) and 0.16 (95%CI 0.08-0.31), respectively.

Conclusions MEFIB index has acceptable accuracy for diagnosing at-risk MASH and significant fibrosis. Proposed thresholds can be used to identify both target conditions in high prevalence settings and facilitate patient recruitment in clinical trials.

Keywords MEFIB index, metabolic dysfunction-associated steatohepatitis, fibrosis, systematic review, meta-analysis

Ann Gastroenterol 2025; 38 (6): 681-690


Introduction

Metabolic dysfunction-associated steatotic liver disease (MASLD) is characterized by the accumulation of hepatic fat in the presence of specific cardiometabolic risk factors, after the exclusion of secondary causes of liver steatosis [1,2]. Its progressive form, metabolic dysfunction-associated steatohepatitis (MASH), has emerged as the second most common indication for liver transplantation in the United States [3]. Individuals with MASH and significant fibrosis (F≥F2), referred to as “at-risk MASH”, are at increased risk for disease progression and liver-related mortality, constituting the target population for MASH clinical trials [4].

The requirement for specific histopathologic criteria to identify candidates for enrollment in MASH clinical trials raise significant challenges, one of which is the high rate of screening failure [4]. To mitigate this issue, and reduce the need for unnecessary liver biopsies, several noninvasive biomarkers have been proposed for the selection of potentially eligible participants [5]. Fibrosis-4 index (FIB-4) and vibration controlled transient elastography (VCTE) are the most validated biomarkers for the assessment of fibrosis, serving as initial steps of many recommended pathways [1,2,6]. However, their low positive predictive values (PPVs), attributed mainly to the low prevalence of MASLD with significant fibrosis, limit their ability to set the diagnosis [7].

Following the approval of resmetirom and semaglutide for MASLD, there is an even greater need to identify patients with at-risk MASH, ideally without requiring a liver biopsy [7]. In response, there has been a growing trend towards the development of sequential testing strategies that integrate serum-based and imaging-based indices [8]. Previous studies have shown that the combination of magnetic resonance elastography (MRE) and FIB-4 index, known as the MEFIB index, is superior to its individual components, and to the FibroScan-aspartate aminotransferase (FAST) score, for identifying candidates for MASH clinical trials [8,9]. We conducted a systematic review and meta-analysis to summarize, and critically appraise, findings from individual studies assessing the accuracy of the MEFIB index for diagnosing at-risk MASH and significant fibrosis.

Materials and methods

We conducted this systematic review and meta-analysis following a prespecified protocol registered in PROSPERO (CRD420251041430). Our methodology and results adhere to the Preferred Reporting Items for a Systematic Review and Meta-analysis of Diagnostic Test Accuracy Studies (PRISMA-DTA) guidelines (Supplementary Table 1) [10].

Eligibility criteria

We included cross-sectional studies assessing the accuracy of the MEFIB index for diagnosing at-risk MASH or significant fibrosis (fibrosis stage ≥F2) in adults with MASLD, using liver biopsy as the reference standard. At-risk MASH was defined as MASH with nonalcoholic fatty liver disease activity score (NAS) ≥4 and fibrosis stage ≥F2. For the MEFIB index we considered only the diagnostic thresholds recommended by the respective American and European guidelines [1,2] as follows: rule-in threshold: MRE≥3.3 kPa plus FIB-4≥1.6; and rule-out threshold: MRE<3.3 kPa plus FIB-4<1.6.

Two-gate diagnostic accuracy studies, studies lacking sufficient data to reconstruct 2×2 classification tables, and studies reporting diagnostic accuracy estimates for MEFIB index thresholds other than those prespecified were excluded [11].

Search strategy and study selection

We searched Medline, Cochrane library and Web of Science from inception to February 25th, 2025, without restrictions. We structured our search strategy using free text words and controlled vocabulary (Supplementary Tables 2-4). We used the Polyglot Search Translator to convert search strings across databases [12]. We did not search conference proceedings from relevant scientific meetings.

Search results were imported into reference manager software and duplicates were removed. The remaining records were then imported into the Covidence web application. Pairs of reviewers, working independently, assessed record eligibility, initially at title and abstract level and then in full text. Disagreements were resolved either through discussion between the original reviewers, or by a senior reviewer.

Data extraction and quality assessment

Two reviewers working independently extracted data from eligible studies using predesigned and pilot-tested forms. Data extraction items included study characteristics, participant characteristics and diagnostic accuracy results in terms of true positives (TP), false positives (FP), true negatives (TN) and false negatives (FN). If raw data for TP, FP, TN, FN were not available in the original studies, we computed them from the sample size, prevalence and other diagnostic accuracy measures using RevMan’s calculator. To identify overlapping cohorts among included studies, we took into consideration recruitment periods, participating centers and authors. In case of overlapping cohorts across publications, we prioritized results from the cohorts with the largest sample size, provided they reported sufficient information for 2×2 classification tables.

Two reviewers working independently assessed the risk of bias and applicability of the included studies using the Quality Assessment of Diagnostic Accuracy Studies-2 (QUADAS-2) tool [13]. Details on risk of bias and applicability judgements are presented in the Supplementary material. Disagreements during the data extraction and quality assessment process were resolved through discussion, or by a senior reviewer.

Data synthesis and statistical analysis

The primary outcomes of interest were the accuracy of the MEFIB index for ruling in at-risk MASH (NAS≥4 and fibrosis stage ≥F2), and significant fibrosis (F≥F2). Secondary outcomes of interest were the accuracy of the MEFIB index for ruling out at-risk MASH, and significant fibrosis.

For all outcomes we reconstructed 2×2 classification tables from eligible studies. Using respective data, we recalculated sensitivity and specificity estimates, with their 95% confidence intervals (CIs), and created coupled forest plots to visually present these estimates. In view of the homogeneity of thresholds for the index test among primary studies, we calculated pooled specificity, sensitivity, positive likelihood ratio (LRp) and negative likelihood ratio (LRn), using the bivariate random-effects model [14,15]. We graphically present individual and pooled study estimates in receiver operating characteristic (ROC) space alongside 95% confidence and prediction regions. We assessed heterogeneity through visual inspection of forest plots and the size of prediction regions [14]. Given the limited number of included studies, we did not investigate for potential sources of heterogeneity through meta-regression analysis [16]. We assessed for the presence of small-study effect bias by means of Deeks’ funnel plots, with P<0.10 for the slope coefficient indicating significant asymmetry [17]. We used Cook’s distance approach and standardized residuals to identify potentially influential studies (Supplementary material) [18]. We conducted prespecified sensitivity analyses, excluding influential studies identified using Cook’s distance approach, studies with unclear or high applicability concerns, and studies of retrospective design, because of potential bias related to the disease spectrum and the overestimation of diagnostic accuracy estimates [19]. All these analyses were conducted solely for the primary outcomes. To assess the clinical utility of the MEFIB index for ruling in at-risk MASH and significant fibrosis we used Fagan nomograms, assuming various Pretest probabilities reflecting both high and low prevalence settings. In addition, using the pooled estimates of sensitivity and specificity, we calculated PPVs and negative predictive values (NPVs) for all outcomes for the same prevalence scenarios. We performed all analyses using STATA statistical software v.11.2 and MetaDTA [20,21].

Results

After removing duplicates, we screened 682 records at title and abstract level, from which 30 full-text articles were assessed for eligibility. Eventually, 7 studies with 3356 participants were included in the systematic review and meta-analysis (Fig. 1) [8,22-27].

thumblarge

Figure 1 Flow diagram of the study selection process

Study and participant characteristics

Table 1 presents the main characteristics of the included studies and participants. Most studies were multicenter, following a prospective design, recruiting mainly participants from tertiary healthcare facilities. One study included participants from a low prevalence setting (those referred for routine colorectal cancer screening) [24]. Two studies were identified solely as conference abstracts [24,27]. The study by Loomba et al provided the largest amount of data, comprising nearly 2000 participants who were screened for enrollment in the MAESTRO-MASH clinical trial [27]. The mean age of participants ranged from 39.0 to 65.0 years. Among the 3356 participants, almost half (46%) were males and 55.8% (1,872 participants) had type 2 diabetes. The average mean body mass index (BMI) was 30.3 kg/m2, with a trend towards lower values for Asian cohorts (27.8 kg/m2). The mean aspartate transaminase (AST) and alanine transaminase (ALT) values ranged from 36.6-56.6 IU/L and from 50.6-84.0 IU/L, respectively. The average mean FIB-4 index was 1.75, ranging from 0.98-2.80. Similarly, the average mean ΜRE value was 3.6 kPa, ranging from 2.7-5.1 kPa. Among studies with available data, the prevalence of at-risk MASH was 31.3% (393 of 1255 participants), while the prevalence of significant fibrosis was 60.1% (1,916 of 3186 participants).

Table 1 Baseline characteristics of included studies

thumblarge

Risk of bias assessment and applicability

Three studies were at unclear or high risk for bias, because of concerns related to patient selection [23,24,26]. One study raised applicability concerns due to the low prevalence setting from which participants were recruited (during referral for colon cancer screening) [24]. A detailed presentation of risk of bias and applicability assessment is presented in Supplementary Table 5.

Accuracy of MEFIB index for ruling in/out at-risk MASH

Five studies with 1255 participants contributed data to this analysis [8,22-25]. The study by Kim et al included 2 different cohorts (USCD cohort and Yokohama cohort), which were handled separately to facilitate analysis [8]. Fig. 2 presents individual study estimates for ruling in at-risk MASH. Sensitivity and specificity estimates across studies ranged from 0.05-0.64 and from 0.63-1.00, respectively. MEFIB index (MRE≥3.3 kPa plus FIB-4≥1.6) yielded a pooled sensitivity of 0.34 (95%CI 0.18-0.55), specificity 0.94 (0.74-0.99), LRp 5.3 (95%CI 1.8-15.7) and LRn 0.71 (95%CI 0.57-0.88).

Figure 2 Coupled forest plot of sensitivity and specificity of MEFIB index for ruling in at-risk MASH

MEFIB index, magnetic resonance elastography combined with the fibrosis-4 index; MASH, metabolic dysfunction-associated steatohepatitis; CI, confidence interval

thumblarge

For ruling out at-risk MASH, individual study estimates for sensitivity and specificity ranged from 0.45-0.93 and from 0.43-0.90, respectively (Supplementary Fig. 1). The MEFIB index (MRE<3.3 kPa plus FIB-4<1.6) yielded a pooled sensitivity of 0.77 (95%CI 0.62-0.88), specificity 0.66 (95%CI 0.49-0.80), LRp 2.3 (95%CI 1.6-3.2), and LRn 0.34 (95%CI 0.23-0.52).

Accuracy of MEFIB index for ruling in/out significant fibrosis

Fig. 3 presents individual study estimates for ruling in significant fibrosis. Sensitivity and specificity estimates across studies ranged from 0.33-0.88 and from 0.68-0.98, respectively. Based on aggregated data from 4 studies with 2909 participants [8,22,26,27], the MEFIB index (MRE≥3.3 kPa plus FIB-4≥1.6) yielded a pooled sensitivity of 0.56 (95%CI 0.34-0.76), specificity 0.93 (95%CI 0.85-0.97), LRp 8.2 (95%CI 4.5-14.9), and LRn 0.47 (95%CI 0.30-0.75) for ruling in significant fibrosis.

Figure 3 Coupled forest plot of sensitivity and specificity of the MEFIB index for ruling in significant fibrosis

MEFIB index, magnetic resonance elastography combined with the fibrosis-4 index; CI, confidence interval

thumblarge

Two studies (808 participants) provided diagnostic accuracy estimates of MEFIB index for ruling out significant fibrosis [8,22]. Individual estimates for sensitivity and specificity ranged from 0.79-0.94 and from 0.66-0.78, respectively (Supplementary Fig. 2). MEFIB index (MRE<3.3 kPa plus FIB-4<1.6) yielded a pooled sensitivity of 0.88 (95%CI 0.79-0.94), specificity 0.73 (95%CI 0.67-0.79), LRp 3.3 (95%CI 2.5-4.3), and LRn 0.16 (95%CI 0.08-0.31) for ruling out significant fibrosis.

Additional analysis

Visual inspection of the forest plots and the size of the prediction regions indicated substantial heterogeneity across all outcomes (Supplementary Fig. 3). To explore for potential sources of heterogeneity for the primary outcomes, we conducted several sensitivity analyses, with results presented in Supplementary Table 6. Specifically, we assessed the impact of excluding studies that: (i) exclusively recruited participants with type 2 diabetes (T2D); (ii) raised applicability concerns; (iii) were conducted retrospectively; and (iv) were deemed influential based on Cook’s distance approach and standardized residuals. Across all sensitivity analyses, results remained consistent with our main findings, with specificity estimates exceeding 90% for both primary outcomes. Notably, among the 3356 participants included in our analyses, 2166 were from 2 studies reported as conference abstracts [24,27]. A sensitivity analysis excluding these studies yielded results consistent with the main analyses, with specificity estimates of 0.93 for both primary outcomes. Only 1 study recruited patients with T2D exclusively [22]. This study reported specificity estimates of 0.85 and 0.92 for ruling in at risk-MASH and significant fibrosis respectively. In post hoc analyses by cohort region, Asian cohorts yielded pooled specificity estimates of 0.87 and 0.92 for at-risk MASH and significant fibrosis, respectively. The respective estimates from Western cohorts were similar at 0.97 and 0.93. In sensitivity analyses including only studies at low risk of bias for all QUADAS domains, pooled specificity estimates for ruling in at-risk MASH and significant fibrosis were 0.82 and 0.96, respectively. The study by Noureddin et al was influential for ruling in at-risk MASH, yielding the highest specificity estimate (Supplementary Fig. 4, 5). This study was identified solely as a conference abstract, limiting detailed assessment of baseline characteristics. However, it is worth mentioning that this study had the lowest prevalence of at-risk MASH among the included studies (11.8%).

Based on Deeks’ funnel plots (Supplementary Fig. 6, 7) there was evidence of small study effect bias for ruling in at risk MASH (P=0.02). However, the number of included studies was limited, thus limiting the reliability of the respective analysis.

Clinical utility

Assuming a prevalence of 10-50%, the probability of having at-risk MASH following a positive test was 37-84%, respectively (Supplementary Fig. 8). For a prevalence of 60-80%, the probability for at-risk MASH increased, ranging from 89-96% respectively. For ruling in significant fibrosis, and for a prevalence setting ranging from 10-50%, the post-test probability after a positive test result ranged from 48-89%, respectively (Supplementary Fig. 9). For higher prevalence settings (60-80%), respective post-test probabilities for having significant fibrosis ranged from 92-97%. Table 2 presents PPVs and NPVs of the MEFIB index for all outcomes and for the same prevalence scenarios.

Table 2 Positive and negative predictive values for all outcomes across different prevalence scenarios

thumblarge

Discussion

In this systematic review and meta-analysis, we evaluated the accuracy of the MEFIB index for identifying at-risk MASH and significant fibrosis in adults with MASLD, using biopsy as the reference standard. We limited our analysis to the thresholds recommended by relevant societies: MRE≥3.3 kPa plus FIB-4≥1.6 to rule in the target conditions, and MRE<3.3 kPa plus FIB-4<1.6 to rule them out. Given that the MEFIB index was developed to address the low PPVs of existing noninvasive tests, and to facilitate participant selection for clinical trials, our analysis primarily focused on assessing its accuracy in diagnosing at-risk MASH and significant fibrosis.

Based on our findings, the MEFIB index demonstrated robust performance in identifying both target conditions. For ruling in at-risk MASH, MEFIB index achieved a pooled specificity of 0.94 and an LRp of 5.3. Similarly, for ruling in significant fibrosis, the index yielded a summary specificity of 0.93 and an LRp of 8.2. In a prevalence setting of 60%, the MEFIB index resulted in a PPV exceeding 90% for significant fibrosis and 89% for at-risk MASH. For ruling out the target conditions, the MEFIB index yielded pooled sensitivity estimates of 0.77 for at-risk MASH and 0.88 for significant fibrosis.

Our systematic review and meta-analysis provides a timely placed synthesis of evidence concerning the diagnostic performance of the MEFIB index. Using robust methodology, in line with Cochrane recommendations, we searched several databases and included 7 studies with more than 3000 participants. Our clinically focused results employed the dual cutoff approach, using the most widely used MEFIB index thresholds for ruling in or ruling out at-risk MASH and significant fibrosis. By focusing on specific MEFIB index positivity thresholds, we were able to provide summary estimates of sensitivity and specificity—metrics that offer greater clinical utility than the less informative area under the ROC curve (AUROC). Furthermore, for at-risk MASH, we employed the definition most commonly used for patient selection in MASH clinical trials. This choice was made to maximize the external validity and translatability of our findings to clinical trial settings and real-world practice.

Certain limitations must be acknowledged. Visual inspection of forest plots and the size of prediction regions indicated high heterogeneity for all outcomes of interest. Given the limited number of studies included in our meta-analysis (7 studies), we were unable to assess for potential sources of heterogeneity through meta-regression analysis [16]. Nevertheless, several exploratory sensitivity analyses were conducted, with results consistent with our main findings. Sparse reporting of relevant data prevented us from performing subgroup analyses based on specific factors previously suggested to influence the diagnostic performance of newly developed noninvasive biomarkers, including T2D, BMI, and age (≥65 years). Additionally, most included studies were at unclear or high risk for bias, primarily due to patient selection concerns. This was mainly attributable to the retrospective design of the studies and the possibility of convenience sampling, or suboptimal reporting of enrolment procedures.

To our knowledge, this is the first meta-analysis evaluating the diagnostic performance of the MEFIB index for identifying or excluding at-risk MASH and significant fibrosis. For ruling in significant fibrosis, our findings (specificity: 0.93) closely align with the results reported by Kim et al [8]. In their study, Kim et al combined 2 geographically distinct cohorts—a testing cohort from the USA (UCSD) (specificity: 0.98) and a validation cohort from Japan (Yokohama) (specificity: 0.94). For ruling in at-risk MASH, our pooled specificity estimates significantly differed from the combined estimates reported by Kim et al (0.94 vs. 0.77). Notably, our specificity estimates closely match that reported by the UCSD cohort alone (0.94 vs. 0.91), while the primary discrepancy arises from the Yokohama cohort, which reported a considerably lower specificity of 0.63. Although the Yokohama cohort had a lower mean BMI compared to our study population (27.9 vs. 30.3 kg/m2), we do not consider this difference in BMI as the main reason for the observed discrepancy. Emerging evidence from well-conducted individual patient data meta-analyses suggests that BMI does not substantially confound MRE metrics in MASLD [28], thus highlighting the need for further validation of the MEFIB index in other cohorts.

Recently, the MRI-AST (MAST) score, combining MRI-proton density fat fraction (PDFF), MRE, and AST levels, was introduced for diagnosing at-risk MASH [29]. When comparing MEFIB and MAST directly, MEFIB appears superior based on AUROC comparisons; however, the MAST score has the advantage of yielding a lower percentage of unclassified participants (gray zone) [8]. Specifically, the pooled prevalence of gray zone results for MEFIB index in our analysis was 26.5%, whereas the respective reported prevalence for the MAST score is 18.1% [29]. The FAST score is another noninvasive biomarker that was recently developed in order to facilitate patient selection for clinical trials [30]. Published meta-analyses report a FAST score specificity of around 0.90 for ruling in at risk MASH, with a PPV of 87% for a prevalence of 60% [31,32]. Nevertheless, results from comparative diagnostic accuracy studies support the superiority of the MEFIB index over the FAST score in terms of AUROC comparison (0.76 vs. 0.68), with similar gray zone magnitudes (26.1% vs. 30.8%) [8]. On the other hand, the FAST score offers the advantages of lower cost and easier applicability compared to an MRI examination. A structured comparison between FAST, MEFIB and MAST score is presented in Supplementary Table 7.

Early identification of at-risk MASH or significant fibrosis is important for timely initiation of appropriate pharmacotherapy, intensification of comorbidity management and close monitoring for disease progression. With a pooled specificity of 0.94, the MEFIB index accurately classifies nearly 9 of 10 patients with at-risk MASH. Similarly, with a summary specificity of 0.93, MEFIB reliably identifies approximately 9 of 10 patients without significant fibrosis, yielding roughly 1 false positive per 10 patients tested. In addition, a positive MEFIB result indicates that patients are approximately 5 times more likely to have at-risk MASH (LRp 5.3) and nearly 8 times more likely to have significant fibrosis (LRp 8.2) compared to those testing negative. As a result, it seems that MEFIB performs better for diagnosing significant fibrosis compared to at-risk MASH. This might be related to the fact that both MEFIB components mainly target fibrosis rather than other histological features of MASH, such as steatosis, inflammation and ballooning.

It should be noted that a substantial proportion of patients initially classified within the low or indeterminate risk categories based on FIB-4 scores have subsequently been identified as having clinically significant fibrosis [33]. As a result, a low FIB-4 during MEFIB should be followed by further examination and diagnostic evaluation in the presence of clinical uncertainty. Noureddin et al provide an example of such a case, where a 50-year-old patient with MASLD had AST 45 U/L, ALT 60 U/L, platelet count 270×109/L, MRI-PDFF 15%, MRE 4 kPa, controlled attenuation parameter 345 dB/m, and VCTE 12 kPa [34]. This patient would have a FIB-4 of score of 1.08, while his FAST and MAST scores suggest the presence of at-risk MASH [34]. As a result, MEFIB, MAST and FAST should not be considered as competing candidates, rather as useful tools in the holistic evaluation of a patient with MASLD.

Similarly to other scores utilizing a dual cutoff approach, the MEFIB index suffers the limitation of gray zone results (26% of participants). Assessment of these patients should be done by taking into account proximity to thresholds, patient characteristics, and additional testing by means of other noninvasive scores, before liver biopsy. Notably, a recently published meta-analysis of individual participant data found that a positive MEFIB index had a strong association with liver-related outcomes, hepatocellular carcinoma and death, and a high NPV of 99% for hepatic decompensation at 5 years [35,36].

Limitations in the diagnostic accuracy, availability and cost of current noninvasive tests have led to recommendations advocating for their sequential application. This strategy typically begins with tests that are widely accessible and easy to apply, followed by more specialized ones [37]. Although various combinations of tests may be employed, the underlying principle remains the same: increasing the prevalence of the target condition within the tested population to enhance the PPV of the subsequent test.

In conclusion, the MEFIB index has acceptable accuracy for diagnosing at-risk MASH and significant fibrosis. The proposed thresholds can be used to identify both target conditions in high prevalence settings, and to facilitate patient recruitment in clinical trials.

Summary Box

What is already known:

  • Patients with metabolic dysfunction-associated steatohepatitis ([MASH], nonalcoholic fatty liver disease activity score ≥4) and significant fibrosis (≥F2) (at-risk MASH) are at increased risk for disease progression

  • Magnetic resonance elastography (MRE) combined with the fibrosis-4 index (MEFIB index) enables the noninvasive diagnosis of at-risk MASH and significant fibrosis

  • The MEFIB index was originally developed to address the low positive predictive values (PPVs) of existing noninvasive tests, and to facilitate participant selection for clinical trials

What the new findings are:


  • For ruling in at-risk MASH, the MEFIB index achieved a pooled specificity of 0.94 and a positive likelihood ratio (LRp) of 5.3

  • For ruling in significant fibrosis, the index yielded a summary specificity of 0.93 and an LRp of 8.2

  • In a prevalence setting of 60%, the MEFIB index resulted in a PPV exceeding 90% for significant fibrosis and 89% for at-risk MASH

  • The MEFIB index can be used to identify both target conditions in high prevalence settings, and to facilitate patient recruitment in clinical trials

References

1. Rinella ME, Neuschwander-Tetri BA, Siddiqui MS, et al. AASLD Practice Guidance on the clinical assessment and management of nonalcoholic fatty liver disease. Hepatology 2023;77:1797-1835.

2. European Association for the Study of Obesity (EASO). EASL-EASD-EASO Clinical Practice Guidelines on the management of metabolic dysfunction-associated steatotic liver disease (MASLD). J Hepatol 2024;81:492-542.

3. Younossi ZM, Stepanova M, Ong J, et al. Nonalcoholic steatohepatitis is the most rapidly increasing indication for liver transplantation in the United States. Clin Gastroenterol Hepatol 2021;19:580-589.

4. Harrison SA, Allen AM, Dubourg J, Noureddin M, Alkhouri N. Challenges and opportunities in NASH drug development. Nat Med 2023;29:562-573.

5. ZoncapèM, Liguori A, Tsochatzis EA. Non-invasive testing and risk-stratification in patients with MASLD. Eur J Intern Med 2024;122:11-19.

6. Younossi ZM, Zelber-Sagi S, Lazarus JV, et al. Global consensus recommendations for metabolic dysfunction-associated steatotic liver disease and steatohepatitis. Gastroenterology 2025;11:S0016-5085(25)00632-8.

7. Brennan PN, Kopka CJ, Agirre-Garrido L, et al. Reviewing MAESTRO-NASH and the implications for hepatology and health systems in implementation/accessibility of Resmetirom. NPJ Gut Liver 2025;2:3.

8. Kim BK, Tamaki N, Imajo K, et al. Head-to-head comparison between MEFIB, MAST, and FAST for detecting stage 2 fibrosis or higher among patients with NAFLD. J Hepatol 2022;77:1482-1490.

9. Jung J, Loomba RR, Imajo K, et al. MRE combined with FIB-4 (MEFIB) index in detection of candidates for pharmacological treatment of NASH-related fibrosis. Gut 2021;70:1946-1953.

10. McInnes MDF, Moher D, Thombs BD, et al;and the PRISMA-DTA Group. Preferred Reporting Items for a Systematic review and Meta-Analysis of diagnostic test accuracy studies:the PRISMA-DTA statement. JAMA 2018;319:388-396.

11. Bossuyt PM. Chapter 3:Understanding the design of test accuracy studies. In:Deeks JJ, Bossuyt PM, Leeflang MM, Takwoingi Y (editors). Cochrane Handbook for Systematic Reviews of Diagnostic Test Accuracy. Version 2.0 (updated July 2023). Cochrane, 2023. Available from:https://training.cochrane.org/handbook-diagnostic-test-accuracy/current [Accessed 23 September 2025].

12. Clark JM, Sanders S, Carter M, et al. Improving the translation of search strategies using the Polyglot Search Translator:a randomized controlled trial. J Med Libr Assoc 2020;108:195-207.

13. Whiting PF, Rutjes AW, Westwood ME, et al;QUADAS-2 Group. QUADAS-2:a revised tool for the quality assessment of diagnostic accuracy studies. Ann Intern Med 2011;155:529-536.

14. Macaskill P, Takwoingi Y, Deeks JJ, Gatsonis C. Chapter 9:Understanding meta-analysis. In:Deeks JJ, Bossuyt PM, Leeflang MM, Takwoingi Y (editors). Cochrane Handbook for Systematic Reviews of Diagnostic Test Accuracy. Version 2.0 (updated July 2023). Cochrane, 2023. Available from:https://training.cochrane.org/handbook-diagnostic-test-accuracy/current [Accessed 23 September 2025].

15. Takwoingi Y, Dendukuri N, Schiller I, Rücker G, Jones HE, Partlett C, Macaskill P. Chapter 10:Undertaking meta-analysis. In:Deeks JJ, Bossuyt PM, Leeflang MM, Takwoingi Y (editors). Cochrane Handbook for Systematic Reviews of Diagnostic Test Accuracy. Version 2.0 (updated July 2023). Cochrane, 2023. Available from:https://training.cochrane.org/handbook-diagnostic-test-accuracy/current [Accessed 23 September 2025].

16. Deeks JJ, Higgins JPT, Altman DG, McKenzie JE, Veroniki AA (editors). Chapter 10:Chapter 10:Analysing data and undertaking meta-analyses [last updated November 2024]. In:Higgins JPT, Thomas J, Chandler J, Cumpston M, Li T, Page MJ, Welch VA (editors). Cochrane Handbook for Systematic Reviews of Interventions version 6.5. Cochrane, 2024. Available from:https://www.training.cochrane.org/handbook [Accessed 23 September 2025].

17. Deeks JJ, Macaskill P, Irwig L. The performance of tests of publication bias and other sample size effects in systematic reviews of diagnostic test accuracy was assessed. J Clin Epidemiol 2005;58:882-893.

18. Skrondal A, Rabe-Hesketh, S. Generalized latent variable modeling:multilevel, longitudinal, and structural equation models (1st ed.). Chapman and Hall/CRC, 2004. doi:10.1201/9780203489437

19. Rutjes AW, Reitsma JB, Di Nisio M, Smidt N, van Rijn JC, Bossuyt PM. Evidence of bias and variation in diagnostic accuracy studies. CMAJ 2006;174:469-476.

20. Freeman SC, Kerby CR, Patel A, Cooper NJ, Quinn T, Sutton AJ. Development of an interactive web-based tool to conduct and interrogate meta-analysis of diagnostic test accuracy studies:MetaDTA. BMC Med Res Methodol 2019;19:81.

21. Patel A, Cooper N, Freeman S, Sutton A. Graphical enhancements to summary receiver operating characteristic plots to facilitate the analysis and reporting of meta-analysis of diagnostic test accuracy data. Res Synth Methods 2021;12:34-44.

22. Castera L, Garteiser P, Laouenan C, et al;QUID NASH investigators. Prospective head-to-head comparison of non-invasive scores for diagnosis of fibrotic MASH in patients with type 2 diabetes. J Hepatol 2024;81:195-206.

23. Qi S, Wei X, Zhao J, et al. Performance of MAST, FAST, and MEFIB in predicting metabolic dysfunction-associated steatohepatitis. J Gastroenterol Hepatol 2024;39:1656-1662.

24. Noureddin M, Alkhouri N, Chaldaureille C, et al. Head-to-head comparison of FAST, MAST, MEFIB, and cT1 in identifying at-risk NASH patients in a low-prevalence population. Hepatology 2023;78:S806-S810.

25. Imajo K, Saigusa Y, Kobayashi T, et al. M-PAST score is better than MAST score for the diagnosis of active fibrotic nonalcoholic steatohepatitis. Hepatol Res 2023;53:844-856.

26. Inada K, Tamaki N, Kurosaki M, et al. Validation of magnetic resonance elastography plus fibrosis-4 for significant fibrosis in nonalcoholic fatty liver disease. J Gastroenterol Hepatol 2022;37:1726-1731.

27. Loomba R, Harrison SA, Taub RA, et al. 102 |Utility of FIB-4, MRE, MRI-PDFF, and FibroScan to identify patients with at-risk F2-F3 NASH based on screening data from a 2000 patient biopsy confirmed cohort of resmetirom phase 3 clinical trial, MAESTRO-NASH. Hepatology (Baltimore, Md) 2022;76 (Suppl 1):S92-S94.

28. Liang JX, Ampuero J, Niu H, et al;LITMUS Consortium Investigators. An individual patient data meta-analysis to determine cut-offs for and confounders of NAFLD-fibrosis staging with magnetic resonance elastography. J Hepatol 2023;79:592-604.

29. Noureddin M, Truong E, Gornbein JA, et al. MRI-based (MAST) score accurately identifies patients with NASH and significant fibrosis. J Hepatol 2022;76:781-787.

30. Newsome PN, Sasso M, Deeks JJ, et al. FibroScan-AST (FAST) score for the non-invasive identification of patients with non-alcoholic steatohepatitis with significant activity and fibrosis:a prospective derivation and global validation study. Lancet Gastroenterol Hepatol 2020;5:362-373.

31. Malandris K, Arampidis D, Mainou M, et al. FibroScan-AST score for diagnosing fibrotic MASH:a systematic review and meta-analysis of diagnostic test accuracy studies. J Gastroenterol Hepatol 2024;39:2582-2591.

32. Ravaioli F, Dajti E, Mantovani A, Newsome PN, Targher G, Colecchia A. Diagnostic accuracy of FibroScan-AST (FAST) score for the non-invasive identification of patients with fibrotic non-alcoholic steatohepatitis:a systematic review and meta-analysis. Gut 2023;72:1399-1409.

33. Hagström H, Talbäck M, Andreasson A, Walldius G, Hammar N. Repeated FIB-4 measurements can help identify individuals at risk of severe liver disease. J Hepatol 2020;73:1023-1029.

34. Noureddin M, Harrison SA, Alkhouri N. MEFIB vs. MAST and FAST:Not a competition but useful tools. J Hepatol 2024;80:e35-e36.

35. Tincopa MA, Loomba R. Non-invasive diagnosis and monitoring of non-alcoholic fatty liver disease and non-alcoholic steatohepatitis. Lancet Gastroenterol Hepatol 2023;8:660-670.

36. Ajmera V, Kim BK, Yang K, et al. Liver stiffness on magnetic resonance elastography and the MEFIB index and liver-related outcomes in nonalcoholic fatty liver disease:a systematic review and meta-analysis of individual participants. Gastroenterology 2022;163:1079-1089.

37. tsochatzis ea. screening for liver fibrosis—sequential non-invasive testing works best. J Hepatol 2023;79:263-265.

Notes

Conflict of Interest: None