目录

  • 1 基本理论与方法
    • 1.1 循证实践概述(Introduction)
      • 1.1.1 最佳证据的涵义
      • 1.1.2 证据分级的涵义
      • 1.1.3 证据分级标准
      • 1.1.4 临床经验的涵义
      • 1.1.5 患者价值观的涵义
    • 1.2 问题构建(Forming Questions )
      • 1.2.1 问题构建技巧
      • 1.2.2 临床问题构建案例
      • 1.2.3 PICO问题构建模式
    • 1.3 证据查找(Finding the Evidence)
      • 1.3.1 各类问题的证据类型选用
      • 1.3.2 利用6S模型查找最佳证据
      • 1.3.3 证据来源选择
      • 1.3.4 PubMed数据库检索
    • 1.4 证据评价(Critical Appraisal of Evidence)
      • 1.4.1 干预性研究证据的评价与使用
      • 1.4.2 诊断试验证据的评价与使用
      • 1.4.3 危害/病因研究证据的评价与使用
      • 1.4.4 预后研究证据的评价与使用
      • 1.4.5 系统评价证据的评价与使用
    • 1.5 临床问题循证实践
      • 1.5.1 危害问题循证实践(Harm Scenario)
      • 1.5.2 诊断问题循证实践(Diagnosis Scenario)
      • 1.5.3 治疗问题循证实践 (Single Trials) Scenario
      • 1.5.4 治疗问题循证实践(Systematic Review) Scenario
      • 1.5.5 预后问题循证实践(Prognosis Scenario)
    • 1.6 证据综合:Meta分析
    • 1.7 循证实践案例(Practice Case)
      • 1.7.1 案例2
      • 1.7.2 案例3
    • 1.8 拓展学习工具与资源(EBM Library)
      • 1.8.1 教育处方
      • 1.8.2 自我评价方法
      • 1.8.3 EBM相关计算工具
      • 1.8.4 术语
  • 2 实践探索
    • 2.1 循证护理
    • 2.2 循证全科医学
    • 2.3 循证补充替代医学
    • 2.4 循证外科
    • 2.5 循证老年医学
    • 2.6 循证采购
    • 2.7 循证新生儿学
    • 2.8 循证精神卫生
    • 2.9 循证危重症
    • 2.10 发展中国家的EBM
  • 3 课程课件
    • 3.1 循证医学概述
    • 3.2 问题构建
    • 3.3 证据分类分级
    • 3.4 证据来源与检索
    • 3.5 医学研究设计概述
    • 3.6 疾病的测量与分布
    • 3.7 病因与病因推断
    • 3.8 描述性研究
    • 3.9 队列研究
    • 3.10 病例对照研究
    • 3.11 实验性研究
    • 3.12 诊断性试验
    • 3.13 诊断问题循证实践
    • 3.14 预后研究
  • 4 教学视频
    • 4.1 循证医学总论
    • 4.2 问题的构建
    • 4.3 证据的检索
    • 4.4 患者价值观与意愿
    • 4.5 病因研究证据的评价与应用
    • 4.6 诊断性证据的评价与应用
    • 4.7 防治性证据的评价与应用
    • 4.8 预后研究证据的评价与应用
    • 4.9 患者安全
证据综合:Meta分析
课程目标与工具(Goals and Tools

At the end of this module, you will be able to:

  • Know the steps in performing a meta-analysis: identification, selection, abstraction, and analysis.

  • Distinguish between random and fixed effects models when reading a meta-analysis

  • Evaluate the validity of a meta-analysis

Later in this module, we will have you look at a simple meta-analysis, and critically appraise it.  Please have the following article available:

  • Fahey T, Stocks N, Thomas T. Quantitative systematic review of randomised controlled trials comparing antibiotic with placebo for acute cough in adults.  BMJ 1998; 316: 906-10.

It is also available online, where you can view it or download a printable PDF file.

Meta分析概述

The second, third, and fourth modules have helped you learn how to read articles about diagnosis, therapy, and prognosis respectively.   Now, it's time to learn about how to read an article about meta-analysis. While on the surface more complex and involved than most studies, by applying the principles learned in the first few modules you'll soon be comfortable reading and interpreting these studies.

One area of confusion is the distinction between review articles, systematic reviews, and meta-analyses. The diagram below illustrates their relationship using a Venn diagram:

relationship Venn diagram

Review articles (also called overviews) are the broadest category. Most review articles are unsystematic, because the author does not look at all of the evidence. A systematic review has a formal approach to gathering, evaluating, and presenting the evidence. Some systematic reviews are meta-analyses; a meta-analysis goes the final step by using formal statistical methods to calculate a summary result or results.

There are two major reasons to do a meta-analysis:

  1. To quantitatively combine the results of previous studies to arrive at a summary estimate

  2. As a "study of studies", to help guide further research and identify reasons for heterogeneity between studies

Meta-analyses of the first kind can help resolve medical controversies caused by conflicting studies, are an inexpensive alternative to very large randomized trials, and can in this way shape health policy. The second kind of meta-analysis is particularly useful for designing future studies, by systematically identifying key patient and study characteristics from previous work.

Meta-analyses were initially used in the social sciences in the mid-1970's, and were adapted to medical data sets in the early to mid-1980's. The number of meta-analyses is growing rapidly, based on the results of this Medline search for the keyword "meta-analysis":

Years

Number of meta-analyses

in Annals, JAMA, Lancet, NEJM

1971-75

0

1976-80

0

1981-85

1

1986-90

32

1991-95

175


Meta-analyses often form the initial step of a cost-effectiveness analysis, decision analysis, or grant application. Once you've done all of that background work, why not publish it?!?! While the results usually correspond to later randomized trials, they do not always (LeLorier, 1997):


Results of RCT

Results of meta-analysis

Positive

Negative

Positive

13

6

Negative

7

14


In the next section, you will learn the steps in a good meta-analysis.   Understanding these steps is important, even if you never plan to do one yourself, because it helps you understand how to critically appraise this type of study.

Meta分析的步骤

There are four basic steps to any good meta-analysis:

  1. Identification

  2. Selection

  3. Abstraction

  4. Analysis

We will discuss each of these steps below.

1. Identification

The first step in a meta-analysis is to find all of the pertinent articles on your topic. Important sources of information for a meta-analysis include:

  • MEDLINE

  • EMBASE

  • CancerLit, AIDSLine, and ToxLine

  • Index Medicus

While MEDLINE, the database of the National Library of Medicine, is a good starting point, it is not the only source of information. MEDLINE indexes approximately 4100 journals, dating from 1966 to the present. It also has an excellent feature called clinical queries. There are also CD-ROM based search engines from BRS Colleague, WinSPIRS, and others which offer different search options, but use essentially the same underlying database. The European version of MEDLINE is called EMBASE, and is a Dutch/English collaboration. Depending on the topic, it may be appropriate to search the more specialized National Library of Medicine databases, such as CancerLit, AIDSLine, and ToxLine.

The Cochrane Collaboration Controlled Trials Register, established in 1993, is an important source of studies for a meta-analysis. The Register includes abstracts of thousands of trials. It includes all controlled trials in the MEDLINE and EMBASE, as well as the results of hand searches by Cochrane Collaboration volunteers of thousands of journals not indexed by MEDLINE or EMBASE. MSU students, faculty and staff can best access the Cochrane through the MSU Library.

Remember Index Medicus? I'm old enough to recall the intimidating row of thick (and I mean THICK) books that were my only way to find medical research articles as a medical student in the 1980's. They can still be useful when it is important to search for articles published before 1966, when MEDLINE and the other electronic databases were established.

Finally, there are other sources of "fugitive literature" that may be important for the author of a meta-analysis (some of which may be found in the Cochrane Controlled Trials Register):

  • unpublished studies - you would have to contact the authors themselves

  • dissertations - there are national indexes of dissertations at university libraries

  • drug company studies - you may have to contact the company directly

  • non-indexed studies - remember to search the bibliographies and Cochrane

  • pre-MEDLINE (1966) - use Index Medicus

It's important to know that different search strategies can result in different results (Table 4.5, Petitti):


 

Topic

Cochrane

CTR

MEDLINE

(expert searcher)

MEDLINE

("amateur" searcher)

Neonatal hyperbilirubinemia

88

28

17

Intraventricular hemorrhage

29

19

11


Note:  CTR = Controlled Trials Register

If you are thinking about doing a meta-analysis of your own, it is important to enlist the aid of an expert Medline searcher such as a medical librarian. The above table also highlights the importance of using the Cochrane Controlled Trials Register.

2. Selection

Once the author of a meta-analysis has assembled a large number of studies, it is important to select the right ones! There are a variety of possible inclusion (also called eligibility) criteria:

  • Whether the study include enough information for analysis (i.e. standard deviation or standard error in addition to point estimate)

  • The study design (i.e. controlled trials only vs randomized controlled trials only, especially for studies of therapy)

  • The year of study, if technology or typical dosing changes (for example, only include studies since 1984 on dyspepsia if you're interested in helicobacter pylori)

  • The dosage used in the study (to assure that an effective dose was used)

  • The language of the article - you or a colleague have to be able to read it!

  • The minimum sample size - very small studies may be unrepresentative and/or not worth the effort

  • The patient age (adults only, > 60 only, etc)

  • The study setting (emergency department, outpatient, inpatient)

3. Abstraction

Once an appropriate group of studies has been identified, the author(s) have to abstract the relevant data from each study. There are many sources of potential error in data abstraction:

  • The article may be wrong due to typographical or copyediting errors

  • Tables can be misinterpreted

  • Errors can occur during you own data entry or abstraction process

A good meta-analysis will take some or all of the following steps to minimize errors:

  • Use 2 independent reviewers

  • Use a 3rd reviewer or consensus meeting to resolve conflicts

  • Train reviewers by practicing with several articles to "calibrate"

  • Compare abstract and text to look for inconsistencies

  • Use a standard form or database which constrains entries to the expected range

  • Report the results of the data abstraction, including the percentage concordance or even a kappa statistic

Bias can also creep into a meta-analysis. For example, the authors may be biased in favor of (or against!) well known researchers. Also, prominent journals may be given greater weight or authority (rightly or wrongly). It is therefore best (although not often done) to have identifiers eliminated from articles.

Finally, part of the data abstraction phase is an assessment of study quality. Chalmers has proposed a fairly complex set of criteria which apply well to randomized controlled trials. Simpler criteria may be sufficient. For example, in a diagnostic meta-analysis, simply assuring a high quality gold standard, independent assessment of reference and study tests, and blinding may be adequate. Too often, the quality assessment is done, then ignored!  Ideally, the results of the quality assessment should inform the analysis and interpretation of results.

4. Analysis

There are many issues and controversies in the analysis of meta-analytic data. First, let's define some important terms:

Homogeneity and heterogeneity describe the degree of between-study variability in a group of studies. It is probably appropriate to combine the results from a homogenous set of studies, but many would argue that results from heterogeneous studies should not be combined. The Q statistic, interpreted using a chi-square distribution, is often used as a test of homogeneity

Fixed effects models consider only within-study variability. The assumption is that studies use identical methods, patients, and measurements; that they should produce identical results; and that differences are only due to within-study variation. By using a fixed effects model, the researcher answers the question: "Did the treatment produce benefit on average in the studies at hand?" The Peto and Mantel-Haenszel odds ratios are both based on a fixed effects model.

Random effects models consider both between-study and within-study variability. The assumption is that studies are a random sample from the universe of all possible studies. With a random effects model, the researcher answers the question: "Will the treatment produce benefit ‘on average’?" The DerSimonian Laird statistic is based on a random effects model.

Fixed and random effects models can give very different answers, and you can create examples where either model gives counterintuitive results (see Petitti, page 92). Usually, though, the answers provided by these different modeling assumptions are similar. Differences only arise when studies are nothomogenous. In a comparison of 22 meta-analyses, fixed and random effects models gave the same answer in 19 out of 22. In 3 cases, fixed effects models were significant while random effects models were not (Berlin, 1989 in Petitti textbook, pg 94).

When there is significant heterogeneity, the between-study variance becomes much larger than the within, and studies of different sample size receive relatively similar weight. When there is homogeneity, sample size dominates, and both models give similar results. Random effects models are therefore more "conservative" and generate a wider confidence interval. Put another way, a random effects model is less likely to show a significant treatment effect than a fixed effects model.

In general, if the studies are homogenous, the researchers should use a fixed effects model. If the studies are heterogeneous, the researchers (and you, the reader) should first ask why! While it may be appropriate to do a random effects analysis on all of the studies, it may be better to identify an important subgroup difference (i.e. studies using one dose showed significant effect, while lower dose did not) and then do a fixed effects analysis of each and report all of the results.

A term you will encounter in many meta-analyses is "sensitivity analysis".   A sensitivity analysis is a way of looking at only certain studies, certain groups of patients, or certain interventions.  For example, a meta-analysis of aspirin in prevention of acute MI might first analyze all studies, but then also look separately at only studies of men and studies of women.

The article by Hasselblad is an excellent starting point for budding meta-analysts, with lots of examples and formulae.  Meta-analysis of diagnostic tests is another area of growing interest - how do you combine sensitivities, specificities, and so on. However, details of calculations for homogeneity, fixed effects models, and random effects models are beyond the scope of this course.

Advanced students and those with a special interest in this topic may wish to review the following sections:

Meta分析证据的评价

Now that we have reviewed the steps in a meta-analysis, we can develop a formal approach to their critical appraisal. As with other kinds of studies, there are three basic steps:

  1. Would the results change my practice if valid?

  2. Are the outcomes important to my patients?

  3. Are the results valid?

A study which meets all three of these criteria is a POEM (Patient-Oriented Evidence that Matters). The first two criteria have to do with relevance, while the third is concerned with validity.

1. Would the results change my practice if valid?

This question must be answered by the individual clinician. A meta-analysis which confirms existing practice need not be subjected to rigorous evaluation, since they results don't change what you are already doing. On the other hand, if the results would change what you do for your patients, it is incumbent on you to take the next step. 

2. Are the outcomes important to my patients?

Most meta-analyses use patient-oriented outcomes such as morbidity, mortality, and symptoms. However, if a study uses surrogate or intermediate outcomes such as FEV1 or hemoglobinA1C, the results should be interpreted cautiously. Proponents of the use of POEMs would argue that if a meta-analysis does not use patient-oriented outcomes, you don't have to proceed to the next step.

3. Are the results valid?

If a study would change practice, and uses patient-oriented outcomes, you are obligated to evaluate its validity. Key criteria include:

  • Did the authors ask a focused clinical question?

  • Were the criteria used to select articles for inclusion appropriate?

  • Is it unlikely that important, relevant studies were missed?

  • Was the validity of the included studies appraised (study quality)?

  • Were assessments of studies reproducible (data abstraction)?

  • Were the results similar from study to study (homogeneity)?

In the next section, we will go into more detail on the evaluation of validity.   For more on this approach to evaluating relevance, see the section on POEMs and DOEs.

Meta-Analysis真实性评价

In the previous section, we listed key criteria for the evaluation of a meta-analysis include.  Let's talk a little more about each one, then work through a sample study.

1. Did the authors ask a focused clinical question?

Better meta-analyses don't try to do it all.  It's a lot like developing a question for your own research;  consider the following questions as examples:

  • Bad question:  "What is the best way to treat bronchitis?"

  • Good question:  "Do antibiotics reduce symptoms in otherwise healthy adults with acute bronchitis?"

The second question is more focused regarding treatment (antibiotics), patient population (otherwise healthy adults), and diagnosis (acute bronchitis) than the first.

2. Were the criteria used to select articles for inclusion appropriate?

Inclusion criteria can include criteria involving the treatment, patient population, study design, and diagnosis.  Possible problems include:

  • poor description of treatment (did they lump together low and high dosages of a drug, some of which may be adequate and others may not?)

  • studies of poor quality included along with those of higher quality?

  • vague or variable description of condition being treated (for example, in studies of acute MI there are many ways to define MI)

3. Is it unlikely that important, relevant studies were missed?

Remember, just using MEDLINE may not be adequate, and a meta-analysis may be more convincing if it includes unpublished trials and studies from the Cochrane Controlled Trials Register which don't appear in MEDLINE.

4. Was the validity of the included studies appraised (study quality)?

Assessment of study quality means that the authors carefully read each study, and rated it on a number of quality measures.  Important aspects of quality for an RCT include:

  • was it blinded? 

  • was it placebo-controlled? 

  • was it randomized?

  • was follow-up complete?

In addition, do the authors of the meta-analysis use this information?  For example, do they drop studies which don't meet a certain minimum level of quality, or do they calculate results for high-quality studies separately from those for all studies or low-quality studies?

5. Were assessments of studies reproducible (data abstraction)?

There should be a description of how data were abstracted.   Ideally, at least two people should abstract each study, then compare results and have a formal mechanism for resolving conflicts.

6. Were the results similar from study to study (homogeneity)?

When study results are homogenous, using a Q statistic or chi-square test, it is much more likely that the meta-analysis reflects "the truth".  When studies are heterogeneous (i.e. some find benefit, some do not) the authors should be very cautious about combining the results, and if they do should use a random effects model.

Now, let's look at the meta-analysis we asked you to pull earlier with these questions in mind.  This article was selected because it is a topic of interest to all generalist physicians, and is fairly straightforward. 

Meta-Analysis结果使用

Most meta-analyses will initially give you a table of included studies.  In our study, that table is published on the Web only.  We see that most studies used doxycycline, sulfa/trimethoprim, or erythromycin.  This is a possible limitation - maybe the pharmaceutical reps are right, and Biaxin or Zithromax are really better, although I doubt it!

Let's look at the results now.  The three key figures are reproduced below.  In Figure 2, the first column has the study name;  the second column the proportion receiving antibiotic with productive cough at follow-up;   the third column the proportion receiving placebo with productive cough at follow-up.  Let's stop and think here.  If antibiotics work, patients receiving them should be less likely to have cough at follow-up.  That would make the relative risk of cough less than 1.0.  In the light blue "relative risk diagram" below, the vertical line indicates a relative risk of 1.0.  The horizontal dots and bars represent the relative risk and 95% confidence interval for each study.  The weight is proportional to the inverse of the variance of the study - high variance, associated with a small study, means less weight given to that study, and vice versa.   The relative risk is shown numberically in the final column of each table.   Finally, the summary estimate of relative risk with its confidence interval is shown in the last row, along with an estimate of homogeneity (the chi-square). 

Figure 2.  Productive cough at 7 to 11 days follow-up.

MetaAnalysisDiagram2.gif (25862 bytes)

Looking at this table, you see that the summary estimate of relative risk just barely includes one, suggesting that the association is NOT statistically significant.   Look at the next figure below:

Figure 3.  Failure to clinically improve at 7 to 11 days follow-up.

MetaAnalysisDiagram3.gif (24107 bytes)

In this case, the confidence interval of the summary relative risk also includes 1.0 (it is 0.36 - 1.09).

Figure 4.  Adverse drug effects.

MetaAnalysisDiagram4.gif (25378 bytes)

Finally, while there appear to be more adverse drug effects with antibiotic use, this association is also not statistically significant.

So, have we learned nothing?  No.  Even meta-analyses can suffer from Type II error, i.e. too small a sample size.  The total number of patients studied was only about 600 in the 6 studies, and it is interesting to note that while the results were not significant, they were consistent.  That is, 5 out of 6 studies fell either to the left or right of the vertical bar for each outcome.