database individually, including ALL your search terms, any
MeSH or other subject headings, truncation (like hemipleg ),
and/or wildcards (like sul ur). Apply all your limits (such as
years of search, English language only, and so on). Once all
search terms have been combined and you have applied all
relevant limits, you should have a final number of records or
articles for each database. Enter this information in the top
left box of the PRISMA flow chart. You should add the total
number of combined results from all databases (including
duplicates) after the equal sign where it says .
Many researchers also add notations in the box for the number
of results from each database search, for example, Pubmed
(n=335), Embase (n= 600), and so on. If you search trial
registers, such as , , , or others,
you should enter that number after the equal sign in .
NOTE:Some citation managers automatically remove duplicates
with each file you import. Be sure to capture the number of articles
from your database searches before any duplicates are removed.
To avoid reviewing duplicate articles,
you need to remove any articles that appear more than once in your
results. You may want to export the entire list of articles from each
database to a citation manager such as EndNote, Sciwheel, Zotero,
or Mendeley (including both citation and abstract in your file) and
remove the duplicates there. If you are using Covidence for your
review, you should also add the duplicate articles identified in
Covidence to the citation manager number. Enter the number of
records removed as duplicates in the second box on your PRISMA
template. If you are using automation tools to help evaluate the
relevance of citations in your results, you would also enter that
number here.
If you are using Covidence to screen your articles, you can
copy the numbers from the PRISMA diagram in your Covidence
review into the boxes mentioned below. Covidence does not include
the number of results from each database, so you will need to keep
track of that number yourself.
The final step is to subtract the number
of records excluded during the review of full-texts (Step 9)
from the total number of full-texts reviewed (Step 8). Enter
this number in the box labeled "Studies included in review,"
combining numbers with your grey literature search results in this
box if needed.
You have now completed your PRISMA flow diagram, unless you
have also performed searches in non-database sources or are
performing a search update. If so, complete those portions of the template as well.
Step 1: Preparation Download the flow diagram template version 1 PRISMA 2020 flow diagram for new systematic reviews which included searches of databases, registers and other sources or the version 2 PRISMA 2020 flow diagram for updated systematic reviews which included searches of databases, registers and other sources .
If you have identified articles through other sources than databases (such as manual searches through reference lists of articles you have found or search engines like Google Scholar), enter the total number of records from each source type in the box on the top right of the flow diagram. | |
This should be the total number of reports you obtain from each grey literature source. | |
List the number of documents for which you are unable to find the full text. Remember to use Find@UNC and to request items to see if we can order them from other libraries before automatically excluding them. | |
This should be the number of grey literature reports sought for retrieval (Step 2) minus the number of reports not retrieved (Step 3). Review the full text for these items to assess their eligibility for inclusion in your systematic review. | |
After reviewing all items in the full-text screening stage for eligibility, enter the total number of articles you exclude in the box titled "Reports Excluded," and then list your reasons for excluding the item as well as the number of items excluded for each reason. Examples include wrong setting, wrong patient population, wrong intervention, wrong dosage, etc. You should only count an excluded item once in your list even if if meets multiple exclusion criteria. | |
The final step is to subtract the number of excluded articles or records during the eligibility review of full-texts from the total number of articles reviewed for eligibility. Enter this number in the box labeled "Studies included in review," combining numbers with your database search results in this box if needed. You have now completed your PRISMA flow diagram, which you can now include in the results section of your article or assignment. |
Step 1: Preparation Download the flow diagram template version 2 PRISMA 2020 flow diagram for updated systematic reviews which included searches of databases and registers only or the version 2 PRISMA 2020 flow diagram for updated systematic reviews which included searches of databases, registers and other sources .
In the Previous
| |
At the bottom of the column, There will also be a box for the total number of studies included in your |
For more information about updating your systematic review, see the box Updating Your Review? on the Step 3: Conduct Literature Searches page of the guide.
Scientific articles often follow the IMRaD format: Introduction, Methods, Results, and Discussion. You will also need a title and an abstract to summarize your research.
You can read more about scientific writing through the library guides below.
Systematic reviews follow the same structure as original research articles, but you will need to report on your search instead of on details like the participants or sampling. Sections of your manuscript are shown as bold headings in the PRISMA checklist.
Title | Describe your manuscript and state whether it is a systematic review, meta-analysis, or both. |
---|---|
Abstract | Structure the abstract and include (as applicable): background, objectives, data sources, study eligibility criteria, participants, interventions, quality assessment and synthesis methods, results, limitations, conclusions, implications of key findings, and systematic review registration number. |
Introduction | Describe the rationale for the review and provide a statement of questions being addressed. |
Methods | Include details regarding the protocol, eligibility criteria, databases searched, full search strategy of at least one database (often reported in appendix), and the study selection process. Describe how data were extracted and analyzed. If a librarian is part of your research team, that person may be best suited to write this section. |
Results | Report the numbers of articles screened at each stage using a PRISMA diagram. Include information about included study characteristics, risk of bias (quality assessment) within studies, and results across studies. |
Discussion | Summarize main findings, including the strength of evidence and limitations of the review. Provide a general interpretation of the results and implications for future research. |
Funding | Describe any sources of funding for the systematic review. |
Appendix | Include entire search strategy for at least one database in the appendix (include search strategies for all databases searched for more transparency). |
Refer to the PRISMA checklist for more information.
Consider including a Plain Language Summary (PLS) when you publish your systematic review. Like an abstract, a PLS gives an overview of your study, but is specifically written and formatted to be easy for non-experts to understand.
Tips for writing a PLS:
Learn more about Plain Language Summaries:
Organize the literature review into sections that present themes or identify trends, including relevant theory. You are not trying to list all the material published, but to synthesize and evaluate it according to the guiding concept of your thesis or research question.
What is a literature review?
A literature review is an account of what has been published on a topic by accredited scholars and researchers. Occasionally you will be asked to write one as a separate assignment, but more often it is part of the introduction to an essay, research report, or thesis. In writing the literature review, your purpose is to convey to your reader what knowledge and ideas have been established on a topic, and what their strengths and weaknesses are. As a piece of writing, the literature review must be defined by a guiding concept (e.g., your research objective, the problem or issue you are discussing, or your argumentative thesis). It is not just a descriptive list of the material available, or a set of summaries
A literature review must do these things:
Text written by Dena Taylor, Health Sciences Writing Centre, University of Toronto
http://www.writing.utoronto.ca/advice/specific-types-of-writing/literature-review
University of Texas Arlington Libraries 702 Planetarium Place · Arlington, TX 76019 · 817-272-3000
Purdue Online Writing Lab Purdue OWL® College of Liberal Arts
This page is brought to you by the OWL at Purdue University. When printing this page, you must include the entire legal notice.
Copyright ©1995-2018 by The Writing Lab & The OWL at Purdue and Purdue University. All rights reserved. This material may not be published, reproduced, broadcast, rewritten, or redistributed without permission. Use of this site constitutes acceptance of our terms and conditions of fair use.
A literature review is a document or section of a document that collects key sources on a topic and discusses those sources in conversation with each other (also called synthesis ). The lit review is an important genre in many disciplines, not just literature (i.e., the study of works of literature such as novels and plays). When we say “literature review” or refer to “the literature,” we are talking about the research ( scholarship ) in a given field. You will often see the terms “the research,” “the scholarship,” and “the literature” used mostly interchangeably.
There are a number of different situations where you might write a literature review, each with slightly different expectations; different disciplines, too, have field-specific expectations for what a literature review is and does. For instance, in the humanities, authors might include more overt argumentation and interpretation of source material in their literature reviews, whereas in the sciences, authors are more likely to report study designs and results in their literature reviews; these differences reflect these disciplines’ purposes and conventions in scholarship. You should always look at examples from your own discipline and talk to professors or mentors in your field to be sure you understand your discipline’s conventions, for literature reviews as well as for any other genre.
A literature review can be a part of a research paper or scholarly article, usually falling after the introduction and before the research methods sections. In these cases, the lit review just needs to cover scholarship that is important to the issue you are writing about; sometimes it will also cover key sources that informed your research methodology.
Lit reviews can also be standalone pieces, either as assignments in a class or as publications. In a class, a lit review may be assigned to help students familiarize themselves with a topic and with scholarship in their field, get an idea of the other researchers working on the topic they’re interested in, find gaps in existing research in order to propose new projects, and/or develop a theoretical framework and methodology for later research. As a publication, a lit review usually is meant to help make other scholars’ lives easier by collecting and summarizing, synthesizing, and analyzing existing research on a topic. This can be especially helpful for students or scholars getting into a new research area, or for directing an entire community of scholars toward questions that have not yet been answered.
Most lit reviews use a basic introduction-body-conclusion structure; if your lit review is part of a larger paper, the introduction and conclusion pieces may be just a few sentences while you focus most of your attention on the body. If your lit review is a standalone piece, the introduction and conclusion take up more space and give you a place to discuss your goals, research methods, and conclusions separately from where you discuss the literature itself.
Introduction:
Conclusion:
Lit reviews can take many different organizational patterns depending on what you are trying to accomplish with the review. Here are some examples:
Any lit review is only as good as the research it discusses; make sure your sources are well-chosen and your research is thorough. Don’t be afraid to do more research if you discover a new thread as you’re writing. More info on the research process is available in our "Conducting Research" resources .
As you’re doing your research, create an annotated bibliography ( see our page on the this type of document ). Much of the information used in an annotated bibliography can be used also in a literature review, so you’ll be not only partially drafting your lit review as you research, but also developing your sense of the larger conversation going on among scholars, professionals, and any other stakeholders in your topic.
Usually you will need to synthesize research rather than just summarizing it. This means drawing connections between sources to create a picture of the scholarly conversation on a topic over time. Many student writers struggle to synthesize because they feel they don’t have anything to add to the scholars they are citing; here are some strategies to help you:
The most interesting literature reviews are often written as arguments (again, as mentioned at the beginning of the page, this is discipline-specific and doesn’t work for all situations). Often, the literature review is where you can establish your research as filling a particular gap or as relevant in a particular way. You have some chance to do this in your introduction in an article, but the literature review section gives a more extended opportunity to establish the conversation in the way you would like your readers to see it. You can choose the intellectual lineage you would like to be part of and whose definitions matter most to your thinking (mostly humanities-specific, but this goes for sciences as well). In addressing these points, you argue for your place in the conversation, which tends to make the lit review more compelling than a simple reporting of other sources.
So, what is a literature review? "A literature review is an account of what has been published on a topic by accredited scholars and researchers. In writing the literature review, your purpose is to convey to your reader what knowledge and ideas have been established on a topic, and what their strengths and weaknesses are. As a piece of writing, the literature review must be defined by a guiding concept (e.g., your research objective, the problem or issue you are discussing, or your argumentative thesis). It is not just a descriptive list of the material available, or a set of summaries." Taylor, D. The literature review: A few tips on conducting it . University of Toronto Health Sciences Writing Centre.
What are the goals of creating a Literature Review? A literature could be written to accomplish different aims:
Baumeister, R. F., & Leary, M. R. (1997). Writing narrative literature reviews . Review of General Psychology , 1 (3), 311-320.
What kinds of sources require a Literature Review?
All these instances require you to collect what has been written about your research topic so that you can demonstrate how your own research sheds new light on the topic.
What kinds of literature reviews are written?
Narrative review: The purpose of this type of review is to describe the current state of the research on a specific topic/research and to offer a critical analysis of the literature reviewed. Studies are grouped by research/theoretical categories, and themes and trends, strengths and weakness, and gaps are identified. The review ends with a conclusion section which summarizes the findings regarding the state of the research of the specific study, the gaps identify and if applicable, explains how the author's research will address gaps identify in the review and expand the knowledge on the topic reviewed.
Systematic review : "The authors of a systematic review use a specific procedure to search the research literature, select the studies to include in their review, and critically evaluate the studies they find." (p. 139). Nelson, L. K. (2013). Research in Communication Sciences and Disorders . Plural Publishing.
Meta-analysis : "Meta-analysis is a method of reviewing research findings in a quantitative fashion by transforming the data from individual studies into what is called an effect size and then pooling and analyzing this information. The basic goal in meta-analysis is to explain why different outcomes have occurred in different studies." (p. 197). Roberts, M. C., & Ilardi, S. S. (2003). Handbook of Research Methods in Clinical Psychology . Blackwell Publishing.
Meta-synthesis : "Qualitative meta-synthesis is a type of qualitative study that uses as data the findings from other qualitative studies linked by the same or related topic." (p.312). Zimmer, L. (2006). Qualitative meta-synthesis: A question of dialoguing with texts . Journal of Advanced Nursing , 53 (3), 311-318.
Writing-a-literature-review-six-steps-to-get-you-from-start-to-finish.
Tanya Golash-Boza, Associate Professor of Sociology, University of California
February 03, 2022
Writing a literature review is often the most daunting part of writing an article, book, thesis, or dissertation. “The literature” seems (and often is) massive. I have found it helpful to be as systematic as possible when completing this gargantuan task.
Sonja Foss and William Walters* describe an efficient and effective way of writing a literature review. Their system provides an excellent guide for getting through the massive amounts of literature for any purpose: in a dissertation, an M.A. thesis, or preparing a research article for publication in any field of study. Below is a summary of the steps they outline as well as a step-by-step method for writing a literature review.
Step One: Decide on your areas of research:
Before you begin to search for articles or books, decide beforehand what areas you are going to research. Make sure that you only get articles and books in those areas, even if you come across fascinating books in other areas. A literature review I am currently working on, for example, explores barriers to higher education for undocumented students.
Step Two: Search for the literature:
Conduct a comprehensive bibliographic search of books and articles in your area. Read the abstracts online and download and/or print those articles that pertain to your area of research. Find books in the library that are relevant and check them out. Set a specific time frame for how long you will search. It should not take more than two or three dedicated sessions.
Step Three: Find relevant excerpts in your books and articles:
Skim the contents of each book and article and look specifically for these five things:
1. Claims, conclusions, and findings about the constructs you are investigating
2. Definitions of terms
3. Calls for follow-up studies relevant to your project
4. Gaps you notice in the literature
5. Disagreement about the constructs you are investigating
When you find any of these five things, type the relevant excerpt directly into a Word document. Don’t summarize, as summarizing takes longer than simply typing the excerpt. Make sure to note the name of the author and the page number following each excerpt. Do this for each article and book that you have in your stack of literature. When you are done, print out your excerpts.
Step Four: Code the literature:
Get out a pair of scissors and cut each excerpt out. Now, sort the pieces of paper into similar topics. Figure out what the main themes are. Place each excerpt into a themed pile. Make sure each note goes into a pile. If there are excerpts that you can’t figure out where they belong, separate those and go over them again at the end to see if you need new categories. When you finish, place each stack of notes into an envelope labeled with the name of the theme.
Step Five: Create Your Conceptual Schema:
Type, in large font, the name of each of your coded themes. Print this out, and cut the titles into individual slips of paper. Take the slips of paper to a table or large workspace and figure out the best way to organize them. Are there ideas that go together or that are in dialogue with each other? Are there ideas that contradict each other? Move around the slips of paper until you come up with a way of organizing the codes that makes sense. Write the conceptual schema down before you forget or someone cleans up your slips of paper.
Step Six: Begin to Write Your Literature Review:
Choose any section of your conceptual schema to begin with. You can begin anywhere, because you already know the order. Find the envelope with the excerpts in them and lay them on the table in front of you. Figure out a mini-conceptual schema based on that theme by grouping together those excerpts that say the same thing. Use that mini-conceptual schema to write up your literature review based on the excerpts that you have in front of you. Don’t forget to include the citations as you write, so as not to lose track of who said what. Repeat this for each section of your literature review.
Once you complete these six steps, you will have a complete draft of your literature review. The great thing about this process is that it breaks down into manageable steps something that seems enormous: writing a literature review.
I think that Foss and Walter’s system for writing the literature review is ideal for a dissertation, because a Ph.D. candidate has already read widely in his or her field through graduate seminars and comprehensive exams.
It may be more challenging for M.A. students, unless you are already familiar with the literature. It is always hard to figure out how much you need to read for deep meaning, and how much you just need to know what others have said. That balance will depend on how much you already know.
For people writing literature reviews for articles or books, this system also could work, especially when you are writing in a field with which you are already familiar. The mere fact of having a system can make the literature review seem much less daunting, so I recommend this system for anyone who feels overwhelmed by the prospect of writing a literature review.
*Destination Dissertation: A Traveler's Guide to a Done Dissertation
Image Credit/Source: Goldmund Lukic/Getty Images
Please enter your Email Address
Please enter valid email address
Please Enter your First Name
Please enter your Last Name
Please enter your Questions or Comments.
Please enter the Privacy
Please enter the Terms & Conditions
Related articles.
User Experience (UX) Research is the process of discovering and understanding user requirements, motivations, and behaviours
Learn how Wiley partners with plagiarism detection services to support academic integrity around the world
Medical student Nicole Foley shares her top tips for writing and getting your work published.
Wiley and Lumina are working together to support the efforts of researchers at Harvard Medical School to develop and test new machine learning tools and artificial intelligence (AI) software that can
Learn more about our relationship with a company that helps scientists identify the right products to use in their research
What is ISBN? ISSN? DOI? Learn about some of the unique identifiers for book and journal content.
Learn how medical practitioners can easily access and search visual assets from our article portfolio
Explore free-to-use services that can help you discover new content
Watch this webinar to help you learn how to get published.
Atypon launches Scitrus, a personalized web app that allows users to create a customized feed of the latest research.
FOR INDIVIDUALS
FOR INSTITUTIONS & BUSINESSES
WILEY NETWORK
ABOUT WILEY
Corporate Responsibility
Corporate Governance
Leadership Team
Cookie Preferences
Copyright @ 2000-2024 by John Wiley & Sons, Inc., or related companies. All rights reserved, including rights for text and data mining and training of artificial technologies or similar technologies.
Rights & Permissions
Privacy Policy
Terms of Use
Literature review & systematic review steps.
These steps for conducting a systematic literature review are listed below .
Also see subpages for more information about:
Consider the PICO Format: Population/Problem, Intervention, Comparison, Outcome
Focus on defining the Population or Problem and Intervention (don't narrow by Comparison or Outcome just yet!)
"What are the effects of the Pilates method for patients with low back pain?"
A "scoping search" investigates the breadth and/or depth of the initial question or may identify a gap in the literature.
Eligible studies may be located by searching in:
When searching, if possible, translate terms to controlled vocabulary of the database. Use text word searching when necessary.
Use Boolean operators to connect search terms:
Search: pilates AND ("low back pain" OR backache )
Video Tutorials - Translating PICO Questions into Search Queries
Expand your search strategy with synonymous search terms harvested from:
(pilates OR exercise movement techniques) AND ("low back pain" OR backache* OR sciatica OR lumbago OR spondylosis)
As you develop a final, reproducible strategy for each database, save your strategies in a:
Use database filters to limit your results based on your defined inclusion/exclusion criteria. In addition to relying on the databases' categorical filters, you may also need to manually screen results.
NOTE: Many databases allow you to filter to "Full Text Only". This filter is not recommended . It excludes articles if their full text is not available in that particular database (CINAHL, PubMed, etc), but if the article is relevant, it is important that you are able to read its title and abstract, regardless of 'full text' status. The full text is likely to be accessible through another source (a different database, or Interlibrary Loan).
Selected citations and/or entire sets of search results can be downloaded from the database into a citation management tool. If you are conducting a systematic review that will require reporting according to PRISMA standards, a citation manager can help you keep track of the number of articles that came from each database, as well as the number of duplicate records.
In Zotero, you can create a Collection for the combined results set, and sub-collections for the results from each database you search. You can then use Zotero's 'Duplicate Items" function to find and merge duplicate records.
Covidence is a web-based tool that enables you to work with a team to screen titles/abstracts and full text for inclusion in your review, as well as extract data from the included studies.
The PRISMA (Preferred Reporting Items for Systematic reviews and Meta-Analyses) flow diagram is a visual representation of the flow of records through different phases of a systematic review. It depicts the number of records identified, included and excluded. It is best used in conjunction with the PRISMA checklist .
Example from: Stotz, S. A., McNealy, K., Begay, R. L., DeSanto, K., Manson, S. M., & Moore, K. R. (2021). Multi-level diabetes prevention and treatment interventions for Native people in the USA and Canada: A scoping review. Current Diabetes Reports, 2 (11), 46. https://doi.org/10.1007/s11892-021-01414-3
There are a number of reporting guideline available to guide the synthesis and reporting of results in systematic literature reviews.
It is common to organize findings in a matrix, also known as a Table of Evidence (ToE).
Cook, D. A., & West, C. P. (2012). Conducting systematic reviews in medical education: a stepwise approach. Medical Education , 46 (10), 943–952.
Intended for healthcare professionals
Systematic reviews and meta-analyses are essential to summarise evidence relating to efficacy and safety of healthcare interventions accurately and reliably. The clarity and transparency of these reports, however, are not optimal. Poor reporting of systematic reviews diminishes their value to clinicians, policy makers, and other users.
Since the development of the QUOROM (quality of reporting of meta-analysis) statement—a reporting guideline published in 1999—there have been several conceptual, methodological, and practical advances regarding the conduct and reporting of systematic reviews and meta-analyses. Also, reviews of published systematic reviews have found that key information about these studies is often poorly reported. Realising these issues, an international group that included experienced authors and methodologists developed PRISMA (preferred reporting items for systematic reviews and meta-analyses) as an evolution of the original QUOROM guideline for systematic reviews and meta-analyses of evaluations of health care interventions.
The PRISMA statement consists of a 27-item checklist and a four-phase flow diagram. The checklist includes items deemed essential for transparent reporting of a systematic review. In this explanation and elaboration document, we explain the meaning and rationale for each checklist item. For each item, we include an example of good reporting and, where possible, references to relevant empirical studies and methodological literature. The PRISMA statement, this document, and the associated website ( www.prisma-statement.org/ ) should be helpful resources to improve reporting of systematic reviews and meta-analyses.
Systematic reviews and meta-analyses are essential tools for summarising evidence accurately and reliably. They help clinicians keep up to date; provide evidence for policy makers to judge risks, benefits, and harms of healthcare behaviours and interventions; gather together and summarise related research for patients and their carers; provide a starting point for clinical practice guideline developers; provide summaries of previous research for funders wishing to support new research; 1 and help editors judge the merits of publishing reports of new studies. 2 Recent data suggest that at least 2500 new systematic reviews reported in English are indexed in Medline annually. 3
Unfortunately, there is considerable evidence that key information is often poorly reported in systematic reviews, thus diminishing their potential usefulness. 3 4 5 6 As is true for all research, systematic reviews should be reported fully and transparently to allow readers to assess the strengths and weaknesses of the investigation. 7 That rationale led to the development of the QUOROM (quality of reporting of meta-analysis) statement; those detailed reporting recommendations were published in 1999. 8 In this paper we describe the updating of that guidance. Our aim is to ensure clear presentation of what was planned, done, and found in a systematic review.
Terminology used to describe systematic reviews and meta-analyses has evolved over time and varies across different groups of researchers and authors (see box 1 at end of document). In this document we adopt the definitions used by the Cochrane Collaboration. 9 A systematic review attempts to collate all empirical evidence that fits pre-specified eligibility criteria to answer a specific research question. It uses explicit, systematic methods that are selected to minimise bias, thus providing reliable findings from which conclusions can be drawn and decisions made. Meta-analysis is the use of statistical methods to summarise and combine the results of independent studies. Many systematic reviews contain meta-analyses, but not all.
The QUOROM statement, developed in 1996 and published in 1999, 8 was conceived as a reporting guidance for authors reporting a meta-analysis of randomised trials. Since then, much has happened. First, knowledge about the conduct and reporting of systematic reviews has expanded considerably. For example, the Cochrane Library’s Methodology Register (which includes reports of studies relevant to the methods for systematic reviews) now contains more than 11 000 entries (March 2009). Second, there have been many conceptual advances, such as “outcome-level” assessments of the risk of bias, 10 11 that apply to systematic reviews. Third, authors have increasingly used systematic reviews to summarise evidence other than that provided by randomised trials.
However, despite advances, the quality of the conduct and reporting of systematic reviews remains well short of ideal. 3 4 5 6 All of these issues prompted the need for an update and expansion of the QUOROM statement. Of note, recognising that the updated statement now addresses the above conceptual and methodological issues and may also have broader applicability than the original QUOROM statement, we changed the name of the reporting guidance to PRISMA (preferred reporting items for systematic reviews and meta-analyses).
The PRISMA statement was developed by a group of 29 review authors, methodologists, clinicians, medical editors, and consumers. 12 They attended a three day meeting in 2005 and participated in extensive post-meeting electronic correspondence. A consensus process that was informed by evidence, whenever possible, was used to develop a 27-item checklist (table 1 ⇓ ) and a four-phase flow diagram (fig 1 ⇓ ) (also available as extra items on bmj.com for researchers to download and re-use). Items deemed essential for transparent reporting of a systematic review were included in the checklist. The flow diagram originally proposed by QUOROM was also modified to show numbers of identified records, excluded articles, and included studies. After 11 revisions the group approved the checklist, flow diagram, and this explanatory paper.
Fig 1 Flow of information through the different phases of a systematic review.
Checklist of items to include when reporting a systematic review or meta-analysis
The PRISMA statement itself provides further details regarding its background and development. 12 This accompanying explanation and elaboration document explains the meaning and rationale for each checklist item. A few PRISMA Group participants volunteered to help draft specific items for this document, and four of these (DGA, AL, DM, and JT) met on several occasions to further refine the document, which was circulated and ultimately approved by the larger PRISMA Group.
PRISMA focuses on ways in which authors can ensure the transparent and complete reporting of systematic reviews and meta-analyses. It does not address directly or in a detailed manner the conduct of systematic reviews, for which other guides are available. 13 14 15 16
We developed the PRISMA statement and this explanatory document to help authors report a wide array of systematic reviews to assess the benefits and harms of a healthcare intervention. We consider most of the checklist items relevant when reporting systematic reviews of non-randomised studies assessing the benefits and harms of interventions. However, we recognise that authors who address questions relating to aetiology, diagnosis, or prognosis, for example, and who review epidemiological or diagnostic accuracy studies may need to modify or incorporate additional items for their systematic reviews.
We modeled this explanation and elaboration document after those prepared for other reporting guidelines. 17 18 19 To maximise the benefit of this document, we encourage people to read it in conjunction with the PRISMA statement. 11
We present each checklist item and follow it with a published exemplar of good reporting for that item. (We edited some examples by removing citations or web addresses, or by spelling out abbreviations.) We then explain the pertinent issue, the rationale for including the item, and relevant evidence from the literature, whenever possible. No systematic search was carried out to identify exemplars and evidence. We also include seven boxes at the end of the document that provide a more comprehensive explanation of certain thematic aspects of the methodology and conduct of systematic reviews.
Although we focus on a minimal list of items to consider when reporting a systematic review, we indicate places where additional information is desirable to improve transparency of the review process. We present the items numerically from 1 to 27; however, authors need not address items in this particular order in their reports. Rather, what is important is that the information for each item is given somewhere within the report.
Title and abstract, item 1: title.
Identify the report as a systematic review, meta-analysis, or both.
Examples “Recurrence rates of video-assisted thoracoscopic versus open surgery in the prevention of recurrent pneumothoraces: a systematic review of randomised and non-randomised trials” 20
“Mortality in randomised trials of antioxidant supplements for primary and secondary prevention: systematic review and meta-analysis” 21
Explanation Authors should identify their report as a systematic review or meta-analysis. Terms such as “review” or “overview” do not describe for readers whether the review was systematic or whether a meta-analysis was performed. A recent survey found that 50% of 300 authors did not mention the terms “systematic review” or “meta-analysis” in the title or abstract of their systematic review. 3 Although sensitive search strategies have been developed to identify systematic reviews, 22 inclusion of the terms systematic review or meta-analysis in the title may improve indexing and identification.
We advise authors to use informative titles that make key information easily accessible to readers. Ideally, a title reflecting the PICOS approach (participants, interventions, comparators, outcomes, and study design) (see item 11 and box 2) may help readers as it provides key information about the scope of the review. Specifying the design(s) of the studies included, as shown in the examples, may also help some readers and those searching databases.
Some journals recommend “indicative titles” that indicate the topic matter of the review, while others require declarative titles that give the review’s main conclusion. Busy practitioners may prefer to see the conclusion of the review in the title, but declarative titles can oversimplify or exaggerate findings. Thus, many journals and methodologists prefer indicative titles as used in the examples above.
Provide a structured summary including, as applicable, background; objectives; data sources; study eligibility criteria, participants, and interventions; study appraisal and synthesis methods; results; limitations; conclusions and implications of key findings; funding for the systematic review; and systematic review registration number.
Example “ Context : The role and dose of oral vitamin D supplementation in nonvertebral fracture prevention have not been well established.
Objective : To estimate the effectiveness of vitamin D supplementation in preventing hip and nonvertebral fractures in older persons.
Data Sources : A systematic review of English and non-English articles using MEDLINE and the Cochrane Controlled Trials Register (1960-2005), and EMBASE (1991-2005). Additional studies were identified by contacting clinical experts and searching bibliographies and abstracts presented at the American Society for Bone and Mineral Research (1995-2004). Search terms included randomised controlled trial (RCT), controlled clinical trial, random allocation, double-blind method, cholecalciferol, ergocalciferol, 25-hydroxyvitamin D, fractures, humans, elderly, falls, and bone density.
Study Selection : Only double-blind RCTs of oral vitamin D supplementation (cholecalciferol, ergocalciferol) with or without calcium supplementation vs calcium supplementation or placebo in older persons (>60 years) that examined hip or nonvertebral fractures were included.
Data Extraction : Independent extraction of articles by 2 authors using predefined data fields, including study quality indicators.
Data Synthesis : All pooled analyses were based on random-effects models. Five RCTs for hip fracture (n=9294) and 7 RCTs for nonvertebral fracture risk (n=9820) met our inclusion criteria. All trials used cholecalciferol. Heterogeneity among studies for both hip and nonvertebral fracture prevention was observed, which disappeared after pooling RCTs with low-dose (400 IU/d) and higher-dose vitamin D (700-800 IU/d), separately. A vitamin D dose of 700 to 800 IU/d reduced the relative risk (RR) of hip fracture by 26% (3 RCTs with 5572 persons; pooled RR, 0.74; 95% confidence interval [CI], 0.61-0.88) and any nonvertebral fracture by 23% (5 RCTs with 6098 persons; pooled RR, 0.77; 95% CI, 0.68-0.87) vs calcium or placebo. No significant benefit was observed for RCTs with 400 IU/d vitamin D (2 RCTs with 3722 persons; pooled RR for hip fracture, 1.15; 95% CI, 0.88-1.50; and pooled RR for any nonvertebral fracture, 1.03; 95% CI, 0.86-1.24).
Conclusions : Oral vitamin D supplementation between 700 to 800 IU/d appears to reduce the risk of hip and any nonvertebral fractures in ambulatory or institutionalised elderly persons. An oral vitamin D dose of 400 IU/d is not sufficient for fracture prevention.” 23
Explanation Abstracts provide key information that enables readers to understand the scope, processes, and findings of a review and to decide whether to read the full report. The abstract may be all that is readily available to a reader, for example, in a bibliographic database. The abstract should present a balanced and realistic assessment of the review’s findings that mirrors, albeit briefly, the main text of the report.
We agree with others that the quality of reporting in abstracts presented at conferences and in journal publications needs improvement. 24 25 While we do not uniformly favour a specific format over another, we generally recommend structured abstracts. Structured abstracts provide readers with a series of headings pertaining to the purpose, conduct, findings, and conclusions of the systematic review being reported. 26 27 They give readers more complete information and facilitate finding information more easily than unstructured abstracts. 28 29 30 31 32
A highly structured abstract of a systematic review could include the following headings: Context (or Background ); Objective (or Purpose ); Data sources ; Study selection (or Eligibility criteria ); Study appraisal and Synthesis methods (or Data extraction and Data synthesis ); Results ; Limitations ; and Conclusions (or Implications ). Alternatively, a simpler structure could cover but collapse some of the above headings (such as label Study selection and Study appraisal as Review methods ) or omit some headings such as Background and Limitations .
In the highly structured abstract mentioned above, authors use the Background heading to set the context for readers and explain the importance of the review question. Under the Objectives heading, they ideally use elements of PICOS (see box 2) to state the primary objective of the review. Under a Data sources heading, they summarise sources that were searched, any language or publication type restrictions, and the start and end dates of searches. Study selection statements then ideally describe who selected studies using what inclusion criteria. Data extraction methods statements describe appraisal methods during data abstraction and the methods used to integrate or summarise the data. The Data synthesis section is where the main results of the review are reported. If the review includes meta-analyses, authors should provide numerical results with confidence intervals for the most important outcomes. Ideally, they should specify the amount of evidence in these analyses (numbers of studies and numbers of participants). Under a Limitations heading, authors might describe the most important weaknesses of included studies as well as limitations of the review process. Then authors should provide clear and balanced Conclusions that are closely linked to the objective and findings of the review. Additionally, it would be helpful if authors included some information about funding for the review. Finally, although protocol registration for systematic reviews is still not common practice, if authors have registered their review or received a registration number, we recommend providing the registration information at the end of the abstract.
Taking all the above considerations into account, the intrinsic tension between the goal of completeness of the abstract and its keeping into the space limit often set by journal editors is recognised as a major challenge.
Describe the rationale for the review in the context of what is already known.
Example “Reversing the trend of increasing weight for height in children has proven difficult. It is widely accepted that increasing energy expenditure and reducing energy intake form the theoretical basis for management. Therefore, interventions aiming to increase physical activity and improve diet are the foundation of efforts to prevent and treat childhood obesity. Such lifestyle interventions have been supported by recent systematic reviews, as well as by the Canadian Paediatric Society, the Royal College of Paediatrics and Child Health, and the American Academy of Pediatrics. However, these interventions are fraught with poor adherence. Thus, school-based interventions are theoretically appealing because adherence with interventions can be improved. Consequently, many local governments have enacted or are considering policies that mandate increased physical activity in schools, although the effect of such interventions on body composition has not been assessed.” 33
Explanation Readers need to understand the rationale behind the study and what the systematic review may add to what is already known. Authors should tell readers whether their report is a new systematic review or an update of an existing one. If the review is an update, authors should state reasons for the update, including what has been added to the evidence base since the previous version of the review.
An ideal background or introduction that sets context for readers might include the following. First, authors might define the importance of the review question from different perspectives (such as public health, individual patient, or health policy). Second, authors might briefly mention the current state of knowledge and its limitations. As in the above example, information about the effects of several different interventions may be available that helps readers understand why potential relative benefits or harms of particular interventions need review. Third, authors might whet readers’ appetites by clearly stating what the review aims to add. They also could discuss the extent to which the limitations of the existing evidence base may be overcome by the review.
Provide an explicit statement of questions being addressed with reference to participants, interventions, comparisons, outcomes, and study design (PICOS).
Example “To examine whether topical or intraluminal antibiotics reduce catheter-related bloodstream infection, we reviewed randomised, controlled trials that assessed the efficacy of these antibiotics for primary prophylaxis against catheter-related bloodstream infection and mortality compared with no antibiotic therapy in adults undergoing hemodialysis.” 34
Explanation The questions being addressed, and the rationale for them, are one of the most critical parts of a systematic review. They should be stated precisely and explicitly so that readers can understand quickly the review’s scope and the potential applicability of the review to their interests. 35 Framing questions so that they include the following five “PICOS” components may improve the explicitness of review questions: (1) the patient population or disease being addressed (P), (2) the interventions or exposure of interest (I), (3) the comparators (C), (4) the main outcome or endpoint of interest (O), and (5) the study designs chosen (S). For more detail regarding PICOS, see box 2.
Good review questions may be narrowly focused or broad, depending on the overall objectives of the review. Sometimes broad questions might increase the applicability of the results and facilitate detection of bias, exploratory analyses, and sensitivity analyses. 35 36 Whether narrowly focused or broad, precisely stated review objectives are critical as they help define other components of the review process such as the eligibility criteria (item 6) and the search for relevant literature (items 7 and 8).
Indicate if a review protocol exists, if and where it can be accessed (such as a web address), and, if available, provide registration information including the registration number.
Example “Methods of the analysis and inclusion criteria were specified in advance and documented in a protocol.” 37
Explanation A protocol is important because it pre-specifies the objectives and methods of the systematic review. For instance, a protocol specifies outcomes of primary interest, how reviewers will extract information about those outcomes, and methods that reviewers might use to quantitatively summarise the outcome data (see item 13). Having a protocol can help restrict the likelihood of biased post hoc decisions in review methods, such as selective outcome reporting. Several sources provide guidance about elements to include in the protocol for a systematic review. 16 38 39 For meta-analyses of individual patient-level data, we advise authors to describe whether a protocol was explicitly designed and whether, when, and how participating collaborators endorsed it. 40 41
Authors may modify protocols during the research, and readers should not automatically consider such modifications inappropriate. For example, legitimate modifications may extend the period of searches to include older or newer studies, broaden eligibility criteria that proved too narrow, or add analyses if the primary analyses suggest that additional ones are warranted. Authors should, however, describe the modifications and explain their rationale.
Although worthwhile protocol amendments are common, one must consider the effects that protocol modifications may have on the results of a systematic review, especially if the primary outcome is changed. Bias from selective outcome reporting in randomised trials has been well documented. 42 43 An examination of 47 Cochrane reviews revealed indirect evidence for possible selective reporting bias for systematic reviews. Almost all (n=43) contained a major change, such as the addition or deletion of outcomes, between the protocol and the full publication. 44 Whether (or to what extent) the changes reflected bias, however, was not clear. For example, it has been rather common not to describe outcomes that were not presented in any of the included studies.
Registration of a systematic review, typically with a protocol and registration number, is not yet common, but some opportunities exist. 45 46 Registration may possibly reduce the risk of multiple reviews addressing the same question, 45 46 47 48 reduce publication bias, and provide greater transparency when updating systematic reviews. Of note, a survey of systematic reviews indexed in Medline in November 2004 found that reports of protocol use had increased to about 46% 3 from 8% noted in previous surveys. 49 The improvement was due mostly to Cochrane reviews, which, by requirement, have a published protocol. 3
Specify study characteristics (such as PICOS, length of follow-up) and report characteristics (such as years considered, language, publication status) used as criteria for eligibility, giving rationale.
Examples Types of studies: “Randomised clinical trials studying the administration of hepatitis B vaccine to CRF [chronic renal failure] patients, with or without dialysis. No language, publication date, or publication status restrictions were imposed…”
Types of participants: “Participants of any age with CRF or receiving dialysis (haemodialysis or peritoneal dialysis) were considered. CRF was defined as serum creatinine greater than 200 µmol/L for a period of more than six months or individuals receiving dialysis (haemodialysis or peritoneal dialysis)…Renal transplant patients were excluded from this review as these individuals are immunosuppressed and are receiving immunosuppressant agents to prevent rejection of their transplanted organs, and they have essentially normal renal function...”
Types of intervention: “Trials comparing the beneficial and harmful effects of hepatitis B vaccines with adjuvant or cytokine co-interventions [and] trials comparing the beneficial and harmful effects of immunoglobulin prophylaxis. This review was limited to studies looking at active immunisation. Hepatitis B vaccines (plasma or recombinant (yeast) derived) of all types, dose, and regimens versus placebo, control vaccine, or no vaccine…”
Types of outcome measures: “Primary outcome measures: Seroconversion, ie, proportion of patients with adequate anti-HBs response (>10 IU/L or Sample Ratio Units). Hepatitis B infections (as measured by hepatitis B core antigen (HBcAg) positivity or persistent HBsAg positivity), both acute and chronic. Acute (primary) HBV [hepatitis B virus] infections were defined as seroconversion to HBsAg positivity or development of IgM anti-HBc. Chronic HBV infections were defined as the persistence of HBsAg for more than six months or HBsAg positivity and liver biopsy compatible with a diagnosis or chronic hepatitis B. Secondary outcome measures: Adverse events of hepatitis B vaccinations…[and]…mortality.” 50
Explanation Knowledge of the eligibility criteria is essential in appraising the validity, applicability, and comprehensiveness of a review. Thus, authors should unambiguously specify eligibility criteria used in the review. Carefully defined eligibility criteria inform various steps of the review methodology. They influence the development of the search strategy and serve to ensure that studies are selected in a systematic and unbiased manner.
A study may be described in multiple reports, and one report may describe multiple studies. Therefore, we separate eligibility criteria into the following two components: study characteristics and report characteristics. Both need to be reported. Study eligibility criteria are likely to include the populations, interventions, comparators, outcomes, and study designs of interest (PICOS, see box 2), as well as other study-specific elements, such as specifying a minimum length of follow-up. Authors should state whether studies will be excluded because they do not include (or report) specific outcomes to help readers ascertain whether the systematic review may be biased as a consequence of selective reporting. 42 43
Report eligibility criteria are likely to include language of publication, publication status (such as inclusion of unpublished material and abstracts), and year of publication. Inclusion or not of non-English language literature, 51 52 53 54 55 unpublished data, or older data can influence the effect estimates in meta-analyses. 56 57 58 59 Caution may need to be exercised in including all identified studies due to potential differences in the risk of bias such as, for example, selective reporting in abstracts. 60 61 62
Describe all information sources in the search (such as databases with dates of coverage, contact with study authors to identify additional studies) and date last searched.
Example “Studies were identified by searching electronic databases, scanning reference lists of articles and consultation with experts in the field and drug companies…No limits were applied for language and foreign papers were translated. This search was applied to Medline (1966 - Present), CancerLit (1975 - Present), and adapted for Embase (1980 - Present), Science Citation Index Expanded (1981 - Present) and Pre-Medline electronic databases. Cochrane and DARE (Database of Abstracts of Reviews of Effectiveness) databases were reviewed…The last search was run on 19 June 2001. In addition, we handsearched contents pages of Journal of Clinical Oncology 2001, European Journal of Cancer 2001 and Bone 2001, together with abstracts printed in these journals 1999 - 2001. A limited update literature search was performed from 19 June 2001 to 31 December 2003.” 63
Explanation The National Library of Medicine’s Medline database is one of the most comprehensive sources of healthcare information in the world. Like any database, however, its coverage is not complete and varies according to the field. Retrieval from any single database, even by an experienced searcher, may be imperfect, which is why detailed reporting is important within the systematic review.
At a minimum, for each database searched, authors should report the database, platform, or provider (such as Ovid, Dialog, PubMed) and the start and end dates for the search of each database. This information lets readers assess the currency of the review, which is important because the publication time-lag outdates the results of some reviews. 64 This information should also make updating more efficient. 65 Authors should also report who developed and conducted the search. 66
In addition to searching databases, authors should report the use of supplementary approaches to identify studies, such as hand searching of journals, checking reference lists, searching trials registries or regulatory agency websites, 67 contacting manufacturers, or contacting authors. Authors should also report if they attempted to acquire any missing information (such as on study methods or results) from investigators or sponsors; it is useful to describe briefly who was contacted and what unpublished information was obtained.
Present the full electronic search strategy for at least one major database, including any limits used, such that it could be repeated.
Examples In text: “We used the following search terms to search all trials registers and databases: immunoglobulin*; IVIG; sepsis; septic shock; septicaemia; and septicemia…” 68
In appendix: “Search strategy: MEDLINE (OVID)
01. immunoglobulins/
02. immunoglobulin$.tw.
03. ivig.tw.
04. 1 or 2 or 3
05. sepsis/
06. sepsis.tw.
07. septic shock/
08. septic shock.tw.
09. septicemia/
10. septicaemia.tw.
11. septicemia.tw.
12. 5 or 6 or 7 or 8 or 9 or 10 or 11
13. 4 and 12
14. randomised controlled trials/
15. randomised-controlled-trial.pt.
16. controlled-clinical-trial.pt.
17. random allocation/
18. double-blind method/
19. single-blind method/
20. 14 or 15 or 16 or 17 or 18 or 19
21. exp clinical trials/
22. clinical-trial.pt.
23. (clin$ adj trial$).ti,ab.
24. ((singl$ or doubl$ or trebl$ or tripl$) adj (blind$)).ti,ab.
25. placebos/
26. placebo$.ti,ab.
27. random$.ti,ab.
28. 21 or 22 or 23 or 24 or 25 or 26 or 27
29. research design/
30. comparative study/
31. exp evaluation studies/
32. follow-up studies/
33. prospective studies/
34. (control$ or prospective$ or volunteer$).ti,ab.
35. 30 or 31 or 32 or 33 or 34
36. 20 or 28 or 29 or 35
37. 13 and 36” 68
Explanation The search strategy is an essential part of the report of any systematic review. Searches may be complicated and iterative, particularly when reviewers search unfamiliar databases or their review is addressing a broad or new topic. Perusing the search strategy allows interested readers to assess the comprehensiveness and completeness of the search, and to replicate it. Thus, we advise authors to report their full electronic search strategy for at least one major database. As an alternative to presenting search strategies for all databases, authors could indicate how the search took into account other databases searched, as index terms vary across databases. If different searches are used for different parts of a wider question (such as questions relating to benefits and questions relating to harms), we recommend authors provide at least one example of a strategy for each part of the objective. 69 We also encourage authors to state whether search strategies were peer reviewed as part of the systematic review process. 70
We realise that journal restrictions vary and that having the search strategy in the text of the report is not always feasible. We strongly encourage all journals, however, to find ways—such as a “web extra,” appendix, or electronic link to an archive—to make search strategies accessible to readers. We also advise all authors to archive their searches so that (1) others may access and review them (such as replicate them or understand why their review of a similar topic did not identify the same reports), and (2) future updates of their review are facilitated.
Several sources provide guidance on developing search strategies. 71 72 73 Most searches have constraints, such as relating to limited time or financial resources, inaccessible or inadequately indexed reports and databases, unavailability of experts with particular language or database searching skills, or review questions for which pertinent evidence is not easy to find. Authors should be straightforward in describing their search constraints. Apart from the keywords used to identify or exclude records, they should report any additional limitations relevant to the search, such as language and date restrictions (see also eligibility criteria, item 6). 51
State the process for selecting studies (that is, for screening, for determining eligibility, for inclusion in the systematic review, and, if applicable, for inclusion in the meta-analysis).
Example “Eligibility assessment…[was] performed independently in an unblinded standardized manner by 2 reviewers…Disagreements between reviewers were resolved by consensus.” 74
Explanation There is no standard process for selecting studies to include in a systematic review. Authors usually start with a large number of identified records from their search and sequentially exclude records according to eligibility criteria. We advise authors to report how they screened the retrieved records (typically a title and abstract), how often it was necessary to review the full text publication, and if any types of record (such as letters to the editor) were excluded. We also advise using the PRISMA flow diagram to summarise study selection processes (see item 17 and box 3).
Efforts to enhance objectivity and avoid mistakes in study selection are important. Thus authors should report whether each stage was carried out by one or several people, who these people were, and, whenever multiple independent investigators performed the selection, what the process was for resolving disagreements. The use of at least two investigators may reduce the possibility of rejecting relevant reports. 75 The benefit may be greatest for topics where selection or rejection of an article requires difficult judgments. 76 For these topics, authors should ideally tell readers the level of inter-rater agreement, how commonly arbitration about selection was required, and what efforts were made to resolve disagreements (such as by contact with the authors of the original studies).
Describe the method of data extraction from reports (such as piloted forms, independently by two reviewers) and any processes for obtaining and confirming data from investigators.
Example “We developed a data extraction sheet (based on the Cochrane Consumers and Communication Review Group’s data extraction template), pilot-tested it on ten randomly-selected included studies, and refined it accordingly. One review author extracted the following data from included studies and the second author checked the extracted data…Disagreements were resolved by discussion between the two review authors; if no agreement could be reached, it was planned a third author would decide. We contacted five authors for further information. All responded and one provided numerical data that had only been presented graphically in the published paper.” 77
Explanation Reviewers extract information from each included study so that they can critique, present, and summarise evidence in a systematic review. They might also contact authors of included studies for information that has not been, or is unclearly, reported. In meta-analysis of individual patient data, this phase involves collection and scrutiny of detailed raw databases. The authors should describe these methods, including any steps taken to reduce bias and mistakes during data collection and data extraction. 78 (See box 3)
Some systematic reviewers use a data extraction form that could be reported as an appendix or “Web extra” to their report. These forms could show the reader what information reviewers sought (see item 11) and how they extracted it. Authors could tell readers if the form was piloted. Regardless, we advise authors to tell readers who extracted what data, whether any extractions were completed in duplicate, and, if so, whether duplicate abstraction was done independently and how disagreements were resolved.
Published reports of the included studies may not provide all the information required for the review. Reviewers should describe any actions they took to seek additional information from the original researchers (see item 7). The description might include how they attempted to contact researchers, what they asked for, and their success in obtaining the necessary information. Authors should also tell readers when individual patient data were sought from the original researchers. 41 (see item 11) and indicate the studies for which such data were used in the analyses. The reviewers ideally should also state whether they confirmed the accuracy of the information included in their review with the original researchers, for example, by sending them a copy of the draft review. 79
Some studies are published more than once. Duplicate publications may be difficult to ascertain, and their inclusion may introduce bias. 80 81 We advise authors to describe any steps they used to avoid double counting and piece together data from multiple reports of the same study (such as juxtaposing author names, treatment comparisons, sample sizes, or outcomes). We also advise authors to indicate whether all reports on a study were considered, as inconsistencies may reveal important limitations. For example, a review of multiple publications of drug trials showed that reported study characteristics may differ from report to report, including the description of the design, number of patients analysed, chosen significance level, and outcomes. 82 Authors ideally should present any algorithm that they used to select data from overlapping reports and any efforts they used to solve logical inconsistencies across reports.
List and define all variables for which data were sought (such as PICOS, funding sources) and any assumptions and simplifications made.
Examples “Information was extracted from each included trial on: (1) characteristics of trial participants (including age, stage and severity of disease, and method of diagnosis), and the trial’s inclusion and exclusion criteria; (2) type of intervention (including type, dose, duration and frequency of the NSAID [non-steroidal anti-inflammatory drug]; versus placebo or versus the type, dose, duration and frequency of another NSAID; or versus another pain management drug; or versus no treatment); (3) type of outcome measure (including the level of pain reduction, improvement in quality of life score (using a validated scale), effect on daily activities, absence from work or school, length of follow up, unintended effects of treatment, number of women requiring more invasive treatment).” 83
Explanation It is important for readers to know what information review authors sought, even if some of this information was not available. 84 If the review is limited to reporting only those variables that were obtained, rather than those that were deemed important but could not be obtained, bias might be introduced and the reader might be misled. It is therefore helpful if authors can refer readers to the protocol (see item 5) and archive their extraction forms (see item 10), including definitions of variables. The published systematic review should include a description of the processes used with, if relevant, specification of how readers can get access to additional materials.
We encourage authors to report whether some variables were added after the review started. Such variables might include those found in the studies that the reviewers identified (such as important outcome measures that the reviewers initially overlooked). Authors should describe the reasons for adding any variables to those already pre-specified in the protocol so that readers can understand the review process.
We advise authors to report any assumptions they made about missing or unclear information and to explain those processes. For example, in studies of women aged 50 or older it is reasonable to assume that none were pregnant, even if this is not reported. Likewise, review authors might make assumptions about the route of administration of drugs assessed. However, special care should be taken in making assumptions about qualitative information. For example, the upper age limit for “children” can vary from 15 years to 21 years, “intense” physiotherapy might mean very different things to different researchers at different times and for different patients, and the volume of blood associated with “heavy” blood loss might vary widely depending on the setting.
Describe methods used for assessing risk of bias in individual studies (including specification of whether this was done at the study or outcome level, or both), and how this information is to be used in any data synthesis.
Example “To ascertain the validity of eligible randomized trials, pairs of reviewers working independently and with adequate reliability determined the adequacy of randomization and concealment of allocation, blinding of patients, health care providers, data collectors, and outcome assessors; and extent of loss to follow-up (i.e. proportion of patients in whom the investigators were not able to ascertain outcomes).” 85
“To explore variability in study results (heterogeneity) we specified the following hypotheses before conducting the analysis. We hypothesised that effect size may differ according to the methodological quality of the studies.” 86
Explanation The likelihood that the treatment effect reported in a systematic review approximates the truth depends on the validity of the included studies, as certain methodological characteristics may be associated with effect sizes. 87 88 For example, trials without reported adequate allocation concealment exaggerate treatment effects on average compared with those with adequate concealment. 88 Therefore, it is important for authors to describe any methods that they used to gauge the risk of bias in the included studies and how that information was used. 89 Additionally, authors should provide a rationale if no assessment of risk of bias was undertaken. The most popular term to describe the issues relevant to this item is “quality,” but for the reasons that are elaborated in box 4 we prefer to name this item as “assessment of risk of bias.”
Many methods exist to assess the overall risk of bias in included studies, including scales, checklists, and individual components. 90 91 As discussed in box 4, scales that numerically summarise multiple components into a single number are misleading and unhelpful. 92 93 Rather, authors should specify the methodological components that they assessed. Common markers of validity for randomised trials include the following: appropriate generation of random allocation sequence; 94 concealment of the allocation sequence; 93 blinding of participants, health care providers, data collectors, and outcome adjudicators; 95 96 97 98 proportion of patients lost to follow-up; 99 100 stopping of trials early for benefit; 101 and whether the analysis followed the intention-to-treat principle. 100 102 The ultimate decision regarding which methodological features to evaluate requires consideration of the strength of the empiric data, theoretical rationale, and the unique circumstances of the included studies.
Authors should report how they assessed risk of bias; whether it was in a blind manner; and if assessments were completed by more than one person, and if so, whether they were completed independently. 103 104 Similarly, we encourage authors to report any calibration exercises among review team members that were done. Finally, authors need to report how their assessments of risk of bias are used subsequently in the data synthesis (see item 16). Despite the often difficult task of assessing the risk of bias in included studies, authors are sometimes silent on what they did with the resultant assessments. 89 If authors exclude studies from the review or any subsequent analyses on the basis of the risk of bias, they should tell readers which studies they excluded and explain the reasons for those exclusions (see item 6). Authors should also describe any planned sensitivity or subgroup analyses related to bias assessments (see item 16).
State the principal summary measures (such as risk ratio, difference in means).
Examples “Relative risk of mortality reduction was the primary measure of treatment effect.” 105
“The meta-analyses were performed by computing relative risks (RRs) using random-effects model. Quantitative analyses were performed on an intention-to-treat basis and were confined to data derived from the period of follow-up. RR and 95% confidence intervals for each side effect (and all side effects) were calculated.” 106
“The primary outcome measure was the mean difference in log 10 HIV-1 viral load comparing zinc supplementation to placebo...” 107
Explanation When planning a systematic review, it is generally desirable that authors pre-specify the outcomes of primary interest (see item 5) as well as the intended summary effect measure for each outcome. The chosen summary effect measure may differ from that used in some of the included studies. If possible the choice of effect measures should be explained, though it is not always easy to judge in advance which measure is the most appropriate.
For binary outcomes, the most common summary measures are the risk ratio, odds ratio, and risk difference. 108 Relative effects are more consistent across studies than absolute effects, 109 110 although absolute differences are important when interpreting findings (see item 24).
For continuous outcomes, the natural effect measure is the difference in means. 108 Its use is appropriate when outcome measurements in all studies are made on the same scale. The standardised difference in means is used when the studies do not yield directly comparable data. Usually this occurs when all studies assess the same outcome but measure it in a variety of ways (such as different scales to measure depression).
For time-to-event outcomes, the hazard ratio is the most common summary measure. Reviewers need the log hazard ratio and its standard error for a study to be included in a meta-analysis. 111 This information may not be given for all studies, but methods are available for estimating the desired quantities from other reported information. 111 Risk ratio and odds ratio (in relation to events occurring by a fixed time) are not equivalent to the hazard ratio, and median survival times are not a reliable basis for meta-analysis. 112 If authors have used these measures they should describe their methods in the report.
Describe the methods of handling data and combining results of studies, if done, including measures of consistency (such as I 2 ) for each meta-analysis.
Examples “We tested for heterogeneity with the Breslow-Day test, and used the method proposed by Higgins et al. to measure inconsistency (the percentage of total variation across studies due to heterogeneity) of effects across lipid-lowering interventions. The advantages of this measure of inconsistency (termed I 2 ) are that it does not inherently depend on the number of studies and is accompanied by an uncertainty interval.” 113
“In very few instances, estimates of baseline mean or mean QOL [Quality of life] responses were obtained without corresponding estimates of variance (standard deviation [SD] or standard error). In these instances, an SD was imputed from the mean of the known SDs. In a number of cases, the response data available were the mean and variance in a pre study condition and after therapy. The within-patient variance in these cases could not be calculated directly and was approximated by assuming independence.” 114
Explanation The data extracted from the studies in the review may need some transformation (processing) before they are suitable for analysis or for presentation in an evidence table. Although such data handling may facilitate meta-analyses, it is sometimes needed even when meta-analyses are not done. For example, in trials with more than two intervention groups it may be necessary to combine results for two or more groups (such as receiving similar but non-identical interventions), or it may be desirable to include only a subset of the data to match the review’s inclusion criteria. When several different scales (such as for depression) are used across studies, the sign of some scores may need to be reversed to ensure that all scales are aligned (such as so low values represent good health on all scales). Standard deviations may have to be reconstructed from other statistics such as P values and t statistics, 115 116 or occasionally they may be imputed from the standard deviations observed in other studies. 117 Time-to-event data also usually need careful conversions to a consistent format. 111 Authors should report details of any such data processing.
Statistical combination of data from two or more separate studies in a meta-analysis may be neither necessary nor desirable (see box 5 and item 21). Regardless of the decision to combine individual study results, authors should report how they planned to evaluate between-study variability (heterogeneity or inconsistency) (box 6). The consistency of results across trials may influence the decision of whether to combine trial results in a meta-analysis.
When meta-analysis is done, authors should specify the effect measure (such as relative risk or mean difference) (see item 13), the statistical method (such as inverse variance), and whether a fixed-effects or random-effects approach, or some other method (such as Bayesian) was used (see box 6). If possible, authors should explain the reasons for those choices.
Specify any assessment of risk of bias that may affect the cumulative evidence (such as publication bias, selective reporting within studies).
Examples “For each trial we plotted the effect by the inverse of its standard error. The symmetry of such ‘funnel plots’ was assessed both visually, and formally with Egger’s test, to see if the effect decreased with increasing sample size.” 118
“We assessed the possibility of publication bias by evaluating a funnel plot of the trial mean differences for asymmetry, which can result from the non publication of small trials with negative results…Because graphical evaluation can be subjective, we also conducted an adjusted rank correlation test and a regression asymmetry test as formal statistical tests for publication bias...We acknowledge that other factors, such as differences in trial quality or true study heterogeneity, could produce asymmetry in funnel plots.” 119
Explanation Reviewers should explore the possibility that the available data are biased. They may examine results from the available studies for clues that suggest there may be missing studies (publication bias) or missing data from the included studies (selective reporting bias) (see box 7). Authors should report in detail any methods used to investigate possible bias across studies.
It is difficult to assess whether within-study selective reporting is present in a systematic review. If a protocol of an individual study is available, the outcomes in the protocol and the published report can be compared. Even in the absence of a protocol, outcomes listed in the methods section of the published report can be compared with those for which results are presented. 120 In only half of 196 trial reports describing comparisons of two drugs in arthritis were all the effect variables in the methods and results sections the same. 82 In other cases, knowledge of the clinical area may suggest that it is likely that the outcome was measured even if it was not reported. For example, in a particular disease, if one of two linked outcomes is reported but the other is not, then one should question whether the latter has been selectively omitted. 121 122
Only 36% (76 of 212) of therapeutic systematic reviews published in November 2004 reported that study publication bias was considered, and only a quarter of those intended to carry out a formal assessment for that bias. 3 Of 60 meta-analyses in 24 articles published in 2005 in which formal assessments were reported, most were based on fewer than 10 studies; most displayed statistically significant heterogeneity; and many reviewers misinterpreted the results of the tests employed. 123 A review of trials of antidepressants found that meta-analysis of only the published trials gave effect estimates 32% larger on average than when all trials sent to the drug agency were analysed. 67
Describe methods of additional analyses (such as sensitivity or subgroup analyses, meta-regression), if done, indicating which were pre-specified.
Example “Sensitivity analyses were pre-specified. The treatment effects were examined according to quality components (concealed treatment allocation, blinding of patients and caregivers, blinded outcome assessment), time to initiation of statins, and the type of statin. One post-hoc sensitivity analysis was conducted including unpublished data from a trial using cerivastatin.” 124
Explanation Authors may perform additional analyses to help understand whether the results of their review are robust, all of which should be reported. Such analyses include sensitivity analysis, subgroup analysis, and meta-regression. 125
Sensitivity analyses are used to explore the degree to which the main findings of a systematic review are affected by changes in its methods or in the data used from individual studies (such as study inclusion criteria, results of risk of bias assessment). Subgroup analyses address whether the summary effects vary in relation to specific (usually clinical) characteristics of the included studies or their participants. Meta-regression extends the idea of subgroup analysis to the examination of the quantitative influence of study characteristics on the effect size. 126 Meta-regression also allows authors to examine the contribution of different variables to the heterogeneity in study findings. Readers of systematic reviews should be aware that meta-regression has many limitations, including a danger of over-interpretation of findings. 127 128
Even with limited data, many additional analyses can be undertaken. The choice of which analysis to undertake will depend on the aims of the review. None of these analyses, however, is exempt from producing potentially misleading results. It is important to inform readers whether these analyses were performed, their rationale, and which were pre-specified.
Give numbers of studies screened, assessed for eligibility, and included in the review, with reasons for exclusions at each stage, ideally with a flow diagram.
Examples In text: “A total of 10 studies involving 13 trials were identified for inclusion in the review. The search of Medline, PsycInfo and Cinahl databases provided a total of 584 citations. After adjusting for duplicates 509 remained. Of these, 479 studies were discarded because after reviewing the abstracts it appeared that these papers clearly did not meet the criteria. Three additional studies…were discarded because full text of the study was not available or the paper could not be feasibly translated into English. The full text of the remaining 27 citations was examined in more detail. It appeared that 22 studies did not meet the inclusion criteria as described. Five studies…met the inclusion criteria and were included in the systematic review. An additional five studies...that met the criteria for inclusion were identified by checking the references of located, relevant papers and searching for studies that have cited these papers. No unpublished relevant studies were obtained.” 129
See flow diagram in fig 2 ⇓ .
Fig 2 Example flow diagram of study selection. DDW = Digestive Disease Week; UEGW = United European Gastroenterology Week. Adapted from Fuccio et al 130
Explanation Authors should report, ideally with a flow diagram, the total number of records identified from electronic bibliographic sources (including specialised database or registry searches), hand searches of various sources, reference lists, citation indices, and experts. It is useful if authors delineate for readers the number of selected articles that were identified from the different sources so that they can see, for example, whether most articles were identified through electronic bibliographic sources or from references or experts. Literature identified primarily from references or experts may be prone to citation or publication bias. 131 132
The flow diagram and text should describe clearly the process of report selection throughout the review. Authors should report unique records identified in searches, records excluded after preliminary screening (such as screening of titles and abstracts), reports retrieved for detailed evaluation, potentially eligible reports that were not retrievable, retrieved reports that did not meet inclusion criteria and the primary reasons for exclusion, and the studies included in the review. Indeed, the most appropriate layout may vary for different reviews.
Authors should also note the presence of duplicate or supplementary reports so that readers understand the number of individual studies compared with the number of reports that were included in the review. Authors should be consistent in their use of terms, such as whether they are reporting on counts of citations, records, publications, or studies. We believe that reporting the number of studies is the most important.
A flow diagram can be very useful; it should depict all the studies included based on fulfilling the eligibility criteria, and whether data have been combined for statistical analysis. A recent review of 87 systematic reviews found that about half included a QUOROM flow diagram. 133 The authors of this research recommended some important ways that reviewers can improve the use of a flow diagram when describing the flow of information throughout the review process, including a separate flow diagram for each important outcome reported. 133
For each study, present characteristics for which data were extracted (such as study size, PICOS, follow-up period) and provide the citation.
Examples In text: “ Characteristics of included studies
All four studies finally selected for the review were randomised controlled trials published in English. The duration of the intervention was 24 months for the RIO-North America and 12 months for the RIO-Diabetes, RIO-Lipids and RIO-Europe study. Although the last two described a period of 24 months during which they were conducted, only the first 12-months results are provided. All trials had a run-in, as a single blind period before the randomisation.
Participants
The included studies involved 6625 participants. The main inclusion criteria entailed adults (18 years or older), with a body mass index greater than 27 kg/m 2 and less than 5 kg variation in body weight within the three months before study entry.
Intervention
All trials were multicentric. The RIO-North America was conducted in the USA and Canada, RIO-Europe in Europe and the USA, RIO-Diabetes in the USA and 10 other different countries not specified, and RIO-Lipids in eight unspecified different countries.
The intervention received was placebo, 5 mg of rimonabant or 20 mg of rimonabant once daily in addition to a mild hypocaloric diet (600 kcal/day deficit).
In all studies the primary outcome assessed was weight change from baseline after one year of treatment and the RIO-North America study also evaluated the prevention of weight regain between the first and second year. All studies evaluated adverse effects, including those of any kind and serious events. Quality of life was measured in only one study, but the results were not described (RIO-Europe).
Secondary and additional outcomes
These included prevalence of metabolic syndrome after one year and change in cardiometabolic risk factors such as blood pressure, lipid profile, etc.
No study included mortality and costs as outcome.
The timing of outcome measures was variable and could include monthly investigations, evaluations every three months or a single final evaluation after one year.” 134
In table: See table 2 ⇓ .
Example of summary of study characteristics: Summary of included studies evaluating the efficacy of antiemetic agents in acute gastroenteritis. Adapted from DeCamp et al 135
Explanation For readers to gauge the validity and applicability of a systematic review’s results, they need to know something about the included studies. Such information includes PICOS (box 2) and specific information relevant to the review question. For example, if the review is examining the long term effects of antidepressants for moderate depressive disorder, authors should report the follow-up periods of the included studies. For each included study, authors should provide a citation for the source of their information regardless of whether or not the study is published. This information makes it easier for interested readers to retrieve the relevant publications or documents.
Reporting study-level data also allows the comparison of the main characteristics of the studies included in the review. Authors should present enough detail to allow readers to make their own judgments about the relevance of included studies. Such information also makes it possible for readers to conduct their own subgroup analyses and interpret subgroups, based on study characteristics.
Authors should avoid, whenever possible, assuming information when it is missing from a study report (such as sample size, method of randomisation). Reviewers may contact the original investigators to try to obtain missing information or confirm the data extracted for the systematic review. If this information is not obtained, this should be noted in the report. If information is imputed, the reader should be told how this was done and for which items. Presenting study-level data makes it possible to clearly identify unpublished information obtained from the original researchers and make it available for the public record.
Typically, study-level characteristics are presented as a table as in the example (table 2 ⇑ ). Such presentation ensures that all pertinent items are addressed and that missing or unclear information is clearly indicated. Although paper based journals do not generally allow for the quantity of information available in electronic journals or Cochrane reviews, this should not be accepted as an excuse for omission of important aspects of the methods or results of included studies, since these can, if necessary, be shown on a website.
Following the presentation and description of each included study, as discussed above, reviewers usually provide a narrative summary of the studies. Such a summary provides readers with an overview of the included studies. It may, for example, address the languages of the published papers, years of publication, and geographic origins of the included studies.
The PICOS framework is often helpful in reporting the narrative summary indicating, for example, the clinical characteristics and disease severity of the participants and the main features of the intervention and of the comparison group. For non-pharmacological interventions, it may be helpful to specify for each study the key elements of the intervention received by each group. Full details of the interventions in included studies were reported in only three of 25 systematic reviews relevant to general practice. 84
Present data on risk of bias of each study and, if available, any outcome-level assessment (see item 12).
Example See table 3 ⇓ .
Example of assessment of the risk of bias: Quality measures of the randomised controlled trials that failed to fulfil any one of six markers of validity. Adapted from Devereaux et al 96
Explanation We recommend that reviewers assess the risk of bias in the included studies using a standard approach with defined criteria (see item 12). They should report the results of any such assessments. 89
Reporting only summary data (such as “two of eight trials adequately concealed allocation”) is inadequate because it fails to inform readers which studies had the particular methodological shortcoming. A more informative approach is to explicitly report the methodological features evaluated for each study. The Cochrane Collaboration’s new tool for assessing the risk of bias also requests that authors substantiate these assessments with any relevant text from the original studies. 11 It is often easiest to provide these data in a tabular format, as in the example. However, a narrative summary describing the tabular data can also be helpful for readers.
For all outcomes considered (benefits and harms), present, for each study, simple summary data for each intervention group and effect estimates and confidence intervals, ideally with a forest plot.
Examples See table 4 ⇓ and fig 3 ⇓ .
Fig 3 Example of summary results: Overall failure (defined as failure of assigned regimen or relapse) with tetracycline-rifampicin versus tetracycline-streptomycin. Adapted from Skalsky et al 137
Example of summary results: Heterotopic ossification in trials comparing radiotherapy to non-steroidal anti-inflammatory drugs after major hip procedures and fractures. Adapted from Pakos et al 136
Explanation Publication of summary data from individual studies allows the analyses to be reproduced and other analyses and graphical displays to be investigated. Others may wish to assess the impact of excluding particular studies or consider subgroup analyses not reported by the review authors. Displaying the results of each treatment group in included studies also enables inspection of individual study features. For example, if only odds ratios are provided, readers cannot assess the variation in event rates across the studies, making the odds ratio impossible to interpret. 138 Additionally, because data extraction errors in meta-analyses are common and can be large, 139 the presentation of the results from individual studies makes it easier to identify errors. For continuous outcomes, readers may wish to examine the consistency of standard deviations across studies, for example, to be reassured that standard deviation and standard error have not been confused. 138
For each study, the summary data for each intervention group are generally given for binary outcomes as frequencies with and without the event (or as proportions such as 12/45). It is not sufficient to report event rates per intervention group as percentages. The required summary data for continuous outcomes are the mean, standard deviation, and sample size for each group. In reviews that examine time-to-event data, the authors should report the log hazard ratio and its standard error (or confidence interval) for each included study. Sometimes, essential data are missing from the reports of the included studies and cannot be calculated from other data but may need to be imputed by the reviewers. For example, the standard deviation may be imputed using the typical standard deviations in the other trials 116 117 (see item 14). Whenever relevant, authors should indicate which results were not reported directly and had to be estimated from other information (see item 13). In addition, the inclusion of unpublished data should be noted.
For all included studies it is important to present the estimated effect with a confidence interval. This information may be incorporated in a table showing study characteristics or may be shown in a forest plot. 140 The key elements of the forest plot are the effect estimates and confidence intervals for each study shown graphically, but it is preferable also to include, for each study, the numerical group-specific summary data, the effect size and confidence interval, and the percentage weight (see second example, fig 3 ⇑ ). For discussion of the results of meta-analysis, see item 21.
In principle, all the above information should be provided for every outcome considered in the review, including both benefits and harms. When there are too many outcomes for full information to be included, results for the most important outcomes should be included in the main report with other information provided as a web appendix. The choice of the information to present should be justified in light of what was originally stated in the protocol. Authors should explicitly mention if the planned main outcomes cannot be presented due to lack of information. There is some evidence that information on harms is only rarely reported in systematic reviews, even when it is available in the original studies. 141 Selective omission of harms results biases a systematic review and decreases its ability to contribute to informed decision making.
Present the main results of the review. If meta-analyses are done, include for each, confidence intervals and measures of consistency.
Examples “Mortality data were available for all six trials, randomizing 311 patients and reporting data for 305 patients. There were no deaths reported in the three respiratory syncytial virus/severe bronchiolitis trials; thus our estimate is based on three trials randomizing 232 patients, 64 of whom died. In the pooled analysis, surfactant was associated with significantly lower mortality (relative risk =0.7, 95% confidence interval =0.4–0.97, P=0.04). There was no evidence of heterogeneity (I 2 =0%).” 142
“Because the study designs, participants, interventions, and reported outcome measures varied markedly, we focused on describing the studies, their results, their applicability, and their limitations and on qualitative synthesis rather than meta-analysis.” 143
“We detected significant heterogeneity within this comparison (I 2 =46.6%, χ 2 =13.11, df=7, P=0.07). Retrospective exploration of the heterogeneity identified one trial that seemed to differ from the others. It included only small ulcers (wound area less than 5 cm 2 ). Exclusion of this trial removed the statistical heterogeneity and did not affect the finding of no evidence of a difference in healing rate between hydrocolloids and simple low adherent dressings (relative risk=0.98, [95% confidence interval] 0.85 to 1.12, I 2 =0%).” 144
Explanation Results of systematic reviews should be presented in an orderly manner. Initial narrative descriptions of the evidence covered in the review (see item 18) may tell readers important things about the study populations and the design and conduct of studies. These descriptions can facilitate the examination of patterns across studies. They may also provide important information about applicability of evidence, suggest the likely effects of any major biases, and allow consideration, in a systematic manner, of multiple explanations for possible differences of findings across studies.
If authors have conducted one or more meta-analyses, they should present the results as an estimated effect across studies with a confidence interval. It is often simplest to show each meta-analysis summary with the actual results of included studies in a forest plot (see item 20). 140 It should always be clear which of the included studies contributed to each meta-analysis. Authors should also provide, for each meta-analysis, a measure of the consistency of the results from the included studies such as I 2 (heterogeneity, see box 6); a confidence interval may also be given for this measure. 145 If no meta-analysis was performed, the qualitative inferences should be presented as systematically as possible with an explanation of why meta-analysis was not done, as in the second example above. 143 Readers may find a forest plot, without a summary estimate, helpful in such cases.
Authors should in general report syntheses for all the outcome measures they set out to investigate (that is, those described in the protocol, see item 4) to allow readers to draw their own conclusions about the implications of the results. Readers should be made aware of any deviations from the planned analysis. Authors should tell readers if the planned meta-analysis was not thought appropriate or possible for some of the outcomes and the reasons for that decision.
It may not always be sensible to give meta-analysis results and forest plots for each outcome. If the review addresses a broad question, there may be a very large number of outcomes. Also, some outcomes may have been reported in only one or two studies, in which case forest plots are of little value and may be seriously biased.
Of 300 systematic reviews indexed in Medline in 2004, a little more than half (54%) included meta-analyses, of which the majority (91%) reported assessing for inconsistency in results.
Present results of any assessment of risk of bias across studies (see item 15).
Example “Strong evidence of heterogeneity (I 2 =79%, P <0.001) was observed. To explore this heterogeneity, a funnel plot was drawn. The funnel plot [fig 4 ⇓ ] shows evidence of considerable asymmetry.” 146
Fig 4 Example of a funnel plot showing evidence of considerable asymmetry. SE = standard error. Adapted from Appleton et al 146
“Specifically, four sertraline trials involving 486 participants and one citalopram trial involving 274 participants were reported as having failed to achieve a statistically significant drug effect, without reporting mean HRSD [Hamilton Rating Scale for Depression] scores. We were unable to find data from these trials on pharmaceutical company Web sites or through our search of the published literature. These omissions represent 38% of patients in sertraline trials and 23% of patients in citalopram trials. Analyses with and without inclusion of these trials found no differences in the patterns of results; similarly, the revealed patterns do not interact with drug type. The purpose of using the data obtained from the FDA was to avoid publication bias, by including unpublished as well as published trials. Inclusion of only those sertraline and citalopram trials for which means were reported to the FDA would constitute a form of reporting bias similar to publication bias and would lead to overestimation of drug–placebo differences for these drug types. Therefore, we present analyses only on data for medications for which complete clinical trials’ change was reported.” 147
Explanation Authors should present the results of any assessments of risk of bias across studies. If a funnel plot is reported, authors should specify the effect estimate and measure of precision used, presented typically on the x axis and y axis, respectively. Authors should describe if and how they have tested the statistical significance of any possible asymmetry (see item 15). Results of any investigations of selective reporting of outcomes within studies (as discussed in item 15) should also be reported. Also, we advise authors to tell readers if any pre-specified analyses for assessing risk of bias across studies were not completed and the reasons (such as too few included studies).
Give results of additional analyses, if done (such as sensitivity or subgroup analyses, meta-regression [see item 16]).
Example “...benefits of chondroitin were smaller in trials with adequate concealment of allocation compared with trials with unclear concealment (P for interaction =0.050), in trials with an intention-to-treat analysis compared with those that had excluded patients from the analysis (P for interaction =0.017), and in large compared with small trials (P for interaction =0.022).” 148
“Subgroup analyses according to antibody status, antiviral medications, organ transplanted, treatment duration, use of antilymphocyte therapy, time to outcome assessment, study quality and other aspects of study design did not demonstrate any differences in treatment effects. Multivariate meta-regression showed no significant difference in CMV [cytomegalovirus] disease after allowing for potential confounding or effect-modification by prophylactic drug used, organ transplanted or recipient serostatus in CMV positive recipients and CMV negative recipients of CMV positive donors.” 149
Explanation Authors should report any subgroup or sensitivity analyses and whether they were pre-specified (see items 5 and 16). For analyses comparing subgroups of studies (such as separating studies of low and high dose aspirin), the authors should report any tests for interactions, as well as estimates and confidence intervals from meta-analyses within each subgroup. Similarly, meta-regression results (see item 16) should not be limited to P values but should include effect sizes and confidence intervals, 150 as the first example reported above does in a table. The amount of data included in each additional analysis should be specified if different from that considered in the main analyses. This information is especially relevant for sensitivity analyses that exclude some studies; for example, those with high risk of bias.
Importantly, all additional analyses conducted should be reported, not just those that were statistically significant. This information will help avoid selective outcome reporting bias within the review as has been demonstrated in reports of randomised controlled trials. 42 44 121 151 152 Results from exploratory subgroup or sensitivity analyses should be interpreted cautiously, bearing in mind the potential for multiple analyses to mislead.
Summarise the main findings, including the strength of evidence for each main outcome; consider their relevance to key groups (such as healthcare providers, users, and policy makers).
Example “Overall, the evidence is not sufficiently robust to determine the comparative effectiveness of angioplasty (with or without stenting) and medical treatment alone. Only 2 randomized trials with long-term outcomes and a third randomized trial that allowed substantial crossover of treatment after 3 months directly compared angioplasty and medical treatment…the randomized trials did not evaluate enough patients or did not follow patients for a sufficient duration to allow definitive conclusions to be made about clinical outcomes, such as mortality and cardiovascular or kidney failure events.
Some acceptable evidence from comparison of medical treatment and angioplasty suggested no difference in long-term kidney function but possibly better blood pressure control after angioplasty, an effect that may be limited to patients with bilateral atherosclerotic renal artery stenosis. The evidence regarding other outcomes is weak. Because the reviewed studies did not explicitly address patients with rapid clinical deterioration who may need acute intervention, our conclusions do not apply to this important subset of patients.” 143
Explanation Authors should give a brief and balanced summary of the nature and findings of the review. Sometimes, outcomes for which little or no data were found should be noted due to potential relevance for policy decisions and future research. Applicability of the review’s findings—to different patients, settings, or target audiences, for example—should be mentioned. Although there is no standard way to assess applicability simultaneously to different audiences, some systems do exist. 153 Sometimes, authors formally rate or assess the overall body of evidence addressed in the review and can present the strength of their summary recommendations tied to their assessments of the quality of evidence (such as the GRADE system). 10
Authors need to keep in mind that statistical significance of the effects does not always suggest clinical or policy relevance. Likewise, a non-significant result does not demonstrate that a treatment is ineffective. Authors should ideally clarify trade-offs and how the values attached to the main outcomes would lead different people to make different decisions. In addition, adroit authors consider factors that are important in translating the evidence to different settings and that may modify the estimates of effects reported in the review. 153 Patients and healthcare providers may be primarily interested in which intervention is most likely to provide a benefit with acceptable harms, while policy makers and administrators may value data on organisational impact and resource utilisation.
Discuss limitations at study and outcome level (such as risk of bias), and at review level (such as incomplete retrieval of identified research, reporting bias).
Examples Outcome level: “The meta-analysis reported here combines data across studies in order to estimate treatment effects with more precision than is possible in a single study. The main limitation of this meta-analysis, as with any overview, is that the patient population, the antibiotic regimen and the outcome definitions are not the same across studies.” 154
Study and review level: “Our study has several limitations. The quality of the studies varied. Randomization was adequate in all trials; however, 7 of the articles did not explicitly state that analysis of data adhered to the intention-to-treat principle, which could lead to overestimation of treatment effect in these trials, and we could not assess the quality of 4 of the 5 trials reported as abstracts. Analyses did not identify an association between components of quality and re-bleeding risk, and the effect size in favour of combination therapy remained statistically significant when we excluded trials that were reported as abstracts.
Publication bias might account for some of the effect we observed. Smaller trials are, in general, analyzed with less methodological rigor than larger studies, and an asymmetrical funnel plot suggests that selective reporting may have led to an overestimation of effect sizes in small trials.” 155
Explanation A discussion of limitations should address the validity (that is, risk of bias) and reporting (informativeness) of the included studies, limitations of the review process, and generalisability (applicability) of the review. Readers may find it helpful if authors discuss whether studies were threatened by serious risks of bias, whether the estimates of the effect of the intervention are too imprecise, or if there were missing data for many participants or important outcomes.
Limitations of the review process might include limitations of the search (such as restricting to English-language publications), and any difficulties in the study selection, appraisal, and meta-analysis processes. For example, poor or incomplete reporting of study designs, patient populations, and interventions may hamper interpretation and synthesis of the included studies. 84 Applicability of the review may be affected if there are limited data for certain populations or subgroups where the intervention might perform differently or few studies assessing the most important outcomes of interest; or if there is a substantial amount of data relating to an outdated intervention or comparator or heavy reliance on imputation of missing values for summary estimates (item 14).
Provide a general interpretation of the results in the context of other evidence, and implications for future research.
Example Implications for practice: “Between 1995 and 1997 five different meta-analyses of the effect of antibiotic prophylaxis on infection and mortality were published. All confirmed a significant reduction in infections, though the magnitude of the effect varied from one review to another. The estimated impact on overall mortality was less evident and has generated considerable controversy on the cost effectiveness of the treatment. Only one among the five available reviews, however, suggested that a weak association between respiratory tract infections and mortality exists and lack of sufficient statistical power may have accounted for the limited effect on mortality.”
Implications for research: “A logical next step for future trials would thus be the comparison of this protocol against a regimen of a systemic antibiotic agent only to see whether the topical component can be dropped. We have already identified six such trials but the total number of patients so far enrolled (n=1056) is too small for us to be confident that the two treatments are really equally effective. If the hypothesis is therefore considered worth testing more and larger randomised controlled trials are warranted. Trials of this kind, however, would not resolve the relevant issue of treatment induced resistance. To produce a satisfactory answer to this, studies with a different design would be necessary. Though a detailed discussion goes beyond the scope of this paper, studies in which the intensive care unit rather than the individual patient is the unit of randomisation and in which the occurrence of antibiotic resistance is monitored over a long period of time should be undertaken.” 156
Explanation Systematic reviewers sometimes draw conclusions that are too optimistic 157 or do not consider the harms equally as carefully as the benefits, although some evidence suggests these problems are decreasing. 158 If conclusions cannot be drawn because there are too few reliable studies, or too much uncertainty, this should be stated. Such a finding can be as important as finding consistent effects from several large studies.
Authors should try to relate the results of the review to other evidence, as this helps readers to better interpret the results. For example, there may be other systematic reviews about the same general topic that have used different methods or have addressed related but slightly different questions. 159 160 Similarly, there may be additional information relevant to decision makers, such as the cost-effectiveness of the intervention (such as health technology assessment). Authors may discuss the results of their review in the context of existing evidence regarding other interventions.
We advise authors also to make explicit recommendations for future research. In a sample of 2535 Cochrane reviews, 82% included recommendations for research with specific interventions, 30% suggested the appropriate type of participants, and 52% suggested outcome measures for future research. 161 There is no corresponding assessment about systematic reviews published in medical journals, but we believe that such recommendations are much less common in those reviews.
Clinical research should not be planned without a thorough knowledge of similar, existing research. 162 There is evidence that this still does not occur as it should and that authors of primary studies do not consider a systematic review when they design their studies. 163 We believe systematic reviews have great potential for guiding future clinical research.
Describe sources of funding or other support (such as supply of data) for the systematic review, and the role of funders for the systematic review.
Examples “The evidence synthesis upon which this article was based was funded by the Centers for Disease Control and Prevention for the Agency for Healthcare Research and Quality and the U.S. Prevention Services Task Force.” 164
“Role of funding source: The funders played no role in study design, collection, analysis, interpretation of data, writing of the report, or in the decision to submit the paper for publication. They accept no responsibility for the contents.” 165
Explanation Authors of systematic reviews, like those of any other research study, should disclose any funding they received to carry out the review, or state if the review was not funded. Lexchin and colleagues 166 observed that outcomes of reports of randomised trials and meta-analyses of clinical trials funded by the pharmaceutical industry are more likely to favor the sponsor’s product compared with studies with other sources of funding. Similar results have been reported elsewhere. 167 168 Analogous data suggest that similar biases may affect the conclusions of systematic reviews. 169
Given the potential role of systematic reviews in decision making, we believe authors should be transparent about the funding and the role of funders, if any. Sometimes the funders will provide services, such as those of a librarian to complete the searches for relevant literature or access to commercial databases not available to the reviewers. Any level of funding or services provided to the systematic review team should be reported. Authors should also report whether the funder had any role in the conduct or report of the review. Beyond funding issues, authors should report any real or perceived conflicts of interest related to their role or the role of the funder in the reporting of the systematic review. 170
In a survey of 300 systematic reviews published in November 2004, funding sources were not reported in 41% of the reviews. 3 Only a minority of reviews (2%) reported being funded by for-profit sources, but the true proportion may be higher. 171
The PRISMA statement and this document have focused on systematic reviews of reports of randomised trials. Other study designs, including non-randomised studies, quasi-experimental studies, and interrupted time series, are included in some systematic reviews that evaluate the effects of healthcare interventions. 172 173 The methods of these reviews may differ to varying degrees from the typical intervention review, for example regarding the literature search, data abstraction, assessment of risk of bias, and analysis methods. As such, their reporting demands might also differ from what we have described here. A useful principle is for systematic review authors to ensure that their methods are reported with adequate clarity and transparency to enable readers to critically judge the available evidence and replicate or update the research.
In some systematic reviews, the authors will seek the raw data from the original researchers to calculate the summary statistics. These systematic reviews are called individual patient (or participant) data reviews. 40 41 Individual patient data meta-analyses may also be conducted with prospective accumulation of data rather than retrospective accumulation of existing data. Here too, extra information about the methods will need to be reported.
Other types of systematic reviews exist. Realist reviews aim to determine how complex programmes work in specific contexts and settings. 174 Meta-narrative reviews aim to explain complex bodies of evidence through mapping and comparing different overarching storylines. 175 Network meta-analyses, also known as multiple treatments meta-analyses, can be used to analyse data from comparisons of many different treatments. 176 177 They use both direct and indirect comparisons and can be used to compare interventions that have not been directly compared.
We believe that the issues we have highlighted in this paper are relevant to ensure transparency and understanding of the processes adopted and the limitations of the information presented in systematic reviews of different types. We hope that PRISMA can be the basis for more detailed guidance on systematic reviews of other types of research, including diagnostic accuracy and epidemiological studies.
We developed the PRISMA statement using an approach for developing reporting guidelines that has evolved over several years. 178 The overall aim of PRISMA is to help ensure the clarity and transparency of reporting of systematic reviews, and recent data indicate that this reporting guidance is much needed. 3 PRISMA is not intended to be a quality assessment tool and it should not be used as such.
This PRISMA explanation and elaboration document was developed to facilitate the understanding, uptake, and dissemination of the PRISMA statement and hopefully provide a pedagogical framework for those interested in conducting and reporting systematic reviews. It follows a format similar to that used in other explanatory documents. 17 18 19 Following the recommendations in the PRISMA checklist may increase the word count of a systematic review report. We believe, however, that the benefit of readers being able to critically appraise a clear, complete, and transparent systematic review report outweighs the possible slight increase in the length of the report.
While the aims of PRISMA are to reduce the risk of flawed reporting of systematic reviews and improve the clarity and transparency in how reviews are conducted, we have little data to state more definitively whether this “intervention” will achieve its intended goal. A previous effort to evaluate QUOROM was not successfully completed. 178 Publication of the QUOROM statement was delayed for two years while a research team attempted to evaluate its effectiveness by conducting a randomised controlled trial with the participation of eight major medical journals. Unfortunately that trial was not completed due to accrual problems (David Moher, personal communication). Other evaluation methods might be easier to conduct. At least one survey of 139 published systematic reviews in the critical care literature 179 suggests that their quality improved after the publication of QUOROM.
If the PRISMA statement is endorsed by and adhered to in journals, as other reporting guidelines have been, 17 18 19 180 there should be evidence of improved reporting of systematic reviews. For example, there have been several evaluations of whether the use of CONSORT improves reports of randomised controlled trials. A systematic review of these studies 181 indicates that use of CONSORT is associated with improved reporting of certain items, such as allocation concealment. We aim to evaluate the benefits (that is, improved reporting) and possible adverse effects (such as increased word length) of PRISMA and we encourage others to consider doing likewise.
Even though we did not carry out a systematic literature search to produce our checklist, and this is indeed a limitation of our effort, PRISMA was developed using an evidence based approach whenever possible. Checklist items were included if there was evidence that not reporting the item was associated with increased risk of bias, or where it was clear that information was necessary to appraise the reliability of a review. To keep PRISMA up to date and as evidence based as possible requires regular vigilance of the literature, which is growing rapidly. Currently the Cochrane Methodology Register has more than 11 000 records pertaining to the conduct and reporting of systematic reviews and other evaluations of health and social care. For some checklist items, such as reporting the abstract (item 2), we have used evidence from elsewhere in the belief that the issue applies equally well to reporting of systematic reviews. Yet for other items, evidence does not exist; for example, whether a training exercise improves the accuracy and reliability of data extraction. We hope PRISMA will act as a catalyst to help generate further evidence that can be considered when further revising the checklist in the future.
More than 10 years have passed between the development of the QUOROM statement and its update, the PRISMA statement. We aim to update PRISMA more frequently. We hope that the implementation of PRISMA will be better than it has been for QUOROM. There are at least two reasons to be optimistic. First, systematic reviews are increasingly used by healthcare providers to inform “best practice” patient care. Policy analysts and managers are using systematic reviews to inform healthcare decision making and to better target future research. Second, we anticipate benefits from the development of the EQUATOR Network, described below.
Developing any reporting guideline requires considerable effort, experience, and expertise. While reporting guidelines have been successful for some individual efforts, 17 18 19 there are likely others who want to develop reporting guidelines who possess little time, experience, or knowledge as to how to do so appropriately. The EQUATOR (enhancing the quality and transparency of health research) Network aims to help such individuals and groups by serving as a global resource for anybody interested in developing reporting guidelines, regardless of the focus. 7 180 182 The overall goal of EQUATOR is to improve the quality of reporting of all health science research through the development and translation of reporting guidelines. Beyond this aim, the network plans to develop a large web presence by developing and maintaining a resource centre of reporting tools, and other information for reporting research ( www.equator-network.org/ ).
We encourage healthcare journals and editorial groups, such as the World Association of Medical Editors and the International Committee of Medical Journal Editors, to endorse PRISMA in much the same way as they have endorsed other reporting guidelines, such as CONSORT. We also encourage editors of healthcare journals to support PRISMA by updating their “instructions to authors” and including the PRISMA web address, and by raising awareness through specific editorial actions.
The terminology used to describe systematic reviews and meta-analyses has evolved over time and varies between fields. Different terms have been used by different groups, such as educators and psychologists. The conduct of a systematic review comprises several explicit and reproducible steps, such as identifying all likely relevant records, selecting eligible studies, assessing the risk of bias, extracting data, qualitative synthesis of the included studies, and possibly meta-analyses.
Initially this entire process was termed a meta-analysis and was so defined in the QUOROM statement. 8 More recently, especially in healthcare research, there has been a trend towards preferring the term systematic review. If quantitative synthesis is performed, this last stage alone is referred to as a meta-analysis. The Cochrane Collaboration uses this terminology, 9 under which a meta-analysis, if performed, is a component of a systematic review. Regardless of the question addressed and the complexities involved, it is always possible to complete a systematic review of existing data, but not always possible or desirable, to quantitatively synthesise results because of clinical, methodological, or statistical differences across the included studies. Conversely, with prospective accumulation of studies and datasets where the plan is eventually to combine them, the term “(prospective) meta-analysis” may make more sense than “systematic review.”
For retrospective efforts, one possibility is to use the term systematic review for the whole process up to the point when one decides whether to perform a quantitative synthesis. If a quantitative synthesis is performed, some researchers refer to this as a meta-analysis. This definition is similar to that found in the current edition of the Dictionary of Epidemiology . 183
While we recognise that the use of these terms is inconsistent and there is residual disagreement among the members of the panel working on PRISMA, we have adopted the definitions used by the Cochrane Collaboration. 9
Systematic review A systematic review attempts to collate all empirical evidence that fits pre-specified eligibility criteria to answer a specific research question. It uses explicit, systematic methods that are selected with a view to minimising bias, thus providing reliable findings from which conclusions can be drawn and decisions made. 184 185 The key characteristics of a systematic review are ( a ) a clearly stated set of objectives with an explicit, reproducible methodology; ( b ) a systematic search that attempts to identify all studies that would meet the eligibility criteria; ( c ) an assessment of the validity of the findings of the included studies, such as through the assessment of risk of bias; and ( d ) systematic presentation and synthesis of the characteristics and findings of the included studies.
Meta-analysis Meta-analysis is the use of statistical techniques to integrate and summarise the results of included studies. Many systematic reviews contain meta-analyses, but not all. By combining information from all relevant studies, meta-analyses can provide more precise estimates of the effects of health care than those derived from the individual studies included within a review.
Formulating relevant and precise questions that can be answered in a systematic review can be complex and time consuming. A structured approach for framing questions that uses five components may help facilitate the process. This approach is commonly known by the acronym “PICOS” where each letter refers to a component: the patient population or the disease being addressed (P), the interventions or exposure (I), the comparator group (C), the outcome or endpoint (O), and the study design chosen (S). 186 Issues relating to PICOS affect several PRISMA items (items 6, 8, 9, 10, 11, and 18).
P— Providing information about the population requires a precise definition of a group of participants (often patients), such as men over the age of 65 years, their defining characteristics of interest (often disease), and possibly the setting of care considered, such as an acute care hospital.
I— The interventions (exposures) under consideration in the systematic review need to be transparently reported. For example, if the reviewers answer a question regarding the association between a woman’s prenatal exposure to folic acid and subsequent offspring’s neural tube defects, reporting the dose, frequency, and duration of folic acid used in different studies is likely to be important for readers to interpret the review’s results and conclusions. Other interventions (exposures) might include diagnostic, preventive, or therapeutic treatments; arrangements of specific processes of care; lifestyle changes; psychosocial or educational interventions; or risk factors.
C— Clearly reporting the comparator (control) group intervention(s)—such as usual care, drug, or placebo—is essential for readers to fully understand the selection criteria of primary studies included in the systematic review, and might be a source of heterogeneity investigators have to deal with. Comparators are often poorly described. Clearly reporting what the intervention is compared with is important and may sometimes have implications for the inclusion of studies in a review—many reviews compare with “standard care,” which is otherwise undefined; this should be properly addressed by authors.
O— The outcomes of the intervention being assessed—such as mortality, morbidity, symptoms, or quality of life improvements—should be clearly specified as they are required to interpret the validity and generalisability of the systematic review’s results.
S— Finally, the type of study design(s) included in the review should be reported. Some reviews include only reports of randomised trials, whereas others have broader design criteria and include randomised trials and certain types of observational studies. Still other reviews, such as those specifically answering questions related to harms, may include a wide variety of designs ranging from cohort studies to case reports. Whatever study designs are included in the review, these should be reported.
Independently from how difficult it is to identify the components of the research question, the important point is that a structured approach is preferable, and this extends beyond systematic reviews of effectiveness. Ideally the PICOS criteria should be formulated a priori, in the systematic review’s protocol, although some revisions might be required because of the iterative nature of the review process. Authors are encouraged to report their PICOS criteria and whether any modifications were made during the review process. A useful example in this realm is the appendix of the “systematic reviews of water fluoridation” undertaken by the Centre for Reviews and Dissemination. 187
Comprehensive searches usually result in a large number of identified records, a much smaller number of studies included in the systematic review, and even fewer of these studies included in any meta-analyses. Reports of systematic reviews often provide little detail as to the methods used by the review team in this process. Readers are often left with what can be described as the “X-files” phenomenon, as it is unclear what occurs between the initial set of identified records and those finally included in the review.
Sometimes, review authors simply report the number of included studies; more often they report the initial number of identified records and the number of included studies. Rarely, although this is optimal for readers, do review authors report the number of identified records, the smaller number of potentially relevant studies, and the even smaller number of included studies, by outcome. Review authors also need to differentiate between the number of reports and studies. Often there will not be a 1:1 ratio of reports to studies and this information needs to be described in the systematic review report.
Ideally, the identification of study reports should be reported as text in combination with use of the PRISMA flow diagram. While we recommend use of the flow diagram, a small number of reviews might be particularly simple and can be sufficiently described with a few brief sentences of text. More generally, review authors will need to report the process used for each step: screening the identified records; examining the full text of potentially relevant studies (and reporting the number that could not be obtained); and applying eligibility criteria to select the included studies.
Such descriptions should also detail how potentially eligible records were promoted to the next stage of the review (such as full text screening) and to the final stage of this process, the included studies. Often review teams have three response options for excluding records or promoting them to the next stage of the winnowing process: “yes,” “no,” and “maybe.”
Similarly, some detail should be reported on who participated and how such processes were completed. For example, a single person may screen the identified records while a second person independently examines a small sample of them. The entire winnowing process is one of “good bookkeeping” whereby interested readers should be able to work backwards from the included studies to come up with the same numbers of identified records.
There is often a paucity of information describing the data extraction processes in reports of systematic reviews. Authors may simply report that “relevant” data were extracted from each included study with little information about the processes used for data extraction. It may be useful for readers to know whether a systematic review’s authors developed, a priori or not, a data extraction form, whether multiple forms were used, the number of questions, whether the form was pilot tested, and who completed the extraction. For example, it is important for readers to know whether one or more people extracted data, and if so, whether this was completed independently, whether “consensus” data were used in the analyses, and if the review team completed an informal training exercise or a more formal reliability exercise.
In this paper, and elsewhere, 11 we sought to use a new term for many readers, namely, risk of bias, for evaluating each included study in a systematic review. Previous papers 89 188 tended to use the term “quality.” When carrying out a systematic review we believe it is important to distinguish between quality and risk of bias and to focus on evaluating and reporting the latter. Quality is often the best the authors have been able to do. For example, authors may report the results of surgical trials in which blinding of the outcome assessors was not part of the trial’s conduct. Even though this may have been the best methodology the researchers were able to do, there are still theoretical grounds for believing that the study was susceptible to (risk of) bias.
Assessing the risk of bias should be part of the conduct and reporting of any systematic review. In all situations, we encourage systematic reviewers to think ahead carefully about what risks of bias (methodological and clinical) may have a bearing on the results of their systematic reviews.
For systematic reviewers, understanding the risk of bias on the results of studies is often difficult, because the report is only a surrogate of the actual conduct of the study. There is some suggestion 189 190 that the report may not be a reasonable facsimile of the study, although this view is not shared by all. 88 191 There are three main ways to assess risk of bias—individual components, checklists, and scales. There are a great many scales available, 192 although we caution against their use based on theoretical grounds 193 and emerging empirical evidence. 194 Checklists are less frequently used and potentially have the same problems as scales. We advocate using a component approach and one that is based on domains for which there is good empirical evidence and perhaps strong clinical grounds. The new Cochrane risk of bias tool 11 is one such component approach.
The Cochrane risk of bias tool consists of five items for which there is empirical evidence for their biasing influence on the estimates of an intervention’s effectiveness in randomised trials (sequence generation, allocation concealment, blinding, incomplete outcome data, and selective outcome reporting) and a catch-all item called “other sources of bias”. 11 There is also some consensus that these items can be applied for evaluation of studies across diverse clinical areas. 93 Other risk of bias items may be topic or even study specific—that is, they may stem from some peculiarity of the research topic or some special feature of the design of a specific study. These peculiarities need to be investigated on a case-by-case basis, based on clinical and methodological acumen, and there can be no general recipe. In all situations, systematic reviewers need to think ahead carefully about what aspects of study quality may have a bearing on the results.
Deciding whether to combine data involves statistical, clinical, and methodological considerations. The statistical decisions are perhaps the most technical and evidence-based. These are more thoroughly discussed in box 6. The clinical and methodological decisions are generally based on discussions within the review team and may be more subjective.
Clinical considerations will be influenced by the question the review is attempting to address. Broad questions might provide more “license” to combine more disparate studies, such as whether “Ritalin is effective in increasing focused attention in people diagnosed with attention deficit hyperactivity disorder (ADHD).” Here authors might elect to combine reports of studies involving children and adults. If the clinical question is more focused, such as whether “Ritalin is effective in increasing classroom attention in previously undiagnosed ADHD children who have no comorbid conditions,” it is likely that different decisions regarding synthesis of studies are taken by authors. In any case authors should describe their clinical decisions in the systematic review report.
Deciding whether to combine data also has a methodological component. Reviewers may decide not to combine studies of low risk of bias with those of high risk of bias (see items 12 and 19). For example, for subjective outcomes, systematic review authors may not wish to combine assessments that were completed under blind conditions with those that were not.
For any particular question there may not be a “right” or “wrong” choice concerning synthesis, as such decisions are likely complex. However, as the choice may be subjective, authors should be transparent as to their key decisions and describe them for readers.
Meta-analysis: statistical combination of the results of multiple studies.
If it is felt that studies should have their results combined statistically, other issues must be considered because there are many ways to conduct a meta-analysis. Different effect measures can be used for both binary and continuous outcomes (see item 13). Also, there are two commonly used statistical models for combining data in a meta-analysis. 195 The fixed-effect model assumes that there is a common treatment effect for all included studies; 196 it is assumed that the observed differences in results across studies reflect random variation. 196 The random-effects model assumes that there is no common treatment effect for all included studies but rather that the variation of the effects across studies follows a particular distribution. 197 In a random-effects model it is believed that the included studies represent a random sample from a larger population of studies addressing the question of interest. 198
There is no consensus about whether to use fixed- or random-effects models, and both are in wide use. The following differences have influenced some researchers regarding their choice between them. The random-effects model gives more weight to the results of smaller trials than does the fixed-effect analysis, which may be undesirable as small trials may be inferior and most prone to publication bias. The fixed-effect model considers only within-study variability, whereas the random-effects model considers both within- and between-study variability. This is why a fixed-effect analysis tends to give narrower confidence intervals (that is, provides greater precision) than a random-effects analysis. 110 196 199 In the absence of any between-study heterogeneity, the fixed- and random-effects estimates will coincide.
In addition, there are different methods for performing both types of meta-analysis. 200 Common fixed-effect approaches are Mantel-Haenszel and inverse variance, whereas random-effects analyses usually use the DerSimonian and Laird approach, although other methods exist, including Bayesian meta-analysis. 201
In the presence of demonstrable between-study heterogeneity (see below), some consider that the use of a fixed-effect analysis is counterintuitive because their main assumption is violated. Others argue that it is inappropriate to conduct any meta-analysis when there is unexplained variability across trial results. If the reviewers decide not to combine the data quantitatively, a danger is that eventually they may end up using quasi-quantitative rules of poor validity (such as vote counting of how many studies have nominally significant results) for interpreting the evidence. Statistical methods to combine data exist for almost any complex situation that may arise in a systematic review, but one has to be aware of their assumptions and limitations to avoid misapplying or misinterpreting these methods.
We expect some variation (inconsistency) in the results of different studies due to chance alone. Variability in excess of that due to chance reflects true differences in the results of the trials, and is called “heterogeneity.” The conventional statistical approach to evaluating heterogeneity is a χ 2 test (Cochran’s Q), but it has low power when there are few studies and excessive power when there are many studies. 202 By contrast, the I 2 statistic quantifies the amount of variation in results across studies beyond that expected by chance and so is preferable to Q. 202 203 I 2 represents the percentage of the total variation in estimated effects across studies that is due to heterogeneity rather than to chance; some authors consider an I 2 value less than 25% as low. 202 However, I 2 also suffers from large uncertainty in the common situation where only a few studies are available, 204 and reporting the uncertainty in I 2 (such as 95% confidence interval) may be helpful. 145 When there are few studies, inferences about heterogeneity should be cautious.
When considerable heterogeneity is observed, it is advisable to consider possible reasons. 205 In particular, the heterogeneity may be due to differences between subgroups of studies (see item 16). Also, data extraction errors are a common cause of substantial heterogeneity in results with continuous outcomes. 139
Systematic reviews aim to incorporate information from all relevant studies. The absence of information from some studies may pose a serious threat to the validity of a review. Data may be incomplete because some studies were not published, or because of incomplete or inadequate reporting within a published article. These problems are often summarised as “publication bias,” although the bias arises from non-publication of full studies and selective publication of results in relation to their findings. Non-publication of research findings dependent on the actual results is an important risk of bias to a systematic review and meta-analysis.
Several empirical investigations have shown that the findings from clinical trials are more likely to be published if the results are statistically significant (P<0.05) than if they are not. 125 206 207 For example, of 500 oncology trials with more than 200 participants for which preliminary results were presented at a conference of the American Society of Clinical Oncology, 81% with P<0.05 were published in full within five years compared with only 68% of those with P>0.05. 208
Also, among published studies, those with statistically significant results are published sooner than those with non-significant findings. 209 When some studies are missing for these reasons, the available results will be biased towards exaggerating the effect of an intervention.
In many systematic reviews only some of the eligible studies (often a minority) can be included in a meta-analysis for a specific outcome. For some studies, the outcome may not be measured or may be measured but not reported. The former will not lead to bias, but the latter could.
Evidence is accumulating that selective reporting bias is widespread and of considerable importance. 42 43 In addition, data for a given outcome may be analysed in multiple ways and the choice of presentation influenced by the results obtained. In a study of 102 randomised trials, comparison of published reports with trial protocols showed that a median of 38% efficacy and 50% safety outcomes per trial, respectively, were not available for meta-analysis. Statistically significant outcomes had higher odds of being fully reported in publications when compared with non-significant outcomes for both efficacy (pooled odds ratio 2.4 (95% confidence interval 1.4 to 4.0)) and safety (4.7 (1.8 to 12)) data. Several other studies have had similar findings. 210 211
Missing studies may increasingly be identified from trials registries. Evidence of missing outcomes may come from comparison with the study protocol, if available, or by careful examination of published articles. 11 Study publication bias and selective outcome reporting are difficult to exclude or verify from the available results, especially when few studies are available.
If the available data are affected by either (or both) of the above biases, smaller studies would tend to show larger estimates of the effects of the intervention. Thus one possibility is to investigate the relation between effect size and sample size (or more specifically, precision of the effect estimate). Graphical methods, especially the funnel plot, 212 and analytic methods (such as Egger’s test) are often used, 213 214 215 although their interpretation can be problematic. 216 217 Strictly speaking, such analyses investigate “small study bias”; there may be many reasons why smaller studies have systematically different effect sizes than larger studies, of which reporting bias is just one. 218 Several alternative tests for bias have also been proposed, beyond the ones testing small study bias, 215 219 220 but none can be considered a gold standard. Although evidence that smaller studies had larger estimated effects than large ones may suggest the possibility that the available evidence is biased, misinterpretation of such data is common. 123
Cite this as: BMJ 2009;339:b2700
The following people contributed to this paper: Doug Altman, Centre for Statistics in Medicine (Oxford, UK); Gerd Antes, University Hospital Freiburg (Freiburg, Germany); David Atkins, Health Services Research and Development Service, Veterans Health Administration (Washington DC, USA); Virginia Barbour, PLoS Medicine (Cambridge, UK); Nick Barrowman, Children’s Hospital of Eastern Ontario (Ottawa, Canada); Jesse A Berlin, Johnson & Johnson Pharmaceutical Research and Development (Titusville NJ, USA); Jocalyn Clark, PLoS Medicine (at the time of writing, BMJ , London); Mike Clarke, UK Cochrane Centre (Oxford, UK) and School of Nursing and Midwifery, Trinity College (Dublin, Ireland); Deborah Cook, Departments of Medicine, Clinical Epidemiology and Biostatistics, McMaster University (Hamilton, Canada); Roberto D’Amico, Università di Modena e Reggio Emilia (Modena, Italy) and Centro Cochrane Italiano, Istituto Ricerche Farmacologiche Mario Negri (Milan, Italy); Jonathan J Deeks, University of Birmingham (Birmingham); P J Devereaux, Departments of Medicine, Clinical Epidemiology and Biostatistics, McMaster University (Hamilton, Canada); Kay Dickersin, Johns Hopkins Bloomberg School of Public Health (Baltimore MD, USA); Matthias Egger, Department of Social and Preventive Medicine, University of Bern (Bern, Switzerland); Edzard Ernst, Peninsula Medical School (Exeter, UK); Peter C Gøtzsche, Nordic Cochrane Centre (Copenhagen, Denmark); Jeremy Grimshaw, Ottawa Hospital Research Institute (Ottawa, Canada); Gordon Guyatt, Departments of Medicine, Clinical Epidemiology and Biostatistics, McMaster University; Julian Higgins, MRC Biostatistics Unit (Cambridge, UK); John P A Ioannidis, University of Ioannina Campus (Ioannina, Greece); Jos Kleijnen, Kleijnen Systematic Reviews (York, UK) and School for Public Health and Primary Care (CAPHRI), University of Maastricht (Maastricht, Netherlands); Tom Lang, Tom Lang Communications and Training (Davis CA, USA); Alessandro Liberati, Università di Modena e Reggio Emilia (Modena, Italy) and Centro Cochrane Italiano, Istituto Ricerche Farmacologiche Mario Negri (Milan, Italy); Nicola Magrini, NHS Centre for the Evaluation of the Effectiveness of Health Care—CeVEAS (Modena, Italy); David McNamee, Lancet (London, UK); David Moher, Ottawa Methods Centre, Ottawa Hospital Research Institute (Ottawa, Canada); Lorenzo Moja, Centro Cochrane Italiano, Istituto Ricerche Farmacologiche Mario Negri; Maryann Napoli, Center for Medical Consumers (New York, USA); Cynthia Mulrow, Annals of Internal Medicine (Philadelphia, Pennsylvania, US); Andy Oxman, Norwegian Health Services Research Centre (Oslo, Norway); Ba’ Pham, Toronto Health Economics and Technology Assessment Collaborative (Toronto, Canada) (at the time of first meeting of the group, GlaxoSmithKline Canada, Mississauga, Canada); Drummond Rennie, University of California San Francisco (San Francisco CA, USA); Margaret Sampson, Children’s Hospital of Eastern Ontario (Ottawa, Canada); Kenneth F Schulz, Family Health International (Durham NC, USA); Paul G Shekelle, Southern California Evidence Based Practice Center (Santa Monica CA, USA); Jennifer Tetzlaff, Ottawa Methods Centre, Ottawa Hospital Research Institute (Ottawa, Canada); David Tovey, Cochrane Library , Cochrane Collaboration (Oxford, UK) (at the time of first meeting of the group, BMJ , London); Peter Tugwell, Institute of Population Health, University of Ottawa (Ottawa, Canada).
Lorenzo Moja helped with the preparation and the several updates of the manuscript and assisted with the preparation of the reference list. AL is the guarantor of the manuscript.
Competing interests: None declared.
Provenance and peer review: Not commissioned; externally peer reviewed.
In order to encourage dissemination of the PRISMA statement, this article is freely accessible on bmj.com and will also be published in PLoS Medicine , Annals of Internal Medicine , Journal of Clinical Epidemiology , and Open Medicine . The authors jointly hold the copyright of this article. For details on further use, see the PRISMA website ( www.prisma-statement.org/ ).
This is an open-access article distributed under the terms of the Creative Commons Attribution Non-commercial License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
Critical Care volume 28 , Article number: 214 ( 2024 ) Cite this article
2826 Accesses
60 Altmetric
Metrics details
Ventilator-associated pneumonia (VAP) is a prevalent and grave hospital-acquired infection that affects mechanically ventilated patients. Diverse diagnostic criteria can significantly affect VAP research by complicating the identification and management of the condition, which may also impact clinical management.
We conducted this review to assess the diagnostic criteria and the definitions of the term “ventilator-associated” used in randomised controlled trials (RCTs) of VAP management.
Based on the protocol (PROSPERO 2019 CRD42019147411), we conducted a systematic search on MEDLINE/PubMed and Cochrane CENTRAL for RCTs, published or registered between 2010 and 2024.
We included completed and ongoing RCTs that assessed pharmacological or non-pharmacological interventions in adults with VAP.
Data were collected using a tested extraction sheet, as endorsed by the Cochrane Collaboration. After cross-checking, data were summarised in a narrative and tabular form.
In total, 7,173 records were identified through the literature search. Following the exclusion of records that did not meet the eligibility criteria, 119 studies were included. Diagnostic criteria were provided in 51.2% of studies, and the term “ventilator-associated” was defined in 52.1% of studies. The most frequently included diagnostic criteria were pulmonary infiltrates (96.7%), fever (86.9%), hypothermia (49.1%), sputum (70.5%), and hypoxia (32.8%). The different criteria were used in 38 combinations across studies. The term “ventilator-associated” was defined in nine different ways.
When provided, diagnostic criteria and definitions of VAP in RCTs display notable variability. Continuous efforts to harmonise VAP diagnostic criteria in future clinical trials are crucial to improve quality of care, enable accurate epidemiological assessments, and guide effective antimicrobial stewardship.
Ventilator-associated pneumonia (VAP) stands as the most prevalent and serious hospital-acquired infection observed in intensive care units [ 1 ]. VAP prolongs hospital stays, durations of mechanical ventilation, and is associated with considerable mortality and an increase in healthcare costs [ 2 , 3 ].
Diagnosing VAP can be challenging for clinicians as it shares clinical signs and symptoms with other forms of pneumonia as well as non-infectious conditions [ 4 ]. The most recent international clinical guidelines define VAP as the presence of respiratory infection signs combined with new radiographic infiltrates in a patient who has been ventilated for at least 48 h [ 5 , 6 ]. While the guidelines developed by ERS/ESICM/ESCMID/ALAT do not provide a detailed definition of signs of respiratory infection [ 5 ], the ATS/IDSA guidelines mention that clinical signs may include the new onset of fever, purulent sputum, leucocytosis, and decline in oxygenation [ 6 ]. However, the ATS/IDSA guideline panel also acknowledges that there is no gold standard for the diagnosis of VAP [ 6 ]. This lack of a standardised definition is further highlighted by the varying, surveillance-based definitions of VAP provided by the Centre for Disease Control (CDC) and the European Centre for Disease Control (ECDC) [ 7 , 8 ]. These definitions, focusing on a combination of clinical, radiological, and microbiological signs to identify cases of VAP, were established to standardise reporting and facilitate the monitoring of infections in healthcare settings. However, the criteria given by the CDC and ECDC may not always align with the diagnostic criteria used by clinicians to confirm or rule out the condition [ 9 , 10 , 11 ].
Variations in the eligibility criteria applied to VAP can have a significant impact on systematic reviews and meta-analyses that assess different interventions, primarily due to the potential lack of comparability among the studied populations [ 12 ]. Furthermore, the incidence of VAP may be underestimated when excessively strict diagnostic criteria are employed [ 13 , 14 ].
A recent systematic review conducted by Weiss et al. focused on inclusion and judgment criteria used in randomised controlled trials (RCTs) on nosocomial pneumonia and found considerable heterogeneity [ 15 ]. However, the authors only considered RCTs evaluating antimicrobial treatment as interventions, did not distinguish between hospital-acquired pneumonia (HAP) and VAP, and did not evaluate definitions of the term "ventilator-associated".
The objective of this systematic review was to provide a concise overview of the diagnostic criteria for VAP recently used in RCTs, as well as the definitions attributed to the term "ventilator-associated". Its findings will provide valuable insights to a forthcoming task force, which aims to establish a uniform definition and diagnostic criteria for VAP in clinical trials. The task force will be made up of representatives from prominent international societies with an interest in VAP, as well as patient partners with lived experience. The harmonisation of the diagnostic criteria for VAP in upcoming clinical research are vital for enhancing patient care, enabling accurate epidemiological studies, and guiding successful antimicrobial stewardship programs.
The protocol for this systematic review was registered in advance with the International Prospective Register of Systematic Reviews (PROSPERO 2019 CRD42019147411), encompassing a broad review focusing on pneumonia outcomes and diagnostic criteria in RCTs. Recognising the limitations of discussing all findings in one manuscript, we opted to produce several focused and comprehensive manuscripts, all employing the same fundamental methodology, as registered with PROSPERO. While a previous publication focused on outcomes reported in RCTs on pneumonia management [ 16 ], the current submission specifically addresses diagnostic criteria for VAP.
We included RCTs that were registered, planned, and/or completed that: (1) enrolled adults with VAP; and (2) assessed the safety, efficacy and/or effectiveness of pharmacological or non-pharmacological interventions for treating VAP.
We have excluded systematic reviews, meta-analyses, narrative reviews, post hoc analyses from RCTs, observational studies, case reports, editorials, conference proceedings, and studies that do not exclusively focus on pneumonia (such as trials including patients with pneumonia alongside other diseases). Additionally, studies on pneumonia subtypes other than VAP, such as pneumonia without specifying a subtype, community-acquired pneumonia (CAP), healthcare-associated pneumonia (HCAP), and HAP, have also been excluded. To maintain focus and relevance, studies on Coronavirus Disease 2019 (COVID-19) were excluded from this systematic review, as the viral aetiology and distinct clinical management protocols differ significantly from the nature and treatment strategies of VAP. RCT protocols were only included if the results have not been previously published in another article included in this systematic review. Due to resource constraints and the lack of multilingual expertise within the review team, this systematic review was restricted to English-language RCTs.
On 20 May 2024, we searched MEDLINE/PubMed, and the Cochrane Register of Controlled Trials (CENTRAL) for RCTs published between 1 January 2010 and 19 May 2024. We used electronic algorithms introducing a combination of controlled vocabulary and search terms as reported in the Appendix.
Two reviewers (FH, MF) independently screened titles and abstracts to identify eligible studies using Rayyan [ 17 ]. In case of disagreement, a third reviewer was consulted (AGM). After immediate exclusion of duplicates using EndNote X9, four reviewers (AGM, FH, JH, MF) independently checked for eligibility at full-text level. The results of the selection process are reported according to the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) [ 18 ].
We developed an extraction sheet as endorsed by the Cochrane Collaboration [ 19 ]. The extraction sheet was independently tested by three reviewers (AGM, FH, MF) on five randomly selected studies and adapted to ensure good inter-reviewer agreement. The extraction sheet contained the following elements: (1) study ID, name, reference and NCT number; (2) type of pneumonia: CAP, HCAP, HAP and/or VAP; (3) diagnostic criteria for pneumonia; (4) definition of setting; (5) study origin, design, populations, interventions, and outcomes.
Four reviewers (AGM, FH, JH, MF) extracted data from the eligible studies. Data were extracted sequentially from either a manuscript containing published results, a published protocol, or, upon obtaining a trial registration number from CENTRAL, from one of the designated trial registries, such as ClinicalTrials.gov, the Clinical Trials Registry India (CTRI), the Chinese Clinical Trial Registry (ChiCTR), the European Clinical Trials Database (EudraCT), the Iranian Registry of Clinical Trials (IRCT), the Japan Primary Registries Network (JPRN), and the Japanese University Hospital Medical Information Network Clinical Trials Registry (UMIN-CTR). Cross-checking of all extracted data was performed by a second reviewer (AGM, AK, MF, RR, TW). Disagreements regarding data collection were resolved by discussion between all reviewers.
The findings were consolidated through a combination of narrative and tabular formats. The presentation encompassed the quantitative representation of each diagnostic criterion in terms of numerical values and proportions. Additionally, we provide an analysis of the various combinations of diagnostic criteria employed in RCTs in a sunburst diagram and a tabular format, along with an examination of the definitions attributed to the term "ventilator-associated".
The main goal of this systematic review was to explore the diagnostic criteria used in clinical trials for diagnosing VAP. It covered trials with published protocols and/or results, as well as those only registered in a trial database. The varying levels and gaps in the information provided by the various sources made it difficult to conduct a reliable and meaningful risk of bias assessment for all included studies. However, for RCTs with published data, risk of bias was evaluated by four reviewers (AGM, JH, MF, RR) using the Risk of Bias in Randomized Trials 2 tool (RoB-2 tool), as endorsed by the Cochrane Collaboration [ 20 ].
A total of 7173 records were identified through the databases MEDLINE and CENTRAL, as illustrated in Fig. 1 . Following the removal of duplicate entries, a screening process involving the evaluation of titles and abstracts was conducted on 5652 records. Among these, 650 records were deemed potentially eligible for inclusion. Ultimately, our review included 119 studies that specifically focused on VAP (Table S1 in the Appendix, the full dataset is available online [ 21 ]).
PRISMA flowchart showing study selection
The total number of patients in the 119 identified studies was 21,289. Among these studies, 83 focused exclusively on VAP, while the remaining studies encompassed various subtypes of pneumonia in addition to VAP (see Table 1 ). The majority of these studies were registered, and their protocols were accessible either through publication in a journal article or on a clinical trial platform. Results were accessible in 56.3% of cases, while both results and the protocol were accessible in 36.9% of cases. In 40.3% of the included studies, data could only be obtained from a trial registry platform, with ClinicalTrials.gov being the primary platform in 36 out of 48 cases, and ChiCTR (n = 2), CTRI (n = 3), EudraCT (n = 3), IRCT (n = 2), JPRN (n = 1) and UMIN-CTR (n = 1) in the remaining cases.
Diagnostic criteria were provided in 51.2% and the term “ventilator-associated” was defined in 52.1% of the studies, respectively. Of the 20 studies (16.8%) that referred to previously published diagnostic criteria, 13 cited the Clinical Pulmonary Infection Score (CPIS) [ 22 ], while the remaining referred to national and international guidelines.
We evaluated the risk of bias in 67 studies with published results using the RoB-2 tool. The overall assessment showed that 25% of the studies were at high risk of bias, 30% were at low risk of bias, and the remaining 45% had some concerns about potential bias. These results indicate variability in the methodological quality of the studies included in the review. The overall risk of bias and the detailed results of our assessments for the 67 studies are displayed in the Appendix (Figures SF1-SF2).
Pulmonary infiltrates.
Of the 61 studies on VAP that provided diagnostic criteria, 59 (96.7%) included the radiological evidence of a new or progressive pulmonary infiltrate.
The most frequently included clinical signs and symptoms were fever (86.9%), hypothermia (49.1%), sputum (70.5%), and hypoxia (32.8%). Different cut-off values were employed to define fever and hypothermia, as indicated in Table 2 . The majority of studies, accounting for 45.2%, utilised a cut-off of > 38 degrees Celsius (°C) to define fever, while 13.2% of studies used a cut-off of ≥ 38°C. In the case of hypothermia, the most commonly employed cut-off value was < 35°C, which was utilised in 43.3% of studies that included hypothermia as a criterion. Only a minority of studies provided information on the site of temperature measurement. Oral measurement was the most frequently employed method, followed by axillary and core temperature measurements (further details are displayed in Table S2 in the Appendix).
Fifty-four studies (88.5%) incorporated white blood count abnormalities as part of their diagnostic criteria for VAP. Conversely, only one study included an elevation of procalcitonin (PCT) as a diagnostic factor, and none of the identified studies included C-reactive protein (CRP). The specific thresholds for leucocytosis and leucopoenia varied across studies, with leucocyte counts ranging from greater than 10,000/mm3 to greater than 12,000/mm3 for leucocytosis, and less than 3,500/mm3 to less than 4,500/mm3 for leucopoenia (Table 3 ).
All definitions of pneumonia were composite in nature and required the fulfilment of a minimum number of predetermined criteria for the diagnosis to be established. In 90.2% of the studies the presence of a new pulmonary infiltrate was a mandatory criterion. Two studies did not include an infiltrate as criterion, whereas the remaining studies (n = 4) included the presence of an infiltrate in their criteria, it was, however, not required for a diagnosis.
The most commonly employed set of diagnostic criteria (18/61, 29.5%) consisted of a pulmonary infiltrate along with two or more additional criteria. However, these additional criteria varied across studies (Fig. 2 ). A quarter (17/61) of the included studies that provided diagnostic criteria required the fulfilment of all individual criteria for diagnosis, including an infiltrate. An infiltrate and one or more additional criteria were used to establish a diagnosis of VAP in 14.8% of studies (9/61). A total of 38 different combinations of diagnostic criteria for VAP were used in the 61 identified studies. A full set of these criteria is displayed in Table S3 in the Appendix.
The different combinations of diagnostic criteria used in VAP RCTs. CXR radiological evidence of a new infiltrate; T temperature criterion; WBC white blood count criterion; dys/tach dyspnoea and/or tachypnoea; O2 hypoxia; auscultation auscultation abnormalities
We noted that 52.1% of included studies incorporated a specific definition of the term “ventilator-associated” (Table 4 ). A total of nine distinct definitions were identified across 62 RCTs. The definition most commonly used was “onset after > 48 h of mechanical ventilation” (82.3%). Other definitions employed varying time thresholds, ranging from 24 h to seven days. Additionally, certain studies introduced supplementary criteria to further delineate the concept of “ventilator-associated”, such as administration of antibiotics prior to mechanical ventilation, duration of hospitalisation, or the timing of extubation.
This systematic review provides a concise overview of the diagnostic criteria for VAP used in RCTs and the definitions attributed to the term “ventilator-associated”. A total of 119 studies on VAP, published or registered between 2010 and 2024, were included, spanning a total of 21,289 patients. The majority of studies focused exclusively on VAP, while some also included other subtypes of pneumonia alongside VAP. Diagnostic criteria were provided in only 51.2% of the studies, and the term “ventilator-associated” was defined in only 52.1% of the studies. The most commonly utilised definition for “ventilator-associated” was “onset after > 48 h of mechanical ventilation”, used by 82.3% of studies providing a definition.
In clinical practice, the diagnosis of VAP is often based on a combination of clinical signs, laboratory results, and imaging findings, yet these are not without their limitations [ 8 ]. Our systematic review revealed considerable heterogeneity among diagnostic criteria for VAP in recent RCTs. Various combinations of specific criteria were employed to define VAP, leading to significant variability. Moreover, commonly used criteria were defined in different ways, with variations observed in the thresholds set for fever/hypothermia, as well as leucocytosis/leucopoenia.
Several criteria that were used in the studies included in our review have been shown to be insufficient for confirming a diagnosis of VAP. One of the most important criteria, included in the majority of reviewed RCTs, a new or progressive pulmonary infiltrate, has previously been reported to be of limited diagnostic value due to a lack of specificity [ 14 ]. Additionally, criteria like fever/hypothermia and the measurement of biomarkers such as leukocytes, CRP, and PCT may not be effective in diagnosing or excluding VAP in various clinical settings [ 4 , 23 , 24 ]. Despite this, CRP is widely used and has demonstrated some clinical value in predicting VAP [ 25 ]. It is, therefore, surprising that none of the RCTs included in our review employed CRP as a diagnostic criterion.
Overall, the findings of our systematic review underline the diverse nature of VAP, with different diagnostic criteria increasing the risk of both over- and underdiagnosis of VAP [ 14 , 26 ]. There have been attempts to diagnose VAP more objectively, one of these being the development of the CPIS in 1991, a six-component score that 10.9% of studies included in our review referred to [ 27 ]. This score includes different cut-offs for body temperature, leucocyte counts, tracheal secretion appearances, oxygenation levels and radiographical changes to estimate the risk for VAP. However, the CPIS has been shown not to be superior to other diagnostic criteria, and, therefore, its application remains controversial [ 8 , 11 , 22 , 28 ]. Other commonly applied criteria, such as the surveillance-based criteria by the ECDC and CDC, did not seem to be accurate enough to detect true cases of VAP either [ 9 , 10 , 11 ]. Furthermore, there is limited agreement between the two surveillance-based criteria, which has previously resulted in different estimates of VAP events [ 29 ].
In lieu of definitive diagnostic scores or sets of diagnostic criteria to detect all true cases of VAP, the findings of our systematic review indicate the need for more homogeneous diagnostic criteria in future RCTs, to assure their comparability. Currently, international guidelines avoid providing clear diagnostic criteria for VAP [ 5 , 6 ]. Given the significance of establishing strong consensus definitions for high-risk conditions like VAP, it is essential to emphasise even further that a uniform definition is crucial not only for advancing therapeutic research but also, and perhaps more importantly, for refining diagnostic methods. Together with core outcome sets, these definitions can help to improve the likelihood of attaining robust and reliable findings in forthcoming systematic reviews and meta-analyses [ 16 , 30 ].
We used a comprehensive search strategy which included multiple databases and a wide range of search terms, ensuring broad identification of all potentially relevant trials. Additionally, the inclusion criteria were clearly defined, and the study selection process was conducted independently by multiple reviewers to minimise bias. The extraction sheet used for data collection was tested for inter-reviewer agreement and adapted accordingly. Another strength is the open availability of the complete dataset, maximising the transparency and reproducibility of our findings.
However, the following limitations need to be acknowledged. Firstly, the review only included RCTs conducted in English, which may have introduced language bias. This approach was adopted to ensure feasible and reliable data analysis within the scope of the resources available.
Additionally, the exclusion of studies focusing on pneumonia subtypes other than VAP may limit the generalisability of our findings. Furthermore, the lack of diagnostic criteria and definitions in a significant proportion of included studies suggests a potential reporting bias. This might be reinforced by the fact that 40.3% of data were received from trial registry platforms. Compared to final manuscript publications, reporting of eligibility criteria is often incomplete on registry platforms, therefore this must be highlighted as a limitation [ 31 ].
This systematic review provides an overview of diagnostic criteria for VAP used in RCTs and the definitions attributed to the term “ventilator-associated”. Our findings highlight the heterogeneity and lack of standardisation in commonly used diagnostic criteria, as well as the variability in definitions of "ventilator-associated" across clinical trials. We emphasise the need for a uniform definition of VAP to enable better comparability between studies and interventions. The results of this review will inform the work of an upcoming task force aimed at establishing such standardised criteria.
Raw data are accessible via the Open Science Framework (OSF) at osf.io/v3 × 42. This link is referenced in our manuscript (Ref. 21).
Torres A, Cilloniz C, Niederman MS, et al. Pneumonia. Nat Rev Dis Primers. 2021;7(1):28. https://doi.org/10.1038/s41572-021-00259-0 .
Article Google Scholar
Muscedere JG, Day A, Heyland DK. Mortality, attributable mortality, and clinical events as end points for clinical trials of ventilator-associated pneumonia and hospital-acquired pneumonia. Clin Infect Dis. 2010;51:S120–5. https://doi.org/10.1086/653060 .
Article PubMed Google Scholar
Melsen WG, Rovers MM, Groenwold RH, et al. Attributable mortality of ventilator-associated pneumonia: a meta-analysis of individual patient data from randomised prevention studies. Lancet Infect Dis. 2013;13:665–71. https://doi.org/10.1016/S1473-3099(13)70081-1 .
Alagna L, Palomba E, Chatenoud L, et al. Comparison of multiple definitions for ventilator-associated pneumonia in patients requiring mechanical ventilation for non-pulmonary conditions: preliminary data from PULMIVAP, an Italian multi-centre cohort study. J Hosp Infect. 2023;140:90–5. https://doi.org/10.1016/j.jhin.2023.07.023 .
Article CAS PubMed Google Scholar
Torres A, Niederman MS, Chastre J, et al. International ERS/ESICM/ESCMID/ALAT guidelines for the management of hospital-acquired pneumonia and ventilator-associated pneumonia. Eur Respir J. 2017;50:1700582. https://doi.org/10.1183/13993003.00582-2017 .
Kalil AC, Metersky ML, Klompas M, et al. Management of Adults With Hospital-acquired and Ventilator-associated Pneumonia: 2016 Clinical Practice Guidelines by the Infectious Diseases Society of America and the American Thoracic Society. Clin Infect Dis. 2016;63:e61–111. https://doi.org/10.1093/cid/ciw353 .
Article PubMed PubMed Central Google Scholar
Plachouras D, Lepape A, Suetens C. ECDC definitions and methods for the surveillance of healthcare-associated infections in intensive care units. Intensive Care Med. 2018;44:2216–8. https://doi.org/10.1007/s00134-018-5113-0 .
Nair GB, Niederman MS. Ventilator-associated pneumonia: present understanding and ongoing debates. Intensive Care Med. 2015;41:34–48. https://doi.org/10.1007/s00134-014-3564-5 .
Ramírez-Estrada S, Lagunes L, Peña-López Y, et al. Assessing predictive accuracy for outcomes of ventilator-associated events in an international cohort: the EUVAE study. Intensive Care Med. 2018;44:1212–20. https://doi.org/10.1007/s00134-018-5269-7 .
Waltrick R, Possamai DS, de Aguiar FP, et al. Comparison between a clinical diagnosis method and the surveillance technique of the Center for Disease Control and Prevention for identification of mechanical ventilator-associated pneumonia. Rev Bras Ter Intensiva. 2015;27:260. https://doi.org/10.5935/0103-507X.20150047 .
Rahimibashar F, Miller AC, Yaghoobi MH, Vahedian-Azimi A. A comparison of diagnostic algorithms and clinical parameters to diagnose ventilator-associated pneumonia: a prospective observational study. BMC Pulm Med. 2021;21:161. https://doi.org/10.1186/s12890-021-01527-1 .
Article CAS PubMed PubMed Central Google Scholar
Malmivaara A. Methodological considerations of the GRADE method. Ann Med. 2015;47:1–5.
Al-Omari B, McMeekin P, Allen AJ, et al. Systematic review of studies investigating ventilator associated pneumonia diagnostics in intensive care. BMC Pulm Med. 2021;21:196. https://doi.org/10.1186/s12890-021-01560-0 .
Fernando SM, Tran A, Cheng W, et al. Diagnosis of ventilator-associated pneumonia in critically ill adult patients—a systematic review and meta-analysis. Intensive Care Med. 2020;46:1170–9. https://doi.org/10.1007/s00134-020-06036-z .
Weiss E, Essaied W, Adrie C, et al. Treatment of severe hospital-acquired and ventilator-associated pneumonia: a systematic review of inclusion and judgment criteria used in randomized controlled trials. Crit Care. 2017. https://doi.org/10.1186/s13054-017-1755-5 .
Mathioudakis AG, Fally M, Hansel J, et al. Clinical trials of pneumonia management assess heterogeneous outcomes and measurement instruments. J Clin Epidemiol. 2023;164:88–95. https://doi.org/10.1016/j.jclinepi.2023.10.011 .
Ouzzani M, Hammady H, Fedorowicz Z, Elmagarmid A. Rayyan—a web and mobile app for systematic reviews. Syst Rev. 2016;5:210. https://doi.org/10.1186/s13643-016-0384-4 .
Moher D, Liberati A, Tetzlaff J, et al. Preferred reporting items for systematic reviews and meta-analyses: The PRISMA statement. Ann Intern Med. 2009;151:264–9.
Li T, Higgins J, Deeks J (editors) (2019) Chapter 5: Collecting data | Cochrane Training. In: Cochrane Handbook for Systematic Reviews of Interventions version 6.0. https://training.cochrane.org/handbook/current/chapter-05 . Accessed 21 Jul 2020
Sterne JAC, Savović J, Page MJ, et al. RoB 2: a revised tool for assessing risk of bias in randomised trials. BMJ. 2019;366: l4898. https://doi.org/10.1136/bmj.l4898 .
ERS COS Pneumonia dataset. http://osf.io/v3x42
Zilberberg MD, Shorr AF. Ventilator-associated pneumonia: the clinical pulmonary infection score as a surrogate for diagnostics and outcome. Clin Infect Dis. 2010;51:S131–5. https://doi.org/10.1086/653062 .
Huang H-B, Peng J-M, Weng L, et al. Procalcitonin-guided antibiotic therapy in intensive care unit patients: a systematic review and meta-analysis. Ann Intensive Care. 2017;7:114. https://doi.org/10.1186/s13613-017-0338-6 .
Palazzo SJ, Simpson T, Schnapp L. Biomarkers for ventilator-associated pneumonia: review of the literature. Heart Lung. 2011;40:293–8. https://doi.org/10.1016/j.hrtlng.2010.11.003 .
Póvoa P, Martin-Loeches I, Ramirez P, et al. Biomarker kinetics in the prediction of VAP diagnosis: results from the BioVAP study. Ann Intensive Care. 2016;6:32. https://doi.org/10.1186/s13613-016-0134-8 .
Johnstone J, Muscedere J, Dionne J, et al. Definitions, rates and associated mortality of ICU-acquired pneumonia: a multicenter cohort study. J Crit Care. 2023;75:154284. https://doi.org/10.1016/j.jcrc.2023.154284 .
Pugin J, Auckenthaler R, Mili N, et al. Diagnosis of ventilator-associated pneumonia by bacteriologic analysis of bronchoscopic and nonbronchoscopic “blind” bronchoalveolar lavage fluid. Am Rev Respir Dis. 1991;143:1121–9. https://doi.org/10.1164/ajrccm/143.5_Pt_1.1121 .
Fàbregas N, Ewig S, Torres A, et al. Clinical diagnosis of ventilator associated pneumonia revisited: comparative validation using immediate post-mortem lung biopsies. Thorax. 1999;54:867–73.
Craven TH, Wojcik G, McCoubrey J, et al. Lack of concordance between ECDC and CDC systems for surveillance of ventilator associated pneumonia. Intensive Care Med. 2018;44:265–6. https://doi.org/10.1007/s00134-017-4993-8 .
Mathioudakis AG, Khaleva E, Fally M, et al. Core outcome sets, developed collaboratively with patients, can improve the relevance and comparability of clinical trials. Eur Respir J. 2023;61:2202107. https://doi.org/10.1183/13993003.02107-2022 .
Speich B, Gloy VL, Klatte K, et al. Reliability of trial information across registries for trials with multiple registrations. JAMA Netw Open. 2021;4:e2128898. https://doi.org/10.1001/jamanetworkopen.2021.28898 .
Download references
We would like to acknowledge and honour the contributions of Prof. Tobias Welte, who was a vital member of our research team and co-author of this manuscript. Prof. Welte passed away after the initial submission of this work but before its final acceptance. His insights and expertise were invaluable to the development of this research, and he remains deeply missed by the team. We dedicate this work to his memory.
Open access funding provided by Copenhagen University This study was partly supported by the NIHR Manchester Biomedical Research Centre (BRC, NIHR203308) as well as the Capital Region of Denmark (Region Hovedstaden). The funders had no role in study design, data collection or analysis, decision to publish, nor preparation of the manuscript. Dr Jan Hansel was supported by an NIHR Academic Clinical Fellowship in Intensive Care Medicine. Dr Rebecca Robey was supported by an NIHR Academic Clinical Fellowship in Respiratory Medicine. Dr Alexander G. Mathioudakis was supported by an NIHR Clinical Lectureship in Respiratory Medicine. All authors have completed a ICMJE uniform disclosure form detailing any conflicts of interest outside the submitted work that they may have. None of the authors have conflicts directly related to this work.
Authors and affiliations.
Department of Respiratory Medicine and Infectious Diseases, Copenhagen University Hospital – Bispebjerg and Frederiksberg, Copenhagen, Denmark
Markus Fally
North West Lung Centre, Wythenshawe Hospital, Manchester University NHS Foundation Trust, Manchester Academic Health Science Centre, Manchester, UK
Faiuna Haseeb, Ahmed Kouta, Rebecca C. Robey, Timothy Felton & Alexander G. Mathioudakis
Division of Immunology, Immunity to Infection and Respiratory Medicine, School of Biological Sciences, The University of Manchester, Manchester, UK
Faiuna Haseeb, Ahmed Kouta, Jan Hansel, Rebecca C. Robey, Timothy Felton & Alexander G. Mathioudakis
North West School of Intensive Care Medicine, Health Education England North West, Manchester, UK
Acute Intensive Care Unit, Wythenshawe Hospital, Manchester University NHS Foundation Trust, Manchester Academic Health Science Centre, Manchester, UK
Thomas Williams & Timothy Felton
Department of Respiratory Medicine and German Centre of Lung Research (DZL), Hannover Medical School, Hannover, Germany
Tobias Welte
You can also search for this author in PubMed Google Scholar
MF: conceptualisation, methodology, software, formal analysis, investigation, data curation, writing—original draft, visualisation, project administration. FH: conceptualisation, investigation, data curation, validation, writing—review and editing. AK, JH, RCR and TWI: data curation, validation, writing—review and editing. TWE: conceptualisation, investigation, methodology, resources, validation, writing—review and editing. TF: conceptualisation, investigation, methodology, resources, validation, writing—review and editing, supervision. AGM: conceptualisation, investigation, methodology, software, resources, validation, writing—review and editing, project administration, supervision, funding acquisition, project administration.
Correspondence to Markus Fally .
Ethics approval and consent to participate.
Not applicable, as this was a methodological systematic review without patient involvement/participation.
The authors declare no competing interests.
Publisher's note.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Additional file1 (docx 807 kb), search strategy, medline/pubmed.
#1: pneumonia [mh]
#2: bronchopneumonia [mh]
#3: pleuropneumonia [mh]
#4: Healthcare-Associated Pneumonia [mh]
#5: Ventilator-Associated Pneumonia [mh]
#6: pneumonia [ti]
#7: pneumonia* [ti]
#8: bronchopneumonia [ti]
#9: pleuropneumonia [ti]
#10: #1 OR #2 OR #3 OR #4 OR #5 OR #6 OR #7 OR #8 OR #9
#11: randomized controlled trial [pt]
#12: controlled clinical trial [pt]
#13: randomized [tiab]
#14: placebo [tiab]
#15: clinical trials as topic [mesh: noexp]
#16: randomly [tiab]
#17: trial [ti]
#18: #11 OR #12 OR #13 OR #14 OR #15 OR #16 OR #17
#19: animals [mh] NOT humans [mh]
#20: children [mh] NOT adults [mh]
#21: COVID-19 [mh] or (covid[ti]) or (coronavirus [ti]) or (sars-cov-2[ti]) or (covid-19[ti]) or (pandemic[ti])
#22: #19 OR #20 OR #21
#23: #18 NOT #22
#24: #10 AND #23
#25: Publication date: 2010 –2024
#1: MeSH descriptor: [Pneumonia] explode all trees
#2: pneumonia*:ti
#3: #1 or #2
#4: MeSH descriptor: [COVID-19] explode all trees
#5: COVID-19:ti
#6: covid:ti
#7: coronavirus:ti
#8: sars-cov-2:ti
#9: #4 or #5 or #6 or #7 or #8
#10: #3 not #9
#11: Limit: Publication Date from 2010–2024
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ . The Creative Commons Public Domain Dedication waiver ( http://creativecommons.org/publicdomain/zero/1.0/ ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
Reprints and permissions
Cite this article.
Fally, M., Haseeb, F., Kouta, A. et al. Unravelling the complexity of ventilator-associated pneumonia: a systematic methodological literature review of diagnostic criteria and definitions used in clinical research. Crit Care 28 , 214 (2024). https://doi.org/10.1186/s13054-024-04991-3
Download citation
Received : 28 February 2024
Accepted : 15 June 2024
Published : 02 July 2024
DOI : https://doi.org/10.1186/s13054-024-04991-3
Anyone you share the following link with will be able to read this content:
Sorry, a shareable link is not currently available for this article.
Provided by the Springer Nature SharedIt content-sharing initiative
ISSN: 1364-8535
The Cowles Library website will be unavailable on Tuesday, Aug. 6th from 1:00 p.m.- 5:00 p.m. due to a scheduled migration to a new platform. You can reach our list of Research Databases at https://library.drake.edu/az/databases
In This Section:
Description.
A literature review, also called a review article or review of literature, surveys the existing research on a topic. The term "literature" in this context refers to published research or scholarship in a particular discipline, rather than "fiction" (like American Literature) or an individual work of literature. In general, literature reviews are most common in the sciences and social sciences.
Literature reviews may be written as standalone works, or as part of a scholarly article or research paper. In either case, the purpose of the review is to summarize and synthesize the key scholarly work that has already been done on the topic at hand. The literature review may also include some analysis and interpretation. A literature review is not a summary of every piece of scholarly research on a topic.
Literature reviews can be very helpful for newer researchers or those unfamiliar with a field by synthesizing the existing research on a given topic, providing the reader with connections and relationships among previous scholarship. Reviews can also be useful to veteran researchers by identifying potentials gaps in the research or steering future research questions toward unexplored areas. If a literature review is part of a scholarly article, it should include an explanation of how the current article adds to the conversation. (From: https://library.drake.edu/englit/criticism)
Research articles: "are empirical articles that describe one or several related studies on a specific, quantitative, testable research question....they are typically organized into four text sections: Introduction, Methods, Results, Discussion." Source: https://psych.uw.edu/storage/writing_center/litrev.pdf)
1. Identify and define the topic that you will be reviewing.
The topic, which is commonly a research question (or problem) of some kind, needs to be identified and defined as clearly as possible. You need to have an idea of what you will be reviewing in order to effectively search for references and to write a coherent summary of the research on it. At this stage it can be helpful to write down a description of the research question, area, or topic that you will be reviewing, as well as to identify any keywords that you will be using to search for relevant research.
2. Conduct a Literature Search
Use a range of keywords to search databases such as PsycINFO and any others that may contain relevant articles. You should focus on peer-reviewed, scholarly articles . In SuperSearch and most databases, you may find it helpful to select the Advanced Search mode and include "literature review" or "review of the literature" in addition to your other search terms. Published books may also be helpful, but keep in mind that peer-reviewed articles are widely considered to be the “gold standard” of scientific research. Read through titles and abstracts, select and obtain articles (that is, download, copy, or print them out), and save your searches as needed. Most of the databases you will need are linked to from the Cowles Library Psychology Research guide .
3. Read through the research that you have found and take notes.
Absorb as much information as you can. Read through the articles and books that you have found, and as you do, take notes. The notes should include anything that will be helpful in advancing your own thinking about the topic and in helping you write the literature review (such as key points, ideas, or even page numbers that index key information). Some references may turn out to be more helpful than others; you may notice patterns or striking contrasts between different sources; and some sources may refer to yet other sources of potential interest. This is often the most time-consuming part of the review process. However, it is also where you get to learn about the topic in great detail. You may want to use a Citation Manager to help you keep track of the citations you have found.
4. Organize your notes and thoughts; create an outline.
At this stage, you are close to writing the review itself. However, it is often helpful to first reflect on all the reading that you have done. What patterns stand out? Do the different sources converge on a consensus? Or not? What unresolved questions still remain? You should look over your notes (it may also be helpful to reorganize them), and as you do, to think about how you will present this research in your literature review. Are you going to summarize or critically evaluate? Are you going to use a chronological or other type of organizational structure? It can also be helpful to create an outline of how your literature review will be structured.
5. Write the literature review itself and edit and revise as needed.
The final stage involves writing. When writing, keep in mind that literature reviews are generally characterized by a summary style in which prior research is described sufficiently to explain critical findings but does not include a high level of detail (if readers want to learn about all the specific details of a study, then they can look up the references that you cite and read the original articles themselves). However, the degree of emphasis that is given to individual studies may vary (more or less detail may be warranted depending on how critical or unique a given study was). After you have written a first draft, you should read it carefully and then edit and revise as needed. You may need to repeat this process more than once. It may be helpful to have another person read through your draft(s) and provide feedback.
6. Incorporate the literature review into your research paper draft. (note: this step is only if you are using the literature review to write a research paper. Many times the literature review is an end unto itself).
After the literature review is complete, you should incorporate it into your research paper (if you are writing the review as one component of a larger paper). Depending on the stage at which your paper is at, this may involve merging your literature review into a partially complete Introduction section, writing the rest of the paper around the literature review, or other processes.
These steps were taken from: https://psychology.ucsd.edu/undergraduate-program/undergraduate-resources/academic-writing-resources/writing-research-papers/writing-lit-review.html#6.-Incorporate-the-literature-r
Use materials placed on reserve by your instructors
Borrow books directly from other Iowa academic library partners
Borrow material from libraries around the world
Ask the library to purchase books or other research materials
Drake history and Iowa political papers
Online access to unique items from the University Archives
Books, eBooks, and videos we highlight throughout the year
Handpicked by experts for your area of study
Schedule a one-on-one session with a librarian
A guide to the research process
Librarians who specialize in your area of study
Find, organize, and use your citations
Writing Center, Speaking Center, and other Tutoring
Tools & resources to help develop your study skills
What we teach and how we can help in your courses
Connect with a librarian
Put material on reserve for your courses
Help with course material adoptions and textbook alternatives
Explore, adopt, adapt, and create open educational resources
Resources to help you publish your research
Research & teaching, records management, about the archives.
What we do and why
Hours, directions, and guidelines for your Archives visit
Reach, follow, and support the Archives
Guides, tutorials, and library expertise to help you succeed as a scholar
Borrowing materials, finding a study space, locating services
Library services and support directed toward Drake Online and other off-campus students
Provide feedback or resolve a problem with the library
Resources and information literacy expertise to support your teaching
Cowles Library faculty and staff profiles
What's happening at Cowles Library
Student employment at Cowles Library
Where we are and when we're open
Services for Drake alumni and visitors
Navigate the library
Check availability and reserve a room
Technology in the library
Cowles Library mission and vision
Policies governing use of library resources, space, and services
Library support for diversity, equity, inclusion, and social justice
Trouble finding something? Try searching , or check out the Get Help page.
Warning: The NCBI web site requires JavaScript to function. more...
An official website of the United States government
The .gov means it's official. Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you're on a federal government site.
The site is secure. The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.
NCBI Bookshelf. A service of the National Library of Medicine, National Institutes of Health.
Brown KL, Pagel C, Ridout D, et al. Early morbidities following paediatric cardiac surgery: a mixed-methods study. Southampton (UK): NIHR Journals Library; 2020 Jul. (Health Services and Delivery Research, No. 8.30.)
Appendix 1 the prisma flow diagram for the literature review.
Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) flow diagram for the literature review. CINAHL, Cumulative Index to Nursing and Allied Health Literature.
Your browsing activity is empty.
Activity recording is turned off.
Turn recording back on
Connect with NLM
National Library of Medicine 8600 Rockville Pike Bethesda, MD 20894
Web Policies FOIA HHS Vulnerability Disclosure
Help Accessibility Careers
COMMENTS
Our Scoping Review provides a systematic overview on flow research between the years 2000 and 2016. A task force of flow research from the EFRN united their expertise in order to provide a sound scientific summary and discussion of flow research in these years and implications for future research.
What is a literature review? A literature review is a survey of scholarly sources on a specific topic. It provides an overview of current knowledge, allowing you to identify relevant theories, methods, and gaps in the existing research that you can later apply to your paper, thesis, or dissertation topic.
Our review (1) provides a framework to cluster flow research, (2) gives a systematic overview about existing studies and their findings, and (3) provides an overview about implications for future ...
Reviewing the literature requires the ability to juggle multiple tasks, from finding and evaluating relevant material to synthesising information from various sources, from critical thinking to paraphrasing, evaluating, and citation skills [7]. In this contribution, I share ten simple rules I learned working on about 25 literature reviews as a PhD and postdoctoral student. Ideas and insights ...
A Review of Flow Operationalizations in the Psychological Literature Within any field of science, the consensual operationalization of central constructs is a sine qua non for progress. When this is lacking, results across studies cannot be compared, and the potential for progress in the field is severely undermined.
When developing new therapy games, measuring flow experience can indicate whether the game motivates one to train. The purpose of this study was to identify and systematically review current literature on flow experience assessed in patients with stroke, traumatic brain injury, multiple sclerosis and Parkinson's disease.
The flow diagram depicts the flow of information through the different phases of a systematic review. It maps out the number of records identified, included and excluded, and the reasons for exclusions.
Purpose: This paper presents a systematic literature review on flow experience to identify the theoretical underpinnings, outcomes, antecedents, and empirical dimensions of the phenomenon used in ...
A literature review is an account of what has been published on a topic by accredited scholars and researchers. Occasionally you will be asked to write one as a separate assignment, but more often it is part of the introduction to an essay, research report, or thesis. In writing the literature review, your purpose is to convey to your reader what knowledge and ideas have been established on a ...
Writing a Literature Review. A literature review is a document or section of a document that collects key sources on a topic and discusses those sources in conversation with each other (also called synthesis ). The lit review is an important genre in many disciplines, not just literature (i.e., the study of works of literature such as novels ...
Narrative review: The purpose of this type of review is to describe the current state of the research on a specific topic/research and to offer a critical analysis of the literature reviewed. Studies are grouped by research/theoretical categories, and themes and trends, strengths and weakness, and gaps are identified. The review ends with a conclusion section which summarizes the findings ...
Assistant Professor Tanya Golash-Boza summarizes six steps to help you learn how to write a literature review.
Guide to writing, citing, and publishing resources for the health and social sciences.
The flow diagram originally proposed by QUOROM was also modified to show numbers of identified records, excluded articles, and included studies. After 11 revisions the group approved the checklist, flow diagram, and this explanatory paper. Fig 1 Flow of information through the different phases of a systematic review.
Our review (1) provides a framework to cluster flow research, (2) gives a systematic overview about existing studies and their findings, and (3) provides an overview about implications for future research. The provided framework consists of three levels of flow research.
Topics covered in the studies reviewed include the psychophysiological aspects of flow, transmission and group experience of flow, the association of flow with a range of positive outcomes, factors that contribute to flow experiences, and flow experiences of young children. Implications for future research were proffered in light of the findings.
Step 1. Select a Topic Task 1. Identify a Subject for Study Step 2. Develop the of Argumentation Tools Concept 1. Building the Case for a Literature Review Step 3. Search the Literature Task 1. Select the Literature to Review Task 2. Translate the Personal Interest or Concern Into a Research Query {{ Activity 1. Focus a Research Interest {{ Activity 2. Limit the Interest {{ Activity 3. Select ...
The methods and results of systematic reviews should be reported in sufficient detail to allow users to assess the trustworthiness and applicability of the review findings. The Preferred Reporting Items for Systematic reviews and Meta-Analyses (PRISMA) ...
Decide what type of review you are doing. A proper systematic review looks at absolutely every resource to find all the information to answer a very narrow research question. What's In A Name?: the difference between a systematic review and a literature review and why it matters. (1)
Flow is a state of entire immersion in the present action, which can lead to effortless and joyful performances. The primary objective of this systematic literature review was directed toward ...
The flow diagram depicts the flow of information through the different phases of a systematic review. It maps out the number of records identified, included and excluded, and the reasons for exclusions. Different templates are available depending on the type of review (new or updated) and sources used to identify studies: PRISMA 2020 flow ...
Steps in a Systematic Review. Searching the Published Literature. Searching the Gray Literature. Methodology and Documentation. Managing the Process. Help. Scoping Reviews. Includes the number of results retrieved from each source. Duplicates are removed.
This session equips participants with all the fundamental skills that they need to research and begin writing their literature review. This includes building and executing effective search strategies to locate relevant materials for literature reviews, projects and other related research activities, key searching techniques, where to search, and how to keep up to date with the
Literature Review 5. Flow Range. Flow range is the difference between maximum and minimum flows over a specific time interval. Higher flow ... More research on the duration of the high-flow period of TMFs in below-average or average water years could reduce uncertainty in
Ventilator-associated pneumonia (VAP) is a prevalent and grave hospital-acquired infection that affects mechanically ventilated patients. Diverse diagnostic criteria can significantly affect VAP research by complicating the identification and management of the condition, which may also impact clinical management. We conducted this review to assess the diagnostic criteria and the definitions of ...
Description. A literature review, also called a review article or review of literature, surveys the existing research on a topic. The term "literature" in this context refers to published research or scholarship in a particular discipline, rather than "fiction" (like American Literature) or an individual work of literature.
FIGURE 20 Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) flow diagram for the literature review. CINAHL, Cumulative Index to Nursing and Allied Health Literature.
However, due to changes quick pills review in the political environment and conditions at surgery penile enlargement before after home flow male review and abroad, and influenced by the ultra left trend of thought, political studies have been neglected.
They fast flow male enhancement reviews believe that the function of the government is to protect individuals from unpredictable circumstances and fast flow enhancement pills reviews to control economic activities based on overall interests, even if the government is required fast reviews to own and use the means of production.
Well, when handling unavailable papers in a systematic literature review, it is essential to adopt a structured approach to ensure the integrity and comprehensiveness of the review.