• Skip to main content
  • Skip to primary sidebar
  • Skip to footer
  • QuestionPro

survey software icon

  • Solutions Industries Gaming Automotive Sports and events Education Government Travel & Hospitality Financial Services Healthcare Cannabis Technology Use Case AskWhy Communities Audience Contactless surveys Mobile LivePolls Member Experience GDPR Positive People Science 360 Feedback Surveys
  • Resources Blog eBooks Survey Templates Case Studies Training Help center

research methodology and data analysis

Home Market Research

Data Analysis in Research: Types & Methods

data-analysis-in-research

What is data analysis in research?

Definition of research in data analysis: According to LeCompte and Schensul, research data analysis is a process used by researchers to reduce data to a story and interpret it to derive insights. The data analysis process helps reduce a large chunk of data into smaller fragments, which makes sense. 

Three essential things occur during the data analysis process — the first is data organization . Summarization and categorization together contribute to becoming the second known method used for data reduction. It helps find patterns and themes in the data for easy identification and linking. The third and last way is data analysis – researchers do it in both top-down and bottom-up fashion.

On the other hand, Marshall and Rossman describe data analysis as a messy, ambiguous, and time-consuming but creative and fascinating process through which a mass of collected data is brought to order, structure and meaning.

We can say that “the data analysis and data interpretation is a process representing the application of deductive and inductive logic to the research and data analysis.”

Why analyze data in research?

Researchers rely heavily on data as they have a story to tell or research problems to solve. It starts with a question, and data is nothing but an answer to that question. But, what if there is no question to ask? Well! It is possible to explore data even without a problem – we call it ‘Data Mining’, which often reveals some interesting patterns within the data that are worth exploring.

Irrelevant to the type of data researchers explore, their mission and audiences’ vision guide them to find the patterns to shape the story they want to tell. One of the essential things expected from researchers while analyzing data is to stay open and remain unbiased toward unexpected patterns, expressions, and results. Remember, sometimes, data analysis tells the most unforeseen yet exciting stories that were not expected when initiating data analysis. Therefore, rely on the data you have at hand and enjoy the journey of exploratory research. 

Create a Free Account

Types of data in research

Every kind of data has a rare quality of describing things after assigning a specific value to it. For analysis, you need to organize these values, processed and presented in a given context, to make it useful. Data can be in different forms; here are the primary data types.

  • Qualitative data: When the data presented has words and descriptions, then we call it qualitative data . Although you can observe this data, it is subjective and harder to analyze data in research, especially for comparison. Example: Quality data represents everything describing taste, experience, texture, or an opinion that is considered quality data. This type of data is usually collected through focus groups, personal qualitative interviews , qualitative observation or using open-ended questions in surveys.
  • Quantitative data: Any data expressed in numbers of numerical figures are called quantitative data . This type of data can be distinguished into categories, grouped, measured, calculated, or ranked. Example: questions such as age, rank, cost, length, weight, scores, etc. everything comes under this type of data. You can present such data in graphical format, charts, or apply statistical analysis methods to this data. The (Outcomes Measurement Systems) OMS questionnaires in surveys are a significant source of collecting numeric data.
  • Categorical data: It is data presented in groups. However, an item included in the categorical data cannot belong to more than one group. Example: A person responding to a survey by telling his living style, marital status, smoking habit, or drinking habit comes under the categorical data. A chi-square test is a standard method used to analyze this data.

Learn More : Examples of Qualitative Data in Education

Data analysis in qualitative research

Data analysis and qualitative data research work a little differently from the numerical data as the quality data is made up of words, descriptions, images, objects, and sometimes symbols. Getting insight from such complicated information is a complicated process. Hence it is typically used for exploratory research and data analysis .

Finding patterns in the qualitative data

Although there are several ways to find patterns in the textual information, a word-based method is the most relied and widely used global technique for research and data analysis. Notably, the data analysis process in qualitative research is manual. Here the researchers usually read the available data and find repetitive or commonly used words. 

For example, while studying data collected from African countries to understand the most pressing issues people face, researchers might find  “food”  and  “hunger” are the most commonly used words and will highlight them for further analysis.

The keyword context is another widely used word-based technique. In this method, the researcher tries to understand the concept by analyzing the context in which the participants use a particular keyword.  

For example , researchers conducting research and data analysis for studying the concept of ‘diabetes’ amongst respondents might analyze the context of when and how the respondent has used or referred to the word ‘diabetes.’

The scrutiny-based technique is also one of the highly recommended  text analysis  methods used to identify a quality data pattern. Compare and contrast is the widely used method under this technique to differentiate how a specific text is similar or different from each other. 

For example: To find out the “importance of resident doctor in a company,” the collected data is divided into people who think it is necessary to hire a resident doctor and those who think it is unnecessary. Compare and contrast is the best method that can be used to analyze the polls having single-answer questions types .

Metaphors can be used to reduce the data pile and find patterns in it so that it becomes easier to connect data with theory.

Variable Partitioning is another technique used to split variables so that researchers can find more coherent descriptions and explanations from the enormous data.

Methods used for data analysis in qualitative research

There are several techniques to analyze the data in qualitative research, but here are some commonly used methods,

  • Content Analysis:  It is widely accepted and the most frequently employed technique for data analysis in research methodology. It can be used to analyze the documented information from text, images, and sometimes from the physical items. It depends on the research questions to predict when and where to use this method.
  • Narrative Analysis: This method is used to analyze content gathered from various sources such as personal interviews, field observation, and  surveys . The majority of times, stories, or opinions shared by people are focused on finding answers to the research questions.
  • Discourse Analysis:  Similar to narrative analysis, discourse analysis is used to analyze the interactions with people. Nevertheless, this particular method considers the social context under which or within which the communication between the researcher and respondent takes place. In addition to that, discourse analysis also focuses on the lifestyle and day-to-day environment while deriving any conclusion.
  • Grounded Theory:  When you want to explain why a particular phenomenon happened, then using grounded theory for analyzing quality data is the best resort. Grounded theory is applied to study data about the host of similar cases occurring in different settings. When researchers are using this method, they might alter explanations or produce new ones until they arrive at some conclusion.
Choosing the right software can be tough. Whether you’re a researcher, business leader, or marketer, check out the top 10  qualitative data analysis software  for analyzing qualitative data.

Data analysis in quantitative research

Preparing data for analysis.

The first stage in research and data analysis is to make it for the analysis so that the nominal data can be converted into something meaningful. Data preparation consists of the below phases.

Phase I: Data Validation

Data validation is done to understand if the collected data sample is per the pre-set standards, or it is a biased data sample again divided into four different stages

  • Fraud: To ensure an actual human being records each response to the survey or the questionnaire
  • Screening: To make sure each participant or respondent is selected or chosen in compliance with the research criteria
  • Procedure: To ensure ethical standards were maintained while collecting the data sample
  • Completeness: To ensure that the respondent has answered all the questions in an online survey. Else, the interviewer had asked all the questions devised in the questionnaire.

Phase II: Data Editing

More often, an extensive research data sample comes loaded with errors. Respondents sometimes fill in some fields incorrectly or sometimes skip them accidentally. Data editing is a process wherein the researchers have to confirm that the provided data is free of such errors. They need to conduct necessary checks and outlier checks to edit the raw edit and make it ready for analysis.

Phase III: Data Coding

Out of all three, this is the most critical phase of data preparation associated with grouping and assigning values to the survey responses . If a survey is completed with a 1000 sample size, the researcher will create an age bracket to distinguish the respondents based on their age. Thus, it becomes easier to analyze small data buckets rather than deal with the massive data pile.

LEARN ABOUT: Steps in Qualitative Research

Methods used for data analysis in quantitative research

After the data is prepared for analysis, researchers are open to using different research and data analysis methods to derive meaningful insights. For sure, statistical analysis plans are the most favored to analyze numerical data. In statistical analysis, distinguishing between categorical data and numerical data is essential, as categorical data involves distinct categories or labels, while numerical data consists of measurable quantities. The method is again classified into two groups. First, ‘Descriptive Statistics’ used to describe data. Second, ‘Inferential statistics’ that helps in comparing the data .

Descriptive statistics

This method is used to describe the basic features of versatile types of data in research. It presents the data in such a meaningful way that pattern in the data starts making sense. Nevertheless, the descriptive analysis does not go beyond making conclusions. The conclusions are again based on the hypothesis researchers have formulated so far. Here are a few major types of descriptive analysis methods.

Measures of Frequency

  • Count, Percent, Frequency
  • It is used to denote home often a particular event occurs.
  • Researchers use it when they want to showcase how often a response is given.

Measures of Central Tendency

  • Mean, Median, Mode
  • The method is widely used to demonstrate distribution by various points.
  • Researchers use this method when they want to showcase the most commonly or averagely indicated response.

Measures of Dispersion or Variation

  • Range, Variance, Standard deviation
  • Here the field equals high/low points.
  • Variance standard deviation = difference between the observed score and mean
  • It is used to identify the spread of scores by stating intervals.
  • Researchers use this method to showcase data spread out. It helps them identify the depth until which the data is spread out that it directly affects the mean.

Measures of Position

  • Percentile ranks, Quartile ranks
  • It relies on standardized scores helping researchers to identify the relationship between different scores.
  • It is often used when researchers want to compare scores with the average count.

For quantitative research use of descriptive analysis often give absolute numbers, but the in-depth analysis is never sufficient to demonstrate the rationale behind those numbers. Nevertheless, it is necessary to think of the best method for research and data analysis suiting your survey questionnaire and what story researchers want to tell. For example, the mean is the best way to demonstrate the students’ average scores in schools. It is better to rely on the descriptive statistics when the researchers intend to keep the research or outcome limited to the provided  sample  without generalizing it. For example, when you want to compare average voting done in two different cities, differential statistics are enough.

Descriptive analysis is also called a ‘univariate analysis’ since it is commonly used to analyze a single variable.

Inferential statistics

Inferential statistics are used to make predictions about a larger population after research and data analysis of the representing population’s collected sample. For example, you can ask some odd 100 audiences at a movie theater if they like the movie they are watching. Researchers then use inferential statistics on the collected  sample  to reason that about 80-90% of people like the movie. 

Here are two significant areas of inferential statistics.

  • Estimating parameters: It takes statistics from the sample research data and demonstrates something about the population parameter.
  • Hypothesis test: I t’s about sampling research data to answer the survey research questions. For example, researchers might be interested to understand if the new shade of lipstick recently launched is good or not, or if the multivitamin capsules help children to perform better at games.

These are sophisticated analysis methods used to showcase the relationship between different variables instead of describing a single variable. It is often used when researchers want something beyond absolute numbers to understand the relationship between variables.

Here are some of the commonly used methods for data analysis in research.

  • Correlation: When researchers are not conducting experimental research or quasi-experimental research wherein the researchers are interested to understand the relationship between two or more variables, they opt for correlational research methods.
  • Cross-tabulation: Also called contingency tables,  cross-tabulation  is used to analyze the relationship between multiple variables.  Suppose provided data has age and gender categories presented in rows and columns. A two-dimensional cross-tabulation helps for seamless data analysis and research by showing the number of males and females in each age category.
  • Regression analysis: For understanding the strong relationship between two variables, researchers do not look beyond the primary and commonly used regression analysis method, which is also a type of predictive analysis used. In this method, you have an essential factor called the dependent variable. You also have multiple independent variables in regression analysis. You undertake efforts to find out the impact of independent variables on the dependent variable. The values of both independent and dependent variables are assumed as being ascertained in an error-free random manner.
  • Frequency tables: The statistical procedure is used for testing the degree to which two or more vary or differ in an experiment. A considerable degree of variation means research findings were significant. In many contexts, ANOVA testing and variance analysis are similar.
  • Analysis of variance: The statistical procedure is used for testing the degree to which two or more vary or differ in an experiment. A considerable degree of variation means research findings were significant. In many contexts, ANOVA testing and variance analysis are similar.

Considerations in research data analysis

  • Researchers must have the necessary research skills to analyze and manipulation the data , Getting trained to demonstrate a high standard of research practice. Ideally, researchers must possess more than a basic understanding of the rationale of selecting one statistical method over the other to obtain better data insights.
  • Usually, research and data analytics projects differ by scientific discipline; therefore, getting statistical advice at the beginning of analysis helps design a survey questionnaire, select data collection methods , and choose samples.

LEARN ABOUT: Best Data Collection Tools

  • The primary aim of data research and analysis is to derive ultimate insights that are unbiased. Any mistake in or keeping a biased mind to collect data, selecting an analysis method, or choosing  audience  sample il to draw a biased inference.
  • Irrelevant to the sophistication used in research data and analysis is enough to rectify the poorly defined objective outcome measurements. It does not matter if the design is at fault or intentions are not clear, but lack of clarity might mislead readers, so avoid the practice.
  • The motive behind data analysis in research is to present accurate and reliable data. As far as possible, avoid statistical errors, and find a way to deal with everyday challenges like outliers, missing data, data altering, data mining , or developing graphical representation.

LEARN MORE: Descriptive Research vs Correlational Research The sheer amount of data generated daily is frightening. Especially when data analysis has taken center stage. in 2018. In last year, the total data supply amounted to 2.8 trillion gigabytes. Hence, it is clear that the enterprises willing to survive in the hypercompetitive world must possess an excellent capability to analyze complex research data, derive actionable insights, and adapt to the new market needs.

LEARN ABOUT: Average Order Value

QuestionPro is an online survey platform that empowers organizations in data analysis and research and provides them a medium to collect data by creating appealing surveys.

MORE LIKE THIS

research methodology and data analysis

QuestionPro: Leading the Charge in Customer Journey Management and Voice of the Customer Platforms

Sep 17, 2024

Driver analysis

What is Driver Analysis? Importance and Best Practices

research methodology and data analysis

Was The Experience Memorable? (Part II) — Tuesday CX Thoughts

data discovery

Data Discovery: What it is, Importance, Process + Use Cases

Sep 16, 2024

Other categories

  • Academic Research
  • Artificial Intelligence
  • Assessments
  • Brand Awareness
  • Case Studies
  • Communities
  • Consumer Insights
  • Customer effort score
  • Customer Engagement
  • Customer Experience
  • Customer Loyalty
  • Customer Research
  • Customer Satisfaction
  • Employee Benefits
  • Employee Engagement
  • Employee Retention
  • Friday Five
  • General Data Protection Regulation
  • Insights Hub
  • Life@QuestionPro
  • Market Research
  • Mobile diaries
  • Mobile Surveys
  • New Features
  • Online Communities
  • Question Types
  • Questionnaire
  • QuestionPro Products
  • Release Notes
  • Research Tools and Apps
  • Revenue at Risk
  • Survey Templates
  • Training Tips
  • Tuesday CX Thoughts (TCXT)
  • Uncategorized
  • What’s Coming Up
  • Workforce Intelligence

PW Skills | Blog

Data Analysis Techniques in Research – Methods, Tools & Examples

' src=

Varun Saharawat is a seasoned professional in the fields of SEO and content writing. With a profound knowledge of the intricate aspects of these disciplines, Varun has established himself as a valuable asset in the world of digital marketing and online content creation.

Data analysis techniques in research are essential because they allow researchers to derive meaningful insights from data sets to support their hypotheses or research objectives.

data analysis techniques in research

Data Analysis Techniques in Research : While various groups, institutions, and professionals may have diverse approaches to data analysis, a universal definition captures its essence. Data analysis involves refining, transforming, and interpreting raw data to derive actionable insights that guide informed decision-making for businesses.

A straightforward illustration of data analysis emerges when we make everyday decisions, basing our choices on past experiences or predictions of potential outcomes.

If you want to learn more about this topic and acquire valuable skills that will set you apart in today’s data-driven world, we highly recommend enrolling in the Data Analytics Course by Physics Wallah . And as a special offer for our readers, use the coupon code “READER” to get a discount on this course.

Table of Contents

What is Data Analysis?

Data analysis is the systematic process of inspecting, cleaning, transforming, and interpreting data with the objective of discovering valuable insights and drawing meaningful conclusions. This process involves several steps:

  • Inspecting : Initial examination of data to understand its structure, quality, and completeness.
  • Cleaning : Removing errors, inconsistencies, or irrelevant information to ensure accurate analysis.
  • Transforming : Converting data into a format suitable for analysis, such as normalization or aggregation.
  • Interpreting : Analyzing the transformed data to identify patterns, trends, and relationships.

Types of Data Analysis Techniques in Research

Data analysis techniques in research are categorized into qualitative and quantitative methods, each with its specific approaches and tools. These techniques are instrumental in extracting meaningful insights, patterns, and relationships from data to support informed decision-making, validate hypotheses, and derive actionable recommendations. Below is an in-depth exploration of the various types of data analysis techniques commonly employed in research:

1) Qualitative Analysis:

Definition: Qualitative analysis focuses on understanding non-numerical data, such as opinions, concepts, or experiences, to derive insights into human behavior, attitudes, and perceptions.

  • Content Analysis: Examines textual data, such as interview transcripts, articles, or open-ended survey responses, to identify themes, patterns, or trends.
  • Narrative Analysis: Analyzes personal stories or narratives to understand individuals’ experiences, emotions, or perspectives.
  • Ethnographic Studies: Involves observing and analyzing cultural practices, behaviors, and norms within specific communities or settings.

2) Quantitative Analysis:

Quantitative analysis emphasizes numerical data and employs statistical methods to explore relationships, patterns, and trends. It encompasses several approaches:

Descriptive Analysis:

  • Frequency Distribution: Represents the number of occurrences of distinct values within a dataset.
  • Central Tendency: Measures such as mean, median, and mode provide insights into the central values of a dataset.
  • Dispersion: Techniques like variance and standard deviation indicate the spread or variability of data.

Diagnostic Analysis:

  • Regression Analysis: Assesses the relationship between dependent and independent variables, enabling prediction or understanding causality.
  • ANOVA (Analysis of Variance): Examines differences between groups to identify significant variations or effects.

Predictive Analysis:

  • Time Series Forecasting: Uses historical data points to predict future trends or outcomes.
  • Machine Learning Algorithms: Techniques like decision trees, random forests, and neural networks predict outcomes based on patterns in data.

Prescriptive Analysis:

  • Optimization Models: Utilizes linear programming, integer programming, or other optimization techniques to identify the best solutions or strategies.
  • Simulation: Mimics real-world scenarios to evaluate various strategies or decisions and determine optimal outcomes.

Specific Techniques:

  • Monte Carlo Simulation: Models probabilistic outcomes to assess risk and uncertainty.
  • Factor Analysis: Reduces the dimensionality of data by identifying underlying factors or components.
  • Cohort Analysis: Studies specific groups or cohorts over time to understand trends, behaviors, or patterns within these groups.
  • Cluster Analysis: Classifies objects or individuals into homogeneous groups or clusters based on similarities or attributes.
  • Sentiment Analysis: Uses natural language processing and machine learning techniques to determine sentiment, emotions, or opinions from textual data.

Also Read: AI and Predictive Analytics: Examples, Tools, Uses, Ai Vs Predictive Analytics

Data Analysis Techniques in Research Examples

To provide a clearer understanding of how data analysis techniques are applied in research, let’s consider a hypothetical research study focused on evaluating the impact of online learning platforms on students’ academic performance.

Research Objective:

Determine if students using online learning platforms achieve higher academic performance compared to those relying solely on traditional classroom instruction.

Data Collection:

  • Quantitative Data: Academic scores (grades) of students using online platforms and those using traditional classroom methods.
  • Qualitative Data: Feedback from students regarding their learning experiences, challenges faced, and preferences.

Data Analysis Techniques Applied:

1) Descriptive Analysis:

  • Calculate the mean, median, and mode of academic scores for both groups.
  • Create frequency distributions to represent the distribution of grades in each group.

2) Diagnostic Analysis:

  • Conduct an Analysis of Variance (ANOVA) to determine if there’s a statistically significant difference in academic scores between the two groups.
  • Perform Regression Analysis to assess the relationship between the time spent on online platforms and academic performance.

3) Predictive Analysis:

  • Utilize Time Series Forecasting to predict future academic performance trends based on historical data.
  • Implement Machine Learning algorithms to develop a predictive model that identifies factors contributing to academic success on online platforms.

4) Prescriptive Analysis:

  • Apply Optimization Models to identify the optimal combination of online learning resources (e.g., video lectures, interactive quizzes) that maximize academic performance.
  • Use Simulation Techniques to evaluate different scenarios, such as varying student engagement levels with online resources, to determine the most effective strategies for improving learning outcomes.

5) Specific Techniques:

  • Conduct Factor Analysis on qualitative feedback to identify common themes or factors influencing students’ perceptions and experiences with online learning.
  • Perform Cluster Analysis to segment students based on their engagement levels, preferences, or academic outcomes, enabling targeted interventions or personalized learning strategies.
  • Apply Sentiment Analysis on textual feedback to categorize students’ sentiments as positive, negative, or neutral regarding online learning experiences.

By applying a combination of qualitative and quantitative data analysis techniques, this research example aims to provide comprehensive insights into the effectiveness of online learning platforms.

Also Read: Learning Path to Become a Data Analyst in 2024

Data Analysis Techniques in Quantitative Research

Quantitative research involves collecting numerical data to examine relationships, test hypotheses, and make predictions. Various data analysis techniques are employed to interpret and draw conclusions from quantitative data. Here are some key data analysis techniques commonly used in quantitative research:

1) Descriptive Statistics:

  • Description: Descriptive statistics are used to summarize and describe the main aspects of a dataset, such as central tendency (mean, median, mode), variability (range, variance, standard deviation), and distribution (skewness, kurtosis).
  • Applications: Summarizing data, identifying patterns, and providing initial insights into the dataset.

2) Inferential Statistics:

  • Description: Inferential statistics involve making predictions or inferences about a population based on a sample of data. This technique includes hypothesis testing, confidence intervals, t-tests, chi-square tests, analysis of variance (ANOVA), regression analysis, and correlation analysis.
  • Applications: Testing hypotheses, making predictions, and generalizing findings from a sample to a larger population.

3) Regression Analysis:

  • Description: Regression analysis is a statistical technique used to model and examine the relationship between a dependent variable and one or more independent variables. Linear regression, multiple regression, logistic regression, and nonlinear regression are common types of regression analysis .
  • Applications: Predicting outcomes, identifying relationships between variables, and understanding the impact of independent variables on the dependent variable.

4) Correlation Analysis:

  • Description: Correlation analysis is used to measure and assess the strength and direction of the relationship between two or more variables. The Pearson correlation coefficient, Spearman rank correlation coefficient, and Kendall’s tau are commonly used measures of correlation.
  • Applications: Identifying associations between variables and assessing the degree and nature of the relationship.

5) Factor Analysis:

  • Description: Factor analysis is a multivariate statistical technique used to identify and analyze underlying relationships or factors among a set of observed variables. It helps in reducing the dimensionality of data and identifying latent variables or constructs.
  • Applications: Identifying underlying factors or constructs, simplifying data structures, and understanding the underlying relationships among variables.

6) Time Series Analysis:

  • Description: Time series analysis involves analyzing data collected or recorded over a specific period at regular intervals to identify patterns, trends, and seasonality. Techniques such as moving averages, exponential smoothing, autoregressive integrated moving average (ARIMA), and Fourier analysis are used.
  • Applications: Forecasting future trends, analyzing seasonal patterns, and understanding time-dependent relationships in data.

7) ANOVA (Analysis of Variance):

  • Description: Analysis of variance (ANOVA) is a statistical technique used to analyze and compare the means of two or more groups or treatments to determine if they are statistically different from each other. One-way ANOVA, two-way ANOVA, and MANOVA (Multivariate Analysis of Variance) are common types of ANOVA.
  • Applications: Comparing group means, testing hypotheses, and determining the effects of categorical independent variables on a continuous dependent variable.

8) Chi-Square Tests:

  • Description: Chi-square tests are non-parametric statistical tests used to assess the association between categorical variables in a contingency table. The Chi-square test of independence, goodness-of-fit test, and test of homogeneity are common chi-square tests.
  • Applications: Testing relationships between categorical variables, assessing goodness-of-fit, and evaluating independence.

These quantitative data analysis techniques provide researchers with valuable tools and methods to analyze, interpret, and derive meaningful insights from numerical data. The selection of a specific technique often depends on the research objectives, the nature of the data, and the underlying assumptions of the statistical methods being used.

Also Read: Analysis vs. Analytics: How Are They Different?

Data Analysis Methods

Data analysis methods refer to the techniques and procedures used to analyze, interpret, and draw conclusions from data. These methods are essential for transforming raw data into meaningful insights, facilitating decision-making processes, and driving strategies across various fields. Here are some common data analysis methods:

  • Description: Descriptive statistics summarize and organize data to provide a clear and concise overview of the dataset. Measures such as mean, median, mode, range, variance, and standard deviation are commonly used.
  • Description: Inferential statistics involve making predictions or inferences about a population based on a sample of data. Techniques such as hypothesis testing, confidence intervals, and regression analysis are used.

3) Exploratory Data Analysis (EDA):

  • Description: EDA techniques involve visually exploring and analyzing data to discover patterns, relationships, anomalies, and insights. Methods such as scatter plots, histograms, box plots, and correlation matrices are utilized.
  • Applications: Identifying trends, patterns, outliers, and relationships within the dataset.

4) Predictive Analytics:

  • Description: Predictive analytics use statistical algorithms and machine learning techniques to analyze historical data and make predictions about future events or outcomes. Techniques such as regression analysis, time series forecasting, and machine learning algorithms (e.g., decision trees, random forests, neural networks) are employed.
  • Applications: Forecasting future trends, predicting outcomes, and identifying potential risks or opportunities.

5) Prescriptive Analytics:

  • Description: Prescriptive analytics involve analyzing data to recommend actions or strategies that optimize specific objectives or outcomes. Optimization techniques, simulation models, and decision-making algorithms are utilized.
  • Applications: Recommending optimal strategies, decision-making support, and resource allocation.

6) Qualitative Data Analysis:

  • Description: Qualitative data analysis involves analyzing non-numerical data, such as text, images, videos, or audio, to identify themes, patterns, and insights. Methods such as content analysis, thematic analysis, and narrative analysis are used.
  • Applications: Understanding human behavior, attitudes, perceptions, and experiences.

7) Big Data Analytics:

  • Description: Big data analytics methods are designed to analyze large volumes of structured and unstructured data to extract valuable insights. Technologies such as Hadoop, Spark, and NoSQL databases are used to process and analyze big data.
  • Applications: Analyzing large datasets, identifying trends, patterns, and insights from big data sources.

8) Text Analytics:

  • Description: Text analytics methods involve analyzing textual data, such as customer reviews, social media posts, emails, and documents, to extract meaningful information and insights. Techniques such as sentiment analysis, text mining, and natural language processing (NLP) are used.
  • Applications: Analyzing customer feedback, monitoring brand reputation, and extracting insights from textual data sources.

These data analysis methods are instrumental in transforming data into actionable insights, informing decision-making processes, and driving organizational success across various sectors, including business, healthcare, finance, marketing, and research. The selection of a specific method often depends on the nature of the data, the research objectives, and the analytical requirements of the project or organization.

Also Read: Quantitative Data Analysis: Types, Analysis & Examples

Data Analysis Tools

Data analysis tools are essential instruments that facilitate the process of examining, cleaning, transforming, and modeling data to uncover useful information, make informed decisions, and drive strategies. Here are some prominent data analysis tools widely used across various industries:

1) Microsoft Excel:

  • Description: A spreadsheet software that offers basic to advanced data analysis features, including pivot tables, data visualization tools, and statistical functions.
  • Applications: Data cleaning, basic statistical analysis, visualization, and reporting.

2) R Programming Language :

  • Description: An open-source programming language specifically designed for statistical computing and data visualization.
  • Applications: Advanced statistical analysis, data manipulation, visualization, and machine learning.

3) Python (with Libraries like Pandas, NumPy, Matplotlib, and Seaborn):

  • Description: A versatile programming language with libraries that support data manipulation, analysis, and visualization.
  • Applications: Data cleaning, statistical analysis, machine learning, and data visualization.

4) SPSS (Statistical Package for the Social Sciences):

  • Description: A comprehensive statistical software suite used for data analysis, data mining, and predictive analytics.
  • Applications: Descriptive statistics, hypothesis testing, regression analysis, and advanced analytics.

5) SAS (Statistical Analysis System):

  • Description: A software suite used for advanced analytics, multivariate analysis, and predictive modeling.
  • Applications: Data management, statistical analysis, predictive modeling, and business intelligence.

6) Tableau:

  • Description: A data visualization tool that allows users to create interactive and shareable dashboards and reports.
  • Applications: Data visualization , business intelligence , and interactive dashboard creation.

7) Power BI:

  • Description: A business analytics tool developed by Microsoft that provides interactive visualizations and business intelligence capabilities.
  • Applications: Data visualization, business intelligence, reporting, and dashboard creation.

8) SQL (Structured Query Language) Databases (e.g., MySQL, PostgreSQL, Microsoft SQL Server):

  • Description: Database management systems that support data storage, retrieval, and manipulation using SQL queries.
  • Applications: Data retrieval, data cleaning, data transformation, and database management.

9) Apache Spark:

  • Description: A fast and general-purpose distributed computing system designed for big data processing and analytics.
  • Applications: Big data processing, machine learning, data streaming, and real-time analytics.

10) IBM SPSS Modeler:

  • Description: A data mining software application used for building predictive models and conducting advanced analytics.
  • Applications: Predictive modeling, data mining, statistical analysis, and decision optimization.

These tools serve various purposes and cater to different data analysis needs, from basic statistical analysis and data visualization to advanced analytics, machine learning, and big data processing. The choice of a specific tool often depends on the nature of the data, the complexity of the analysis, and the specific requirements of the project or organization.

Also Read: How to Analyze Survey Data: Methods & Examples

Importance of Data Analysis in Research

The importance of data analysis in research cannot be overstated; it serves as the backbone of any scientific investigation or study. Here are several key reasons why data analysis is crucial in the research process:

  • Data analysis helps ensure that the results obtained are valid and reliable. By systematically examining the data, researchers can identify any inconsistencies or anomalies that may affect the credibility of the findings.
  • Effective data analysis provides researchers with the necessary information to make informed decisions. By interpreting the collected data, researchers can draw conclusions, make predictions, or formulate recommendations based on evidence rather than intuition or guesswork.
  • Data analysis allows researchers to identify patterns, trends, and relationships within the data. This can lead to a deeper understanding of the research topic, enabling researchers to uncover insights that may not be immediately apparent.
  • In empirical research, data analysis plays a critical role in testing hypotheses. Researchers collect data to either support or refute their hypotheses, and data analysis provides the tools and techniques to evaluate these hypotheses rigorously.
  • Transparent and well-executed data analysis enhances the credibility of research findings. By clearly documenting the data analysis methods and procedures, researchers allow others to replicate the study, thereby contributing to the reproducibility of research findings.
  • In fields such as business or healthcare, data analysis helps organizations allocate resources more efficiently. By analyzing data on consumer behavior, market trends, or patient outcomes, organizations can make strategic decisions about resource allocation, budgeting, and planning.
  • In public policy and social sciences, data analysis is instrumental in developing and evaluating policies and interventions. By analyzing data on social, economic, or environmental factors, policymakers can assess the effectiveness of existing policies and inform the development of new ones.
  • Data analysis allows for continuous improvement in research methods and practices. By analyzing past research projects, identifying areas for improvement, and implementing changes based on data-driven insights, researchers can refine their approaches and enhance the quality of future research endeavors.

However, it is important to remember that mastering these techniques requires practice and continuous learning. That’s why we highly recommend the Data Analytics Course by Physics Wallah . Not only does it cover all the fundamentals of data analysis, but it also provides hands-on experience with various tools such as Excel, Python, and Tableau. Plus, if you use the “ READER ” coupon code at checkout, you can get a special discount on the course.

For Latest Tech Related Information, Join Our Official Free Telegram Group : PW Skills Telegram Group

Data Analysis Techniques in Research FAQs

What are the 5 techniques for data analysis.

The five techniques for data analysis include: Descriptive Analysis Diagnostic Analysis Predictive Analysis Prescriptive Analysis Qualitative Analysis

What are techniques of data analysis in research?

Techniques of data analysis in research encompass both qualitative and quantitative methods. These techniques involve processes like summarizing raw data, investigating causes of events, forecasting future outcomes, offering recommendations based on predictions, and examining non-numerical data to understand concepts or experiences.

What are the 3 methods of data analysis?

The three primary methods of data analysis are: Qualitative Analysis Quantitative Analysis Mixed-Methods Analysis

What are the four types of data analysis techniques?

The four types of data analysis techniques are: Descriptive Analysis Diagnostic Analysis Predictive Analysis Prescriptive Analysis

  • Data Handling – Definition, Steps, Graphical Representation, Examples

Data Handling

Data handling is the process of collecting, managing, and representing data in such a way that it is easy to…

  • Data Modeling – Overview, Concepts, and Types

research methodology and data analysis

Data modeling is the process of creating visual representations of data structures to define how Data is stored, connected, and…

  • Descriptive Analytics: What It Is and Related Terms

descriptive analytics

Descriptive analytics is the process of analyzing past data to understand what has happened. Read this article to understand what…

right adv

Related Articles

  • What is Prescriptive Analytics? Definition & Examples
  • 10 Best Companies For Data Analysis Internships 2024
  • 9 Deep Learning Books to Check Out!
  • Top Best Big Data Analytics Classes 2024
  • Data Analyst Roadmap 2024: Responsibilities, Skills Required, Career Path
  • The Best Data And Analytics Courses For Beginners
  • Best Courses For Data Analytics: Top 10 Courses For Your Career in Trend

bottom banner

  • University Libraries
  • Research Guides
  • Topic Guides
  • Research Methods Guide
  • Data Analysis

Research Methods Guide: Data Analysis

  • Introduction
  • Research Design & Method
  • Survey Research
  • Interview Research
  • Resources & Consultation

Tools for Analyzing Survey Data

  • R (open source)
  • Stata 
  • DataCracker (free up to 100 responses per survey)
  • SurveyMonkey (free up to 100 responses per survey)

Tools for Analyzing Interview Data

  • AQUAD (open source)
  • NVivo 

Data Analysis and Presentation Techniques that Apply to both Survey and Interview Research

  • Create a documentation of the data and the process of data collection.
  • Analyze the data rather than just describing it - use it to tell a story that focuses on answering the research question.
  • Use charts or tables to help the reader understand the data and then highlight the most interesting findings.
  • Don’t get bogged down in the detail - tell the reader about the main themes as they relate to the research question, rather than reporting everything that survey respondents or interviewees said.
  • State that ‘most people said …’ or ‘few people felt …’ rather than giving the number of people who said a particular thing.
  • Use brief quotes where these illustrate a particular point really well.
  • Respect confidentiality - you could attribute a quote to 'a faculty member', ‘a student’, or 'a customer' rather than ‘Dr. Nicholls.'

Survey Data Analysis

  • If you used an online survey, the software will automatically collate the data – you will just need to download the data, for example as a spreadsheet.
  • If you used a paper questionnaire, you will need to manually transfer the responses from the questionnaires into a spreadsheet.  Put each question number as a column heading, and use one row for each person’s answers.  Then assign each possible answer a number or ‘code’.
  • When all the data is present and correct, calculate how many people selected each response.
  • Once you have calculated how many people selected each response, you can set up tables and/or graph to display the data.  This could take the form of a table or chart.
  • In addition to descriptive statistics that characterize findings from your survey, you can use statistical and analytical reporting techniques if needed.

Interview Data Analysis

  • Data Reduction and Organization: Try not to feel overwhelmed by quantity of information that has been collected from interviews- a one-hour interview can generate 20 to 25 pages of single-spaced text.   Once you start organizing your fieldwork notes around themes, you can easily identify which part of your data to be used for further analysis.
  • What were the main issues or themes that struck you in this contact / interviewee?"
  • Was there anything else that struck you as salient, interesting, illuminating or important in this contact / interviewee? 
  • What information did you get (or failed to get) on each of the target questions you had for this contact / interviewee?
  • Connection of the data: You can connect data around themes and concepts - then you can show how one concept may influence another.
  • Examination of Relationships: Examining relationships is the centerpiece of the analytic process, because it allows you to move from simple description of the people and settings to explanations of why things happened as they did with those people in that setting.
  • << Previous: Interview Research
  • Next: Resources & Consultation >>
  • Last Updated: Aug 21, 2023 10:42 AM

Have a language expert improve your writing

Run a free plagiarism check in 10 minutes, generate accurate citations for free.

  • Knowledge Base

Methodology

  • What Is a Research Design | Types, Guide & Examples

What Is a Research Design | Types, Guide & Examples

Published on June 7, 2021 by Shona McCombes . Revised on September 5, 2024 by Pritha Bhandari.

A research design is a strategy for answering your   research question  using empirical data. Creating a research design means making decisions about:

  • Your overall research objectives and approach
  • Whether you’ll rely on primary research or secondary research
  • Your sampling methods or criteria for selecting subjects
  • Your data collection methods
  • The procedures you’ll follow to collect data
  • Your data analysis methods

A well-planned research design helps ensure that your methods match your research objectives and that you use the right kind of analysis for your data.

You might have to write up a research design as a standalone assignment, or it might be part of a larger   research proposal or other project. In either case, you should carefully consider which methods are most appropriate and feasible for answering your question.

Table of contents

Step 1: consider your aims and approach, step 2: choose a type of research design, step 3: identify your population and sampling method, step 4: choose your data collection methods, step 5: plan your data collection procedures, step 6: decide on your data analysis strategies, other interesting articles, frequently asked questions about research design.

  • Introduction

Before you can start designing your research, you should already have a clear idea of the research question you want to investigate.

There are many different ways you could go about answering this question. Your research design choices should be driven by your aims and priorities—start by thinking carefully about what you want to achieve.

The first choice you need to make is whether you’ll take a qualitative or quantitative approach.

Qualitative approach Quantitative approach
and describe frequencies, averages, and correlations about relationships between variables

Qualitative research designs tend to be more flexible and inductive , allowing you to adjust your approach based on what you find throughout the research process.

Quantitative research designs tend to be more fixed and deductive , with variables and hypotheses clearly defined in advance of data collection.

It’s also possible to use a mixed-methods design that integrates aspects of both approaches. By combining qualitative and quantitative insights, you can gain a more complete picture of the problem you’re studying and strengthen the credibility of your conclusions.

Practical and ethical considerations when designing research

As well as scientific considerations, you need to think practically when designing your research. If your research involves people or animals, you also need to consider research ethics .

  • How much time do you have to collect data and write up the research?
  • Will you be able to gain access to the data you need (e.g., by travelling to a specific location or contacting specific people)?
  • Do you have the necessary research skills (e.g., statistical analysis or interview techniques)?
  • Will you need ethical approval ?

At each stage of the research design process, make sure that your choices are practically feasible.

Here's why students love Scribbr's proofreading services

Discover proofreading & editing

Within both qualitative and quantitative approaches, there are several types of research design to choose from. Each type provides a framework for the overall shape of your research.

Types of quantitative research designs

Quantitative designs can be split into four main types.

  • Experimental and   quasi-experimental designs allow you to test cause-and-effect relationships
  • Descriptive and correlational designs allow you to measure variables and describe relationships between them.
Type of design Purpose and characteristics
Experimental relationships effect on a
Quasi-experimental )
Correlational
Descriptive

With descriptive and correlational designs, you can get a clear picture of characteristics, trends and relationships as they exist in the real world. However, you can’t draw conclusions about cause and effect (because correlation doesn’t imply causation ).

Experiments are the strongest way to test cause-and-effect relationships without the risk of other variables influencing the results. However, their controlled conditions may not always reflect how things work in the real world. They’re often also more difficult and expensive to implement.

Types of qualitative research designs

Qualitative designs are less strictly defined. This approach is about gaining a rich, detailed understanding of a specific context or phenomenon, and you can often be more creative and flexible in designing your research.

The table below shows some common types of qualitative design. They often have similar approaches in terms of data collection, but focus on different aspects when analyzing the data.

Type of design Purpose and characteristics
Grounded theory
Phenomenology

Your research design should clearly define who or what your research will focus on, and how you’ll go about choosing your participants or subjects.

In research, a population is the entire group that you want to draw conclusions about, while a sample is the smaller group of individuals you’ll actually collect data from.

Defining the population

A population can be made up of anything you want to study—plants, animals, organizations, texts, countries, etc. In the social sciences, it most often refers to a group of people.

For example, will you focus on people from a specific demographic, region or background? Are you interested in people with a certain job or medical condition, or users of a particular product?

The more precisely you define your population, the easier it will be to gather a representative sample.

  • Sampling methods

Even with a narrowly defined population, it’s rarely possible to collect data from every individual. Instead, you’ll collect data from a sample.

To select a sample, there are two main approaches: probability sampling and non-probability sampling . The sampling method you use affects how confidently you can generalize your results to the population as a whole.

Probability sampling Non-probability sampling

Probability sampling is the most statistically valid option, but it’s often difficult to achieve unless you’re dealing with a very small and accessible population.

For practical reasons, many studies use non-probability sampling, but it’s important to be aware of the limitations and carefully consider potential biases. You should always make an effort to gather a sample that’s as representative as possible of the population.

Case selection in qualitative research

In some types of qualitative designs, sampling may not be relevant.

For example, in an ethnography or a case study , your aim is to deeply understand a specific context, not to generalize to a population. Instead of sampling, you may simply aim to collect as much data as possible about the context you are studying.

In these types of design, you still have to carefully consider your choice of case or community. You should have a clear rationale for why this particular case is suitable for answering your research question .

For example, you might choose a case study that reveals an unusual or neglected aspect of your research problem, or you might choose several very similar or very different cases in order to compare them.

Data collection methods are ways of directly measuring variables and gathering information. They allow you to gain first-hand knowledge and original insights into your research problem.

You can choose just one data collection method, or use several methods in the same study.

Survey methods

Surveys allow you to collect data about opinions, behaviors, experiences, and characteristics by asking people directly. There are two main survey methods to choose from: questionnaires and interviews .

Questionnaires Interviews
)

Observation methods

Observational studies allow you to collect data unobtrusively, observing characteristics, behaviors or social interactions without relying on self-reporting.

Observations may be conducted in real time, taking notes as you observe, or you might make audiovisual recordings for later analysis. They can be qualitative or quantitative.

Quantitative observation

Other methods of data collection

There are many other ways you might collect data depending on your field and topic.

Field Examples of data collection methods
Media & communication Collecting a sample of texts (e.g., speeches, articles, or social media posts) for data on cultural norms and narratives
Psychology Using technologies like neuroimaging, eye-tracking, or computer-based tasks to collect data on things like attention, emotional response, or reaction time
Education Using tests or assignments to collect data on knowledge and skills
Physical sciences Using scientific instruments to collect data on things like weight, blood pressure, or chemical composition

If you’re not sure which methods will work best for your research design, try reading some papers in your field to see what kinds of data collection methods they used.

Secondary data

If you don’t have the time or resources to collect data from the population you’re interested in, you can also choose to use secondary data that other researchers already collected—for example, datasets from government surveys or previous studies on your topic.

With this raw data, you can do your own analysis to answer new research questions that weren’t addressed by the original study.

Using secondary data can expand the scope of your research, as you may be able to access much larger and more varied samples than you could collect yourself.

However, it also means you don’t have any control over which variables to measure or how to measure them, so the conclusions you can draw may be limited.

Prevent plagiarism. Run a free check.

As well as deciding on your methods, you need to plan exactly how you’ll use these methods to collect data that’s consistent, accurate, and unbiased.

Planning systematic procedures is especially important in quantitative research, where you need to precisely define your variables and ensure your measurements are high in reliability and validity.

Operationalization

Some variables, like height or age, are easily measured. But often you’ll be dealing with more abstract concepts, like satisfaction, anxiety, or competence. Operationalization means turning these fuzzy ideas into measurable indicators.

If you’re using observations , which events or actions will you count?

If you’re using surveys , which questions will you ask and what range of responses will be offered?

You may also choose to use or adapt existing materials designed to measure the concept you’re interested in—for example, questionnaires or inventories whose reliability and validity has already been established.

Reliability and validity

Reliability means your results can be consistently reproduced, while validity means that you’re actually measuring the concept you’re interested in.

Reliability Validity
) )

For valid and reliable results, your measurement materials should be thoroughly researched and carefully designed. Plan your procedures to make sure you carry out the same steps in the same way for each participant.

If you’re developing a new questionnaire or other instrument to measure a specific concept, running a pilot study allows you to check its validity and reliability in advance.

Sampling procedures

As well as choosing an appropriate sampling method , you need a concrete plan for how you’ll actually contact and recruit your selected sample.

That means making decisions about things like:

  • How many participants do you need for an adequate sample size?
  • What inclusion and exclusion criteria will you use to identify eligible participants?
  • How will you contact your sample—by mail, online, by phone, or in person?

If you’re using a probability sampling method , it’s important that everyone who is randomly selected actually participates in the study. How will you ensure a high response rate?

If you’re using a non-probability method , how will you avoid research bias and ensure a representative sample?

Data management

It’s also important to create a data management plan for organizing and storing your data.

Will you need to transcribe interviews or perform data entry for observations? You should anonymize and safeguard any sensitive data, and make sure it’s backed up regularly.

Keeping your data well-organized will save time when it comes to analyzing it. It can also help other researchers validate and add to your findings (high replicability ).

On its own, raw data can’t answer your research question. The last step of designing your research is planning how you’ll analyze the data.

Quantitative data analysis

In quantitative research, you’ll most likely use some form of statistical analysis . With statistics, you can summarize your sample data, make estimates, and test hypotheses.

Using descriptive statistics , you can summarize your sample data in terms of:

  • The distribution of the data (e.g., the frequency of each score on a test)
  • The central tendency of the data (e.g., the mean to describe the average score)
  • The variability of the data (e.g., the standard deviation to describe how spread out the scores are)

The specific calculations you can do depend on the level of measurement of your variables.

Using inferential statistics , you can:

  • Make estimates about the population based on your sample data.
  • Test hypotheses about a relationship between variables.

Regression and correlation tests look for associations between two or more variables, while comparison tests (such as t tests and ANOVAs ) look for differences in the outcomes of different groups.

Your choice of statistical test depends on various aspects of your research design, including the types of variables you’re dealing with and the distribution of your data.

Qualitative data analysis

In qualitative research, your data will usually be very dense with information and ideas. Instead of summing it up in numbers, you’ll need to comb through the data in detail, interpret its meanings, identify patterns, and extract the parts that are most relevant to your research question.

Two of the most common approaches to doing this are thematic analysis and discourse analysis .

Approach Characteristics
Thematic analysis
Discourse analysis

There are many other ways of analyzing qualitative data depending on the aims of your research. To get a sense of potential approaches, try reading some qualitative research papers in your field.

If you want to know more about the research process , methodology , research bias , or statistics , make sure to check out some of our other articles with explanations and examples.

  • Simple random sampling
  • Stratified sampling
  • Cluster sampling
  • Likert scales
  • Reproducibility

 Statistics

  • Null hypothesis
  • Statistical power
  • Probability distribution
  • Effect size
  • Poisson distribution

Research bias

  • Optimism bias
  • Cognitive bias
  • Implicit bias
  • Hawthorne effect
  • Anchoring bias
  • Explicit bias

A research design is a strategy for answering your   research question . It defines your overall approach and determines how you will collect and analyze data.

A well-planned research design helps ensure that your methods match your research aims, that you collect high-quality data, and that you use the right kind of analysis to answer your questions, utilizing credible sources . This allows you to draw valid , trustworthy conclusions.

Quantitative research designs can be divided into two main categories:

  • Correlational and descriptive designs are used to investigate characteristics, averages, trends, and associations between variables.
  • Experimental and quasi-experimental designs are used to test causal relationships .

Qualitative research designs tend to be more flexible. Common types of qualitative design include case study , ethnography , and grounded theory designs.

The priorities of a research design can vary depending on the field, but you usually have to specify:

  • Your research questions and/or hypotheses
  • Your overall approach (e.g., qualitative or quantitative )
  • The type of design you’re using (e.g., a survey , experiment , or case study )
  • Your data collection methods (e.g., questionnaires , observations)
  • Your data collection procedures (e.g., operationalization , timing and data management)
  • Your data analysis methods (e.g., statistical tests  or thematic analysis )

A sample is a subset of individuals from a larger population . Sampling means selecting the group that you will actually collect data from in your research. For example, if you are researching the opinions of students in your university, you could survey a sample of 100 students.

In statistics, sampling allows you to test a hypothesis about the characteristics of a population.

Operationalization means turning abstract conceptual ideas into measurable observations.

For example, the concept of social anxiety isn’t directly observable, but it can be operationally defined in terms of self-rating scores, behavioral avoidance of crowded places, or physical anxiety symptoms in social situations.

Before collecting data , it’s important to consider how you will operationalize the variables that you want to measure.

A research project is an academic, scientific, or professional undertaking to answer a research question . Research projects can take many forms, such as qualitative or quantitative , descriptive , longitudinal , experimental , or correlational . What kind of research approach you choose will depend on your topic.

Cite this Scribbr article

If you want to cite this source, you can copy and paste the citation or click the “Cite this Scribbr article” button to automatically add the citation to our free Citation Generator.

McCombes, S. (2024, September 05). What Is a Research Design | Types, Guide & Examples. Scribbr. Retrieved September 18, 2024, from https://www.scribbr.com/methodology/research-design/

Is this article helpful?

Shona McCombes

Data collection

Data analysis at the Armstrong Flight Research Center in Palmdale, California

data analysis

Our editors will review what you’ve submitted and determine whether to revise the article.

  • Academia - Data Analysis
  • U.S. Department of Health and Human Services - Office of Research Integrity - Data Analysis
  • Chemistry LibreTexts - Data Analysis
  • IBM - What is Exploratory Data Analysis?
  • Table Of Contents

Data analysis at the Armstrong Flight Research Center in Palmdale, California

data analysis , the process of systematically collecting, cleaning, transforming, describing, modeling, and interpreting data , generally employing statistical techniques. Data analysis is an important part of both scientific research and business, where demand has grown in recent years for data-driven decision making . Data analysis techniques are used to gain useful insights from datasets, which can then be used to make operational decisions or guide future research . With the rise of “ big data ,” the storage of vast quantities of data in large databases and data warehouses, there is increasing need to apply data analysis techniques to generate insights about volumes of data too large to be manipulated by instruments of low information-processing capacity.

Datasets are collections of information. Generally, data and datasets are themselves collected to help answer questions, make decisions, or otherwise inform reasoning. The rise of information technology has led to the generation of vast amounts of data of many kinds, such as text, pictures, videos, personal information, account data, and metadata, the last of which provide information about other data. It is common for apps and websites to collect data about how their products are used or about the people using their platforms. Consequently, there is vastly more data being collected today than at any other time in human history. A single business may track billions of interactions with millions of consumers at hundreds of locations with thousands of employees and any number of products. Analyzing that volume of data is generally only possible using specialized computational and statistical techniques.

The desire for businesses to make the best use of their data has led to the development of the field of business intelligence , which covers a variety of tools and techniques that allow businesses to perform data analysis on the information they collect.

For data to be analyzed, it must first be collected and stored. Raw data must be processed into a format that can be used for analysis and be cleaned so that errors and inconsistencies are minimized. Data can be stored in many ways, but one of the most useful is in a database . A database is a collection of interrelated data organized so that certain records (collections of data related to a single entity) can be retrieved on the basis of various criteria . The most familiar kind of database is the relational database , which stores data in tables with rows that represent records (tuples) and columns that represent fields (attributes). A query is a command that retrieves a subset of the information in the database according to certain criteria. A query may retrieve only records that meet certain criteria, or it may join fields from records across multiple tables by use of a common field.

Frequently, data from many sources is collected into large archives of data called data warehouses. The process of moving data from its original sources (such as databases) to a centralized location (generally a data warehouse) is called ETL (which stands for extract , transform , and load ).

  • The extraction step occurs when you identify and copy or export the desired data from its source, such as by running a database query to retrieve the desired records.
  • The transformation step is the process of cleaning the data so that they fit the analytical need for the data and the schema of the data warehouse. This may involve changing formats for certain fields, removing duplicate records, or renaming fields, among other processes.
  • Finally, the clean data are loaded into the data warehouse, where they may join vast amounts of historical data and data from other sources.

After data are effectively collected and cleaned, they can be analyzed with a variety of techniques. Analysis often begins with descriptive and exploratory data analysis. Descriptive data analysis uses statistics to organize and summarize data, making it easier to understand the broad qualities of the dataset. Exploratory data analysis looks for insights into the data that may arise from descriptions of distribution, central tendency, or variability for a single data field. Further relationships between data may become apparent by examining two fields together. Visualizations may be employed during analysis, such as histograms (graphs in which the length of a bar indicates a quantity) or stem-and-leaf plots (which divide data into buckets, or “stems,” with individual data points serving as “leaves” on the stem).

research methodology and data analysis

Data analysis frequently goes beyond descriptive analysis to predictive analysis, making predictions about the future using predictive modeling techniques. Predictive modeling uses machine learning , regression analysis methods (which mathematically calculate the relationship between an independent variable and a dependent variable), and classification techniques to identify trends and relationships among variables. Predictive analysis may involve data mining , which is the process of discovering interesting or useful patterns in large volumes of information. Data mining often involves cluster analysis , which tries to find natural groupings within data, and anomaly detection , which detects instances in data that are unusual and stand out from other patterns. It may also look for rules within datasets, strong relationships among variables in the data.

  • Privacy Policy

Research Method

Home » Research Methodology – Types, Examples and writing Guide

Research Methodology – Types, Examples and writing Guide

Table of Contents

Research Methodology

Research Methodology

Definition:

Research Methodology refers to the systematic and scientific approach used to conduct research, investigate problems, and gather data and information for a specific purpose. It involves the techniques and procedures used to identify, collect , analyze , and interpret data to answer research questions or solve research problems . Moreover, They are philosophical and theoretical frameworks that guide the research process.

Structure of Research Methodology

Research methodology formats can vary depending on the specific requirements of the research project, but the following is a basic example of a structure for a research methodology section:

I. Introduction

  • Provide an overview of the research problem and the need for a research methodology section
  • Outline the main research questions and objectives

II. Research Design

  • Explain the research design chosen and why it is appropriate for the research question(s) and objectives
  • Discuss any alternative research designs considered and why they were not chosen
  • Describe the research setting and participants (if applicable)

III. Data Collection Methods

  • Describe the methods used to collect data (e.g., surveys, interviews, observations)
  • Explain how the data collection methods were chosen and why they are appropriate for the research question(s) and objectives
  • Detail any procedures or instruments used for data collection

IV. Data Analysis Methods

  • Describe the methods used to analyze the data (e.g., statistical analysis, content analysis )
  • Explain how the data analysis methods were chosen and why they are appropriate for the research question(s) and objectives
  • Detail any procedures or software used for data analysis

V. Ethical Considerations

  • Discuss any ethical issues that may arise from the research and how they were addressed
  • Explain how informed consent was obtained (if applicable)
  • Detail any measures taken to ensure confidentiality and anonymity

VI. Limitations

  • Identify any potential limitations of the research methodology and how they may impact the results and conclusions

VII. Conclusion

  • Summarize the key aspects of the research methodology section
  • Explain how the research methodology addresses the research question(s) and objectives

Research Methodology Types

Types of Research Methodology are as follows:

Quantitative Research Methodology

This is a research methodology that involves the collection and analysis of numerical data using statistical methods. This type of research is often used to study cause-and-effect relationships and to make predictions.

Qualitative Research Methodology

This is a research methodology that involves the collection and analysis of non-numerical data such as words, images, and observations. This type of research is often used to explore complex phenomena, to gain an in-depth understanding of a particular topic, and to generate hypotheses.

Mixed-Methods Research Methodology

This is a research methodology that combines elements of both quantitative and qualitative research. This approach can be particularly useful for studies that aim to explore complex phenomena and to provide a more comprehensive understanding of a particular topic.

Case Study Research Methodology

This is a research methodology that involves in-depth examination of a single case or a small number of cases. Case studies are often used in psychology, sociology, and anthropology to gain a detailed understanding of a particular individual or group.

Action Research Methodology

This is a research methodology that involves a collaborative process between researchers and practitioners to identify and solve real-world problems. Action research is often used in education, healthcare, and social work.

Experimental Research Methodology

This is a research methodology that involves the manipulation of one or more independent variables to observe their effects on a dependent variable. Experimental research is often used to study cause-and-effect relationships and to make predictions.

Survey Research Methodology

This is a research methodology that involves the collection of data from a sample of individuals using questionnaires or interviews. Survey research is often used to study attitudes, opinions, and behaviors.

Grounded Theory Research Methodology

This is a research methodology that involves the development of theories based on the data collected during the research process. Grounded theory is often used in sociology and anthropology to generate theories about social phenomena.

Research Methodology Example

An Example of Research Methodology could be the following:

Research Methodology for Investigating the Effectiveness of Cognitive Behavioral Therapy in Reducing Symptoms of Depression in Adults

Introduction:

The aim of this research is to investigate the effectiveness of cognitive-behavioral therapy (CBT) in reducing symptoms of depression in adults. To achieve this objective, a randomized controlled trial (RCT) will be conducted using a mixed-methods approach.

Research Design:

The study will follow a pre-test and post-test design with two groups: an experimental group receiving CBT and a control group receiving no intervention. The study will also include a qualitative component, in which semi-structured interviews will be conducted with a subset of participants to explore their experiences of receiving CBT.

Participants:

Participants will be recruited from community mental health clinics in the local area. The sample will consist of 100 adults aged 18-65 years old who meet the diagnostic criteria for major depressive disorder. Participants will be randomly assigned to either the experimental group or the control group.

Intervention :

The experimental group will receive 12 weekly sessions of CBT, each lasting 60 minutes. The intervention will be delivered by licensed mental health professionals who have been trained in CBT. The control group will receive no intervention during the study period.

Data Collection:

Quantitative data will be collected through the use of standardized measures such as the Beck Depression Inventory-II (BDI-II) and the Generalized Anxiety Disorder-7 (GAD-7). Data will be collected at baseline, immediately after the intervention, and at a 3-month follow-up. Qualitative data will be collected through semi-structured interviews with a subset of participants from the experimental group. The interviews will be conducted at the end of the intervention period, and will explore participants’ experiences of receiving CBT.

Data Analysis:

Quantitative data will be analyzed using descriptive statistics, t-tests, and mixed-model analyses of variance (ANOVA) to assess the effectiveness of the intervention. Qualitative data will be analyzed using thematic analysis to identify common themes and patterns in participants’ experiences of receiving CBT.

Ethical Considerations:

This study will comply with ethical guidelines for research involving human subjects. Participants will provide informed consent before participating in the study, and their privacy and confidentiality will be protected throughout the study. Any adverse events or reactions will be reported and managed appropriately.

Data Management:

All data collected will be kept confidential and stored securely using password-protected databases. Identifying information will be removed from qualitative data transcripts to ensure participants’ anonymity.

Limitations:

One potential limitation of this study is that it only focuses on one type of psychotherapy, CBT, and may not generalize to other types of therapy or interventions. Another limitation is that the study will only include participants from community mental health clinics, which may not be representative of the general population.

Conclusion:

This research aims to investigate the effectiveness of CBT in reducing symptoms of depression in adults. By using a randomized controlled trial and a mixed-methods approach, the study will provide valuable insights into the mechanisms underlying the relationship between CBT and depression. The results of this study will have important implications for the development of effective treatments for depression in clinical settings.

How to Write Research Methodology

Writing a research methodology involves explaining the methods and techniques you used to conduct research, collect data, and analyze results. It’s an essential section of any research paper or thesis, as it helps readers understand the validity and reliability of your findings. Here are the steps to write a research methodology:

  • Start by explaining your research question: Begin the methodology section by restating your research question and explaining why it’s important. This helps readers understand the purpose of your research and the rationale behind your methods.
  • Describe your research design: Explain the overall approach you used to conduct research. This could be a qualitative or quantitative research design, experimental or non-experimental, case study or survey, etc. Discuss the advantages and limitations of the chosen design.
  • Discuss your sample: Describe the participants or subjects you included in your study. Include details such as their demographics, sampling method, sample size, and any exclusion criteria used.
  • Describe your data collection methods : Explain how you collected data from your participants. This could include surveys, interviews, observations, questionnaires, or experiments. Include details on how you obtained informed consent, how you administered the tools, and how you minimized the risk of bias.
  • Explain your data analysis techniques: Describe the methods you used to analyze the data you collected. This could include statistical analysis, content analysis, thematic analysis, or discourse analysis. Explain how you dealt with missing data, outliers, and any other issues that arose during the analysis.
  • Discuss the validity and reliability of your research : Explain how you ensured the validity and reliability of your study. This could include measures such as triangulation, member checking, peer review, or inter-coder reliability.
  • Acknowledge any limitations of your research: Discuss any limitations of your study, including any potential threats to validity or generalizability. This helps readers understand the scope of your findings and how they might apply to other contexts.
  • Provide a summary: End the methodology section by summarizing the methods and techniques you used to conduct your research. This provides a clear overview of your research methodology and helps readers understand the process you followed to arrive at your findings.

When to Write Research Methodology

Research methodology is typically written after the research proposal has been approved and before the actual research is conducted. It should be written prior to data collection and analysis, as it provides a clear roadmap for the research project.

The research methodology is an important section of any research paper or thesis, as it describes the methods and procedures that will be used to conduct the research. It should include details about the research design, data collection methods, data analysis techniques, and any ethical considerations.

The methodology should be written in a clear and concise manner, and it should be based on established research practices and standards. It is important to provide enough detail so that the reader can understand how the research was conducted and evaluate the validity of the results.

Applications of Research Methodology

Here are some of the applications of research methodology:

  • To identify the research problem: Research methodology is used to identify the research problem, which is the first step in conducting any research.
  • To design the research: Research methodology helps in designing the research by selecting the appropriate research method, research design, and sampling technique.
  • To collect data: Research methodology provides a systematic approach to collect data from primary and secondary sources.
  • To analyze data: Research methodology helps in analyzing the collected data using various statistical and non-statistical techniques.
  • To test hypotheses: Research methodology provides a framework for testing hypotheses and drawing conclusions based on the analysis of data.
  • To generalize findings: Research methodology helps in generalizing the findings of the research to the target population.
  • To develop theories : Research methodology is used to develop new theories and modify existing theories based on the findings of the research.
  • To evaluate programs and policies : Research methodology is used to evaluate the effectiveness of programs and policies by collecting data and analyzing it.
  • To improve decision-making: Research methodology helps in making informed decisions by providing reliable and valid data.

Purpose of Research Methodology

Research methodology serves several important purposes, including:

  • To guide the research process: Research methodology provides a systematic framework for conducting research. It helps researchers to plan their research, define their research questions, and select appropriate methods and techniques for collecting and analyzing data.
  • To ensure research quality: Research methodology helps researchers to ensure that their research is rigorous, reliable, and valid. It provides guidelines for minimizing bias and error in data collection and analysis, and for ensuring that research findings are accurate and trustworthy.
  • To replicate research: Research methodology provides a clear and detailed account of the research process, making it possible for other researchers to replicate the study and verify its findings.
  • To advance knowledge: Research methodology enables researchers to generate new knowledge and to contribute to the body of knowledge in their field. It provides a means for testing hypotheses, exploring new ideas, and discovering new insights.
  • To inform decision-making: Research methodology provides evidence-based information that can inform policy and decision-making in a variety of fields, including medicine, public health, education, and business.

Advantages of Research Methodology

Research methodology has several advantages that make it a valuable tool for conducting research in various fields. Here are some of the key advantages of research methodology:

  • Systematic and structured approach : Research methodology provides a systematic and structured approach to conducting research, which ensures that the research is conducted in a rigorous and comprehensive manner.
  • Objectivity : Research methodology aims to ensure objectivity in the research process, which means that the research findings are based on evidence and not influenced by personal bias or subjective opinions.
  • Replicability : Research methodology ensures that research can be replicated by other researchers, which is essential for validating research findings and ensuring their accuracy.
  • Reliability : Research methodology aims to ensure that the research findings are reliable, which means that they are consistent and can be depended upon.
  • Validity : Research methodology ensures that the research findings are valid, which means that they accurately reflect the research question or hypothesis being tested.
  • Efficiency : Research methodology provides a structured and efficient way of conducting research, which helps to save time and resources.
  • Flexibility : Research methodology allows researchers to choose the most appropriate research methods and techniques based on the research question, data availability, and other relevant factors.
  • Scope for innovation: Research methodology provides scope for innovation and creativity in designing research studies and developing new research techniques.

Research Methodology Vs Research Methods

Research MethodologyResearch Methods
Research methodology refers to the philosophical and theoretical frameworks that guide the research process. refer to the techniques and procedures used to collect and analyze data.
It is concerned with the underlying principles and assumptions of research.It is concerned with the practical aspects of research.
It provides a rationale for why certain research methods are used.It determines the specific steps that will be taken to conduct research.
It is broader in scope and involves understanding the overall approach to research.It is narrower in scope and focuses on specific techniques and tools used in research.
It is concerned with identifying research questions, defining the research problem, and formulating hypotheses.It is concerned with collecting data, analyzing data, and interpreting results.
It is concerned with the validity and reliability of research.It is concerned with the accuracy and precision of data.
It is concerned with the ethical considerations of research.It is concerned with the practical considerations of research.

About the author

' src=

Muhammad Hassan

Researcher, Academic Writer, Web developer

You may also like

Background of The Study

Background of The Study – Examples and Writing...

Research Paper Introduction

Research Paper Introduction – Writing Guide and...

Informed Consent in Research

Informed Consent in Research – Types, Templates...

Significance of the Study

Significance of the Study – Examples and Writing...

Dissertation vs Thesis

Dissertation vs Thesis – Key Differences

Research Findings

Research Findings – Types Examples and Writing...

Banner

Research Methods

  • Getting Started
  • What is Research Design?
  • Research Approach
  • Research Methodology
  • Data Collection
  • Data Analysis & Interpretation
  • Population & Sampling
  • Theories, Theoretical Perspective & Theoretical Framework
  • Useful Resources

Further Resources

Cover Art

Data Analysis & Interpretation

  • Quantitative Data

Qualitative Data

  • Mixed Methods

You will need to tidy, analyse and interpret the data you collected to give meaning to it, and to answer your research question.  Your choice of methodology points the way to the most suitable method of analysing your data.

research methodology and data analysis

If the data is numeric you can use a software package such as SPSS, Excel Spreadsheet or “R” to do statistical analysis.  You can identify things like mean, median and average or identify a causal or correlational relationship between variables.  

The University of Connecticut has useful information on statistical analysis.

If your research set out to test a hypothesis your research will either support or refute it, and you will need to explain why this is the case.  You should also highlight and discuss any issues or actions that may have impacted on your results, either positively or negatively.  To fully contribute to the body of knowledge in your area be sure to discuss and interpret your results within the context of your research and the existing literature on the topic.

Data analysis for a qualitative study can be complex because of the variety of types of data that can be collected. Qualitative researchers aren’t attempting to measure observable characteristics, they are often attempting to capture an individual’s interpretation of a phenomena or situation in a particular context or setting.  This data could be captured in text from an interview or focus group, a movie, images, or documents.   Analysis of this type of data is usually done by analysing each artefact according to a predefined and outlined criteria for analysis and then by using a coding system.  The code can be developed by the researcher before analysis or the researcher may develop a code from the research data.  This can be done by hand or by using thematic analysis software such as NVivo.

Interpretation of qualitative data can be presented as a narrative.  The themes identified from the research can be organised and integrated with themes in the existing literature to give further weight and meaning to the research.  The interpretation should also state if the aims and objectives of the research were met.   Any shortcomings with research or areas for further research should also be discussed (Creswell,2009)*.

For further information on analysing and presenting qualitative date, read this article in Nature .

Mixed Methods Data

Data analysis for mixed methods involves aspects of both quantitative and qualitative methods.  However, the sequencing of data collection and analysis is important in terms of the mixed method approach that you are taking.  For example, you could be using a convergent, sequential or transformative model which directly impacts how you use different data to inform, support or direct the course of your study.

The intention in using mixed methods is to produce a synthesis of both quantitative and qualitative information to give a detailed picture of a phenomena in a particular context or setting. To fully understand how best to produce this synthesis it might be worth looking at why researchers choose this method.  Bergin**(2018) states that researchers choose mixed methods because it allows them to triangulate, illuminate or discover a more diverse set of findings.  Therefore, when it comes to interpretation you will need to return to the purpose of your research and discuss and interpret your data in that context. As with quantitative and qualitative methods, interpretation of data should be discussed within the context of the existing literature.

Bergin’s book is available in the Library to borrow. Bolton LTT collection 519.5 BER

Creswell’s book is available in the Library to borrow.  Bolton LTT collection 300.72 CRE

For more information on data analysis look at Sage Research Methods database on the library website.

*Creswell, John W.(2009)  Research design: qualitative, and mixed methods approaches.  Sage, Los Angeles, pp 183

**Bergin, T (2018), Data analysis: quantitative, qualitative and mixed methods. Sage, Los Angeles, pp182

  • << Previous: Data Collection
  • Next: Population & Sampling >>
  • Last Updated: Sep 7, 2023 3:09 PM
  • URL: https://tudublin.libguides.com/research_methods

SOC 211: Research Methods and Data Analysis

The research process, literature review, mind mapping, writing guide, search techniques.

  • Finding Books & Background Information
  • Finding Articles
  • Statistics and Data
  • Citing Sources

Social Sciences Librarian

Profile Photo

  • Guide on Literature Reviews This guide was created by Britt McGowen, Associate Librarian, at the University of West Florida.
  • Literature Review Spreadsheet Customize this spreadsheet as you work on gathering information for your literature review. Created by Lehigh staff Brian Simboli, Elizabeth Young Miller, and Meredith Constable.

research methodology and data analysis

Brainstorming keywords and concepts related to your topic can be helpful.  Consider using University of Arizona's free mind mapping tool linked below.

  • Mind Mapping Tool

research methodology and data analysis

  • Next: Finding Books & Background Information >>
  • Last Updated: Sep 17, 2024 3:26 PM
  • URL: https://libraryguides.lehigh.edu/soc211

Qualitative vs Quantitative Research Methods & Data Analysis

Saul McLeod, PhD

Editor-in-Chief for Simply Psychology

BSc (Hons) Psychology, MRes, PhD, University of Manchester

Saul McLeod, PhD., is a qualified psychology teacher with over 18 years of experience in further and higher education. He has been published in peer-reviewed journals, including the Journal of Clinical Psychology.

Learn about our Editorial Process

Olivia Guy-Evans, MSc

Associate Editor for Simply Psychology

BSc (Hons) Psychology, MSc Psychology of Education

Olivia Guy-Evans is a writer and associate editor for Simply Psychology. She has previously worked in healthcare and educational sectors.

The main difference between quantitative and qualitative research is the type of data they collect and analyze.

Quantitative data is information about quantities, and therefore numbers, and qualitative data is descriptive, and regards phenomenon which can be observed but not measured, such as language.
  • Quantitative research collects numerical data and analyzes it using statistical methods. The aim is to produce objective, empirical data that can be measured and expressed numerically. Quantitative research is often used to test hypotheses, identify patterns, and make predictions.
  • Qualitative research gathers non-numerical data (words, images, sounds) to explore subjective experiences and attitudes, often via observation and interviews. It aims to produce detailed descriptions and uncover new insights about the studied phenomenon.

On This Page:

What Is Qualitative Research?

Qualitative research is the process of collecting, analyzing, and interpreting non-numerical data, such as language. Qualitative research can be used to understand how an individual subjectively perceives and gives meaning to their social reality.

Qualitative data is non-numerical data, such as text, video, photographs, or audio recordings. This type of data can be collected using diary accounts or in-depth interviews and analyzed using grounded theory or thematic analysis.

Qualitative research is multimethod in focus, involving an interpretive, naturalistic approach to its subject matter. This means that qualitative researchers study things in their natural settings, attempting to make sense of, or interpret, phenomena in terms of the meanings people bring to them. Denzin and Lincoln (1994, p. 2)

Interest in qualitative data came about as the result of the dissatisfaction of some psychologists (e.g., Carl Rogers) with the scientific study of psychologists such as behaviorists (e.g., Skinner ).

Since psychologists study people, the traditional approach to science is not seen as an appropriate way of carrying out research since it fails to capture the totality of human experience and the essence of being human.  Exploring participants’ experiences is known as a phenomenological approach (re: Humanism ).

Qualitative research is primarily concerned with meaning, subjectivity, and lived experience. The goal is to understand the quality and texture of people’s experiences, how they make sense of them, and the implications for their lives.

Qualitative research aims to understand the social reality of individuals, groups, and cultures as nearly as possible as participants feel or live it. Thus, people and groups are studied in their natural setting.

Some examples of qualitative research questions are provided, such as what an experience feels like, how people talk about something, how they make sense of an experience, and how events unfold for people.

Research following a qualitative approach is exploratory and seeks to explain ‘how’ and ‘why’ a particular phenomenon, or behavior, operates as it does in a particular context. It can be used to generate hypotheses and theories from the data.

Qualitative Methods

There are different types of qualitative research methods, including diary accounts, in-depth interviews , documents, focus groups , case study research , and ethnography .

The results of qualitative methods provide a deep understanding of how people perceive their social realities and in consequence, how they act within the social world.

The researcher has several methods for collecting empirical materials, ranging from the interview to direct observation, to the analysis of artifacts, documents, and cultural records, to the use of visual materials or personal experience. Denzin and Lincoln (1994, p. 14)

Here are some examples of qualitative data:

Interview transcripts : Verbatim records of what participants said during an interview or focus group. They allow researchers to identify common themes and patterns, and draw conclusions based on the data. Interview transcripts can also be useful in providing direct quotes and examples to support research findings.

Observations : The researcher typically takes detailed notes on what they observe, including any contextual information, nonverbal cues, or other relevant details. The resulting observational data can be analyzed to gain insights into social phenomena, such as human behavior, social interactions, and cultural practices.

Unstructured interviews : generate qualitative data through the use of open questions.  This allows the respondent to talk in some depth, choosing their own words.  This helps the researcher develop a real sense of a person’s understanding of a situation.

Diaries or journals : Written accounts of personal experiences or reflections.

Notice that qualitative data could be much more than just words or text. Photographs, videos, sound recordings, and so on, can be considered qualitative data. Visual data can be used to understand behaviors, environments, and social interactions.

Qualitative Data Analysis

Qualitative research is endlessly creative and interpretive. The researcher does not just leave the field with mountains of empirical data and then easily write up his or her findings.

Qualitative interpretations are constructed, and various techniques can be used to make sense of the data, such as content analysis, grounded theory (Glaser & Strauss, 1967), thematic analysis (Braun & Clarke, 2006), or discourse analysis .

For example, thematic analysis is a qualitative approach that involves identifying implicit or explicit ideas within the data. Themes will often emerge once the data has been coded .

RESEARCH THEMATICANALYSISMETHOD

Key Features

  • Events can be understood adequately only if they are seen in context. Therefore, a qualitative researcher immerses her/himself in the field, in natural surroundings. The contexts of inquiry are not contrived; they are natural. Nothing is predefined or taken for granted.
  • Qualitative researchers want those who are studied to speak for themselves, to provide their perspectives in words and other actions. Therefore, qualitative research is an interactive process in which the persons studied teach the researcher about their lives.
  • The qualitative researcher is an integral part of the data; without the active participation of the researcher, no data exists.
  • The study’s design evolves during the research and can be adjusted or changed as it progresses. For the qualitative researcher, there is no single reality. It is subjective and exists only in reference to the observer.
  • The theory is data-driven and emerges as part of the research process, evolving from the data as they are collected.

Limitations of Qualitative Research

  • Because of the time and costs involved, qualitative designs do not generally draw samples from large-scale data sets.
  • The problem of adequate validity or reliability is a major criticism. Because of the subjective nature of qualitative data and its origin in single contexts, it is difficult to apply conventional standards of reliability and validity. For example, because of the central role played by the researcher in the generation of data, it is not possible to replicate qualitative studies.
  • Also, contexts, situations, events, conditions, and interactions cannot be replicated to any extent, nor can generalizations be made to a wider context than the one studied with confidence.
  • The time required for data collection, analysis, and interpretation is lengthy. Analysis of qualitative data is difficult, and expert knowledge of an area is necessary to interpret qualitative data. Great care must be taken when doing so, for example, looking for mental illness symptoms.

Advantages of Qualitative Research

  • Because of close researcher involvement, the researcher gains an insider’s view of the field. This allows the researcher to find issues that are often missed (such as subtleties and complexities) by the scientific, more positivistic inquiries.
  • Qualitative descriptions can be important in suggesting possible relationships, causes, effects, and dynamic processes.
  • Qualitative analysis allows for ambiguities/contradictions in the data, which reflect social reality (Denscombe, 2010).
  • Qualitative research uses a descriptive, narrative style; this research might be of particular benefit to the practitioner as she or he could turn to qualitative reports to examine forms of knowledge that might otherwise be unavailable, thereby gaining new insight.

What Is Quantitative Research?

Quantitative research involves the process of objectively collecting and analyzing numerical data to describe, predict, or control variables of interest.

The goals of quantitative research are to test causal relationships between variables , make predictions, and generalize results to wider populations.

Quantitative researchers aim to establish general laws of behavior and phenomenon across different settings/contexts. Research is used to test a theory and ultimately support or reject it.

Quantitative Methods

Experiments typically yield quantitative data, as they are concerned with measuring things.  However, other research methods, such as controlled observations and questionnaires , can produce both quantitative information.

For example, a rating scale or closed questions on a questionnaire would generate quantitative data as these produce either numerical data or data that can be put into categories (e.g., “yes,” “no” answers).

Experimental methods limit how research participants react to and express appropriate social behavior.

Findings are, therefore, likely to be context-bound and simply a reflection of the assumptions that the researcher brings to the investigation.

There are numerous examples of quantitative data in psychological research, including mental health. Here are a few examples:

Another example is the Experience in Close Relationships Scale (ECR), a self-report questionnaire widely used to assess adult attachment styles .

The ECR provides quantitative data that can be used to assess attachment styles and predict relationship outcomes.

Neuroimaging data : Neuroimaging techniques, such as MRI and fMRI, provide quantitative data on brain structure and function.

This data can be analyzed to identify brain regions involved in specific mental processes or disorders.

For example, the Beck Depression Inventory (BDI) is a clinician-administered questionnaire widely used to assess the severity of depressive symptoms in individuals.

The BDI consists of 21 questions, each scored on a scale of 0 to 3, with higher scores indicating more severe depressive symptoms. 

Quantitative Data Analysis

Statistics help us turn quantitative data into useful information to help with decision-making. We can use statistics to summarize our data, describing patterns, relationships, and connections. Statistics can be descriptive or inferential.

Descriptive statistics help us to summarize our data. In contrast, inferential statistics are used to identify statistically significant differences between groups of data (such as intervention and control groups in a randomized control study).

  • Quantitative researchers try to control extraneous variables by conducting their studies in the lab.
  • The research aims for objectivity (i.e., without bias) and is separated from the data.
  • The design of the study is determined before it begins.
  • For the quantitative researcher, the reality is objective, exists separately from the researcher, and can be seen by anyone.
  • Research is used to test a theory and ultimately support or reject it.

Limitations of Quantitative Research

  • Context: Quantitative experiments do not take place in natural settings. In addition, they do not allow participants to explain their choices or the meaning of the questions they may have for those participants (Carr, 1994).
  • Researcher expertise: Poor knowledge of the application of statistical analysis may negatively affect analysis and subsequent interpretation (Black, 1999).
  • Variability of data quantity: Large sample sizes are needed for more accurate analysis. Small-scale quantitative studies may be less reliable because of the low quantity of data (Denscombe, 2010). This also affects the ability to generalize study findings to wider populations.
  • Confirmation bias: The researcher might miss observing phenomena because of focus on theory or hypothesis testing rather than on the theory of hypothesis generation.

Advantages of Quantitative Research

  • Scientific objectivity: Quantitative data can be interpreted with statistical analysis, and since statistics are based on the principles of mathematics, the quantitative approach is viewed as scientifically objective and rational (Carr, 1994; Denscombe, 2010).
  • Useful for testing and validating already constructed theories.
  • Rapid analysis: Sophisticated software removes much of the need for prolonged data analysis, especially with large volumes of data involved (Antonius, 2003).
  • Replication: Quantitative data is based on measured values and can be checked by others because numerical data is less open to ambiguities of interpretation.
  • Hypotheses can also be tested because of statistical analysis (Antonius, 2003).

Antonius, R. (2003). Interpreting quantitative data with SPSS . Sage.

Black, T. R. (1999). Doing quantitative research in the social sciences: An integrated approach to research design, measurement and statistics . Sage.

Braun, V. & Clarke, V. (2006). Using thematic analysis in psychology . Qualitative Research in Psychology , 3, 77–101.

Carr, L. T. (1994). The strengths and weaknesses of quantitative and qualitative research : what method for nursing? Journal of advanced nursing, 20(4) , 716-721.

Denscombe, M. (2010). The Good Research Guide: for small-scale social research. McGraw Hill.

Denzin, N., & Lincoln. Y. (1994). Handbook of Qualitative Research. Thousand Oaks, CA, US: Sage Publications Inc.

Glaser, B. G., Strauss, A. L., & Strutzel, E. (1968). The discovery of grounded theory; strategies for qualitative research. Nursing research, 17(4) , 364.

Minichiello, V. (1990). In-Depth Interviewing: Researching People. Longman Cheshire.

Punch, K. (1998). Introduction to Social Research: Quantitative and Qualitative Approaches. London: Sage

Further Information

  • Mixed methods research
  • Designing qualitative research
  • Methods of data collection and analysis
  • Introduction to quantitative and qualitative research
  • Checklists for improving rigour in qualitative research: a case of the tail wagging the dog?
  • Qualitative research in health care: Analysing qualitative data
  • Qualitative data analysis: the framework approach
  • Using the framework method for the analysis of
  • Qualitative data in multi-disciplinary health research
  • Content Analysis
  • Grounded Theory
  • Thematic Analysis

Print Friendly, PDF & Email

The 7 Most Useful Data Analysis Methods and Techniques

Data analytics is the process of analyzing raw data to draw out meaningful insights. These insights are then used to determine the best course of action.

When is the best time to roll out that marketing campaign? Is the current team structure as effective as it could be? Which customer segments are most likely to purchase your new product?

Ultimately, data analytics is a crucial driver of any successful business strategy. But how do data analysts actually turn raw data into something useful? There are a range of methods and techniques that data analysts use depending on the type of data in question and the kinds of insights they want to uncover.

You can get a hands-on introduction to data analytics in this free short course .

In this post, we’ll explore some of the most useful data analysis techniques. By the end, you’ll have a much clearer idea of how you can transform meaningless data into business intelligence. We’ll cover:

  • What is data analysis and why is it important?
  • What is the difference between qualitative and quantitative data?
  • Regression analysis
  • Monte Carlo simulation
  • Factor analysis
  • Cohort analysis
  • Cluster analysis
  • Time series analysis
  • Sentiment analysis
  • The data analysis process
  • The best tools for data analysis
  •  Key takeaways

The first six methods listed are used for quantitative data , while the last technique applies to qualitative data. We briefly explain the difference between quantitative and qualitative data in section two, but if you want to skip straight to a particular analysis technique, just use the clickable menu.

1. What is data analysis and why is it important?

Data analysis is, put simply, the process of discovering useful information by evaluating data. This is done through a process of inspecting, cleaning, transforming, and modeling data using analytical and statistical tools, which we will explore in detail further along in this article.

Why is data analysis important? Analyzing data effectively helps organizations make business decisions. Nowadays, data is collected by businesses constantly: through surveys, online tracking, online marketing analytics, collected subscription and registration data (think newsletters), social media monitoring, among other methods.

These data will appear as different structures, including—but not limited to—the following:

The concept of big data —data that is so large, fast, or complex, that it is difficult or impossible to process using traditional methods—gained momentum in the early 2000s. Then, Doug Laney, an industry analyst, articulated what is now known as the mainstream definition of big data as the three Vs: volume, velocity, and variety. 

  • Volume: As mentioned earlier, organizations are collecting data constantly. In the not-too-distant past it would have been a real issue to store, but nowadays storage is cheap and takes up little space.
  • Velocity: Received data needs to be handled in a timely manner. With the growth of the Internet of Things, this can mean these data are coming in constantly, and at an unprecedented speed.
  • Variety: The data being collected and stored by organizations comes in many forms, ranging from structured data—that is, more traditional, numerical data—to unstructured data—think emails, videos, audio, and so on. We’ll cover structured and unstructured data a little further on.

This is a form of data that provides information about other data, such as an image. In everyday life you’ll find this by, for example, right-clicking on a file in a folder and selecting “Get Info”, which will show you information such as file size and kind, date of creation, and so on.

Real-time data

This is data that is presented as soon as it is acquired. A good example of this is a stock market ticket, which provides information on the most-active stocks in real time.

Machine data

This is data that is produced wholly by machines, without human instruction. An example of this could be call logs automatically generated by your smartphone.

Quantitative and qualitative data

Quantitative data—otherwise known as structured data— may appear as a “traditional” database—that is, with rows and columns. Qualitative data—otherwise known as unstructured data—are the other types of data that don’t fit into rows and columns, which can include text, images, videos and more. We’ll discuss this further in the next section.

2. What is the difference between quantitative and qualitative data?

How you analyze your data depends on the type of data you’re dealing with— quantitative or qualitative . So what’s the difference?

Quantitative data is anything measurable , comprising specific quantities and numbers. Some examples of quantitative data include sales figures, email click-through rates, number of website visitors, and percentage revenue increase. Quantitative data analysis techniques focus on the statistical, mathematical, or numerical analysis of (usually large) datasets. This includes the manipulation of statistical data using computational techniques and algorithms. Quantitative analysis techniques are often used to explain certain phenomena or to make predictions.

Qualitative data cannot be measured objectively , and is therefore open to more subjective interpretation. Some examples of qualitative data include comments left in response to a survey question, things people have said during interviews, tweets and other social media posts, and the text included in product reviews. With qualitative data analysis, the focus is on making sense of unstructured data (such as written text, or transcripts of spoken conversations). Often, qualitative analysis will organize the data into themes—a process which, fortunately, can be automated.

Data analysts work with both quantitative and qualitative data , so it’s important to be familiar with a variety of analysis methods. Let’s take a look at some of the most useful techniques now.

3. Data analysis techniques

Now we’re familiar with some of the different types of data, let’s focus on the topic at hand: different methods for analyzing data. 

a. Regression analysis

Regression analysis is used to estimate the relationship between a set of variables. When conducting any type of regression analysis , you’re looking to see if there’s a correlation between a dependent variable (that’s the variable or outcome you want to measure or predict) and any number of independent variables (factors which may have an impact on the dependent variable). The aim of regression analysis is to estimate how one or more variables might impact the dependent variable, in order to identify trends and patterns. This is especially useful for making predictions and forecasting future trends.

Let’s imagine you work for an ecommerce company and you want to examine the relationship between: (a) how much money is spent on social media marketing, and (b) sales revenue. In this case, sales revenue is your dependent variable—it’s the factor you’re most interested in predicting and boosting. Social media spend is your independent variable; you want to determine whether or not it has an impact on sales and, ultimately, whether it’s worth increasing, decreasing, or keeping the same. Using regression analysis, you’d be able to see if there’s a relationship between the two variables. A positive correlation would imply that the more you spend on social media marketing, the more sales revenue you make. No correlation at all might suggest that social media marketing has no bearing on your sales. Understanding the relationship between these two variables would help you to make informed decisions about the social media budget going forward. However: It’s important to note that, on their own, regressions can only be used to determine whether or not there is a relationship between a set of variables—they don’t tell you anything about cause and effect. So, while a positive correlation between social media spend and sales revenue may suggest that one impacts the other, it’s impossible to draw definitive conclusions based on this analysis alone.

There are many different types of regression analysis, and the model you use depends on the type of data you have for the dependent variable. For example, your dependent variable might be continuous (i.e. something that can be measured on a continuous scale, such as sales revenue in USD), in which case you’d use a different type of regression analysis than if your dependent variable was categorical in nature (i.e. comprising values that can be categorised into a number of distinct groups based on a certain characteristic, such as customer location by continent). You can learn more about different types of dependent variables and how to choose the right regression analysis in this guide .

Regression analysis in action: Investigating the relationship between clothing brand Benetton’s advertising expenditure and sales

b. Monte Carlo simulation

When making decisions or taking certain actions, there are a range of different possible outcomes. If you take the bus, you might get stuck in traffic. If you walk, you might get caught in the rain or bump into your chatty neighbor, potentially delaying your journey. In everyday life, we tend to briefly weigh up the pros and cons before deciding which action to take; however, when the stakes are high, it’s essential to calculate, as thoroughly and accurately as possible, all the potential risks and rewards.

Monte Carlo simulation, otherwise known as the Monte Carlo method, is a computerized technique used to generate models of possible outcomes and their probability distributions. It essentially considers a range of possible outcomes and then calculates how likely it is that each particular outcome will be realized. The Monte Carlo method is used by data analysts to conduct advanced risk analysis, allowing them to better forecast what might happen in the future and make decisions accordingly.

So how does Monte Carlo simulation work, and what can it tell us? To run a Monte Carlo simulation, you’ll start with a mathematical model of your data—such as a spreadsheet. Within your spreadsheet, you’ll have one or several outputs that you’re interested in; profit, for example, or number of sales. You’ll also have a number of inputs; these are variables that may impact your output variable. If you’re looking at profit, relevant inputs might include the number of sales, total marketing spend, and employee salaries. If you knew the exact, definitive values of all your input variables, you’d quite easily be able to calculate what profit you’d be left with at the end. However, when these values are uncertain, a Monte Carlo simulation enables you to calculate all the possible options and their probabilities. What will your profit be if you make 100,000 sales and hire five new employees on a salary of $50,000 each? What is the likelihood of this outcome? What will your profit be if you only make 12,000 sales and hire five new employees? And so on. It does this by replacing all uncertain values with functions which generate random samples from distributions determined by you, and then running a series of calculations and recalculations to produce models of all the possible outcomes and their probability distributions. The Monte Carlo method is one of the most popular techniques for calculating the effect of unpredictable variables on a specific output variable, making it ideal for risk analysis.

Monte Carlo simulation in action: A case study using Monte Carlo simulation for risk analysis

 c. Factor analysis

Factor analysis is a technique used to reduce a large number of variables to a smaller number of factors. It works on the basis that multiple separate, observable variables correlate with each other because they are all associated with an underlying construct. This is useful not only because it condenses large datasets into smaller, more manageable samples, but also because it helps to uncover hidden patterns. This allows you to explore concepts that cannot be easily measured or observed—such as wealth, happiness, fitness, or, for a more business-relevant example, customer loyalty and satisfaction.

Let’s imagine you want to get to know your customers better, so you send out a rather long survey comprising one hundred questions. Some of the questions relate to how they feel about your company and product; for example, “Would you recommend us to a friend?” and “How would you rate the overall customer experience?” Other questions ask things like “What is your yearly household income?” and “How much are you willing to spend on skincare each month?”

Once your survey has been sent out and completed by lots of customers, you end up with a large dataset that essentially tells you one hundred different things about each customer (assuming each customer gives one hundred responses). Instead of looking at each of these responses (or variables) individually, you can use factor analysis to group them into factors that belong together—in other words, to relate them to a single underlying construct. In this example, factor analysis works by finding survey items that are strongly correlated. This is known as covariance . So, if there’s a strong positive correlation between household income and how much they’re willing to spend on skincare each month (i.e. as one increases, so does the other), these items may be grouped together. Together with other variables (survey responses), you may find that they can be reduced to a single factor such as “consumer purchasing power”. Likewise, if a customer experience rating of 10/10 correlates strongly with “yes” responses regarding how likely they are to recommend your product to a friend, these items may be reduced to a single factor such as “customer satisfaction”.

In the end, you have a smaller number of factors rather than hundreds of individual variables. These factors are then taken forward for further analysis, allowing you to learn more about your customers (or any other area you’re interested in exploring).

Factor analysis in action: Using factor analysis to explore customer behavior patterns in Tehran

d. Cohort analysis

Cohort analysis is a data analytics technique that groups users based on a shared characteristic , such as the date they signed up for a service or the product they purchased. Once users are grouped into cohorts, analysts can track their behavior over time to identify trends and patterns.

So what does this mean and why is it useful? Let’s break down the above definition further. A cohort is a group of people who share a common characteristic (or action) during a given time period. Students who enrolled at university in 2020 may be referred to as the 2020 cohort. Customers who purchased something from your online store via the app in the month of December may also be considered a cohort.

With cohort analysis, you’re dividing your customers or users into groups and looking at how these groups behave over time. So, rather than looking at a single, isolated snapshot of all your customers at a given moment in time (with each customer at a different point in their journey), you’re examining your customers’ behavior in the context of the customer lifecycle. As a result, you can start to identify patterns of behavior at various points in the customer journey—say, from their first ever visit to your website, through to email newsletter sign-up, to their first purchase, and so on. As such, cohort analysis is dynamic, allowing you to uncover valuable insights about the customer lifecycle.

This is useful because it allows companies to tailor their service to specific customer segments (or cohorts). Let’s imagine you run a 50% discount campaign in order to attract potential new customers to your website. Once you’ve attracted a group of new customers (a cohort), you’ll want to track whether they actually buy anything and, if they do, whether or not (and how frequently) they make a repeat purchase. With these insights, you’ll start to gain a much better understanding of when this particular cohort might benefit from another discount offer or retargeting ads on social media, for example. Ultimately, cohort analysis allows companies to optimize their service offerings (and marketing) to provide a more targeted, personalized experience. You can learn more about how to run cohort analysis using Google Analytics .

Cohort analysis in action: How Ticketmaster used cohort analysis to boost revenue

e. Cluster analysis

Cluster analysis is an exploratory technique that seeks to identify structures within a dataset. The goal of cluster analysis is to sort different data points into groups (or clusters) that are internally homogeneous and externally heterogeneous. This means that data points within a cluster are similar to each other, and dissimilar to data points in another cluster. Clustering is used to gain insight into how data is distributed in a given dataset, or as a preprocessing step for other algorithms.

There are many real-world applications of cluster analysis. In marketing, cluster analysis is commonly used to group a large customer base into distinct segments, allowing for a more targeted approach to advertising and communication. Insurance firms might use cluster analysis to investigate why certain locations are associated with a high number of insurance claims. Another common application is in geology, where experts will use cluster analysis to evaluate which cities are at greatest risk of earthquakes (and thus try to mitigate the risk with protective measures).

It’s important to note that, while cluster analysis may reveal structures within your data, it won’t explain why those structures exist. With that in mind, cluster analysis is a useful starting point for understanding your data and informing further analysis. Clustering algorithms are also used in machine learning—you can learn more about clustering in machine learning in our guide .

Cluster analysis in action: Using cluster analysis for customer segmentation—a telecoms case study example

f. Time series analysis

Time series analysis is a statistical technique used to identify trends and cycles over time. Time series data is a sequence of data points which measure the same variable at different points in time (for example, weekly sales figures or monthly email sign-ups). By looking at time-related trends, analysts are able to forecast how the variable of interest may fluctuate in the future.

When conducting time series analysis, the main patterns you’ll be looking out for in your data are:

  • Trends: Stable, linear increases or decreases over an extended time period.
  • Seasonality: Predictable fluctuations in the data due to seasonal factors over a short period of time. For example, you might see a peak in swimwear sales in summer around the same time every year.
  • Cyclic patterns: Unpredictable cycles where the data fluctuates. Cyclical trends are not due to seasonality, but rather, may occur as a result of economic or industry-related conditions.

As you can imagine, the ability to make informed predictions about the future has immense value for business. Time series analysis and forecasting is used across a variety of industries, most commonly for stock market analysis, economic forecasting, and sales forecasting. There are different types of time series models depending on the data you’re using and the outcomes you want to predict. These models are typically classified into three broad types: the autoregressive (AR) models, the integrated (I) models, and the moving average (MA) models. For an in-depth look at time series analysis, refer to our guide .

Time series analysis in action: Developing a time series model to predict jute yarn demand in Bangladesh

g. Sentiment analysis

When you think of data, your mind probably automatically goes to numbers and spreadsheets.

Many companies overlook the value of qualitative data, but in reality, there are untold insights to be gained from what people (especially customers) write and say about you. So how do you go about analyzing textual data?

One highly useful qualitative technique is sentiment analysis , a technique which belongs to the broader category of text analysis —the (usually automated) process of sorting and understanding textual data.

With sentiment analysis, the goal is to interpret and classify the emotions conveyed within textual data. From a business perspective, this allows you to ascertain how your customers feel about various aspects of your brand, product, or service.

There are several different types of sentiment analysis models, each with a slightly different focus. The three main types include:

Fine-grained sentiment analysis

If you want to focus on opinion polarity (i.e. positive, neutral, or negative) in depth, fine-grained sentiment analysis will allow you to do so.

For example, if you wanted to interpret star ratings given by customers, you might use fine-grained sentiment analysis to categorize the various ratings along a scale ranging from very positive to very negative.

Emotion detection

This model often uses complex machine learning algorithms to pick out various emotions from your textual data.

You might use an emotion detection model to identify words associated with happiness, anger, frustration, and excitement, giving you insight into how your customers feel when writing about you or your product on, say, a product review site.

Aspect-based sentiment analysis

This type of analysis allows you to identify what specific aspects the emotions or opinions relate to, such as a certain product feature or a new ad campaign.

If a customer writes that they “find the new Instagram advert so annoying”, your model should detect not only a negative sentiment, but also the object towards which it’s directed.

In a nutshell, sentiment analysis uses various Natural Language Processing (NLP) algorithms and systems which are trained to associate certain inputs (for example, certain words) with certain outputs.

For example, the input “annoying” would be recognized and tagged as “negative”. Sentiment analysis is crucial to understanding how your customers feel about you and your products, for identifying areas for improvement, and even for averting PR disasters in real-time!

Sentiment analysis in action: 5 Real-world sentiment analysis case studies

4. The data analysis process

In order to gain meaningful insights from data, data analysts will perform a rigorous step-by-step process. We go over this in detail in our step by step guide to the data analysis process —but, to briefly summarize, the data analysis process generally consists of the following phases:

Defining the question

The first step for any data analyst will be to define the objective of the analysis, sometimes called a ‘problem statement’. Essentially, you’re asking a question with regards to a business problem you’re trying to solve. Once you’ve defined this, you’ll then need to determine which data sources will help you answer this question.

Collecting the data

Now that you’ve defined your objective, the next step will be to set up a strategy for collecting and aggregating the appropriate data. Will you be using quantitative (numeric) or qualitative (descriptive) data? Do these data fit into first-party, second-party, or third-party data?

Learn more: Quantitative vs. Qualitative Data: What’s the Difference? 

Cleaning the data

Unfortunately, your collected data isn’t automatically ready for analysis—you’ll have to clean it first. As a data analyst, this phase of the process will take up the most time. During the data cleaning process, you will likely be:

  • Removing major errors, duplicates, and outliers
  • Removing unwanted data points
  • Structuring the data—that is, fixing typos, layout issues, etc.
  • Filling in major gaps in data

Analyzing the data

Now that we’ve finished cleaning the data, it’s time to analyze it! Many analysis methods have already been described in this article, and it’s up to you to decide which one will best suit the assigned objective. It may fall under one of the following categories:

  • Descriptive analysis , which identifies what has already happened
  • Diagnostic analysis , which focuses on understanding why something has happened
  • Predictive analysis , which identifies future trends based on historical data
  • Prescriptive analysis , which allows you to make recommendations for the future

Visualizing and sharing your findings

We’re almost at the end of the road! Analyses have been made, insights have been gleaned—all that remains to be done is to share this information with others. This is usually done with a data visualization tool, such as Google Charts, or Tableau.

Learn more: 13 of the Most Common Types of Data Visualization

To sum up the process, Will’s explained it all excellently in the following video:

5. The best tools for data analysis

As you can imagine, every phase of the data analysis process requires the data analyst to have a variety of tools under their belt that assist in gaining valuable insights from data. We cover these tools in greater detail in this article , but, in summary, here’s our best-of-the-best list, with links to each product:

The top 9 tools for data analysts

  • Microsoft Excel
  • Jupyter Notebook
  • Apache Spark
  • Microsoft Power BI

6. Key takeaways and further reading

As you can see, there are many different data analysis techniques at your disposal. In order to turn your raw data into actionable insights, it’s important to consider what kind of data you have (is it qualitative or quantitative?) as well as the kinds of insights that will be useful within the given context. In this post, we’ve introduced seven of the most useful data analysis techniques—but there are many more out there to be discovered!

So what now? If you haven’t already, we recommend reading the case studies for each analysis technique discussed in this post (you’ll find a link at the end of each section). For a more hands-on introduction to the kinds of methods and techniques that data analysts use, try out this free introductory data analytics short course. In the meantime, you might also want to read the following:

  • The Best Online Data Analytics Courses for 2024
  • What Is Time Series Data and How Is It Analyzed?
  • What is Spatial Analysis?

U.S. flag

An official website of the United States government

The .gov means it’s official. Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

The site is secure. The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

  • Publications
  • Account settings

The PMC website is updating on October 15, 2024. Learn More or Try it out now .

  • Advanced Search
  • Journal List
  • Can J Hosp Pharm
  • v.68(3); May-Jun 2015

Logo of cjhp

Qualitative Research: Data Collection, Analysis, and Management

Introduction.

In an earlier paper, 1 we presented an introduction to using qualitative research methods in pharmacy practice. In this article, we review some principles of the collection, analysis, and management of qualitative data to help pharmacists interested in doing research in their practice to continue their learning in this area. Qualitative research can help researchers to access the thoughts and feelings of research participants, which can enable development of an understanding of the meaning that people ascribe to their experiences. Whereas quantitative research methods can be used to determine how many people undertake particular behaviours, qualitative methods can help researchers to understand how and why such behaviours take place. Within the context of pharmacy practice research, qualitative approaches have been used to examine a diverse array of topics, including the perceptions of key stakeholders regarding prescribing by pharmacists and the postgraduation employment experiences of young pharmacists (see “Further Reading” section at the end of this article).

In the previous paper, 1 we outlined 3 commonly used methodologies: ethnography 2 , grounded theory 3 , and phenomenology. 4 Briefly, ethnography involves researchers using direct observation to study participants in their “real life” environment, sometimes over extended periods. Grounded theory and its later modified versions (e.g., Strauss and Corbin 5 ) use face-to-face interviews and interactions such as focus groups to explore a particular research phenomenon and may help in clarifying a less-well-understood problem, situation, or context. Phenomenology shares some features with grounded theory (such as an exploration of participants’ behaviour) and uses similar techniques to collect data, but it focuses on understanding how human beings experience their world. It gives researchers the opportunity to put themselves in another person’s shoes and to understand the subjective experiences of participants. 6 Some researchers use qualitative methodologies but adopt a different standpoint, and an example of this appears in the work of Thurston and others, 7 discussed later in this paper.

Qualitative work requires reflection on the part of researchers, both before and during the research process, as a way of providing context and understanding for readers. When being reflexive, researchers should not try to simply ignore or avoid their own biases (as this would likely be impossible); instead, reflexivity requires researchers to reflect upon and clearly articulate their position and subjectivities (world view, perspectives, biases), so that readers can better understand the filters through which questions were asked, data were gathered and analyzed, and findings were reported. From this perspective, bias and subjectivity are not inherently negative but they are unavoidable; as a result, it is best that they be articulated up-front in a manner that is clear and coherent for readers.

THE PARTICIPANT’S VIEWPOINT

What qualitative study seeks to convey is why people have thoughts and feelings that might affect the way they behave. Such study may occur in any number of contexts, but here, we focus on pharmacy practice and the way people behave with regard to medicines use (e.g., to understand patients’ reasons for nonadherence with medication therapy or to explore physicians’ resistance to pharmacists’ clinical suggestions). As we suggested in our earlier article, 1 an important point about qualitative research is that there is no attempt to generalize the findings to a wider population. Qualitative research is used to gain insights into people’s feelings and thoughts, which may provide the basis for a future stand-alone qualitative study or may help researchers to map out survey instruments for use in a quantitative study. It is also possible to use different types of research in the same study, an approach known as “mixed methods” research, and further reading on this topic may be found at the end of this paper.

The role of the researcher in qualitative research is to attempt to access the thoughts and feelings of study participants. This is not an easy task, as it involves asking people to talk about things that may be very personal to them. Sometimes the experiences being explored are fresh in the participant’s mind, whereas on other occasions reliving past experiences may be difficult. However the data are being collected, a primary responsibility of the researcher is to safeguard participants and their data. Mechanisms for such safeguarding must be clearly articulated to participants and must be approved by a relevant research ethics review board before the research begins. Researchers and practitioners new to qualitative research should seek advice from an experienced qualitative researcher before embarking on their project.

DATA COLLECTION

Whatever philosophical standpoint the researcher is taking and whatever the data collection method (e.g., focus group, one-to-one interviews), the process will involve the generation of large amounts of data. In addition to the variety of study methodologies available, there are also different ways of making a record of what is said and done during an interview or focus group, such as taking handwritten notes or video-recording. If the researcher is audio- or video-recording data collection, then the recordings must be transcribed verbatim before data analysis can begin. As a rough guide, it can take an experienced researcher/transcriber 8 hours to transcribe one 45-minute audio-recorded interview, a process than will generate 20–30 pages of written dialogue.

Many researchers will also maintain a folder of “field notes” to complement audio-taped interviews. Field notes allow the researcher to maintain and comment upon impressions, environmental contexts, behaviours, and nonverbal cues that may not be adequately captured through the audio-recording; they are typically handwritten in a small notebook at the same time the interview takes place. Field notes can provide important context to the interpretation of audio-taped data and can help remind the researcher of situational factors that may be important during data analysis. Such notes need not be formal, but they should be maintained and secured in a similar manner to audio tapes and transcripts, as they contain sensitive information and are relevant to the research. For more information about collecting qualitative data, please see the “Further Reading” section at the end of this paper.

DATA ANALYSIS AND MANAGEMENT

If, as suggested earlier, doing qualitative research is about putting oneself in another person’s shoes and seeing the world from that person’s perspective, the most important part of data analysis and management is to be true to the participants. It is their voices that the researcher is trying to hear, so that they can be interpreted and reported on for others to read and learn from. To illustrate this point, consider the anonymized transcript excerpt presented in Appendix 1 , which is taken from a research interview conducted by one of the authors (J.S.). We refer to this excerpt throughout the remainder of this paper to illustrate how data can be managed, analyzed, and presented.

Interpretation of Data

Interpretation of the data will depend on the theoretical standpoint taken by researchers. For example, the title of the research report by Thurston and others, 7 “Discordant indigenous and provider frames explain challenges in improving access to arthritis care: a qualitative study using constructivist grounded theory,” indicates at least 2 theoretical standpoints. The first is the culture of the indigenous population of Canada and the place of this population in society, and the second is the social constructivist theory used in the constructivist grounded theory method. With regard to the first standpoint, it can be surmised that, to have decided to conduct the research, the researchers must have felt that there was anecdotal evidence of differences in access to arthritis care for patients from indigenous and non-indigenous backgrounds. With regard to the second standpoint, it can be surmised that the researchers used social constructivist theory because it assumes that behaviour is socially constructed; in other words, people do things because of the expectations of those in their personal world or in the wider society in which they live. (Please see the “Further Reading” section for resources providing more information about social constructivist theory and reflexivity.) Thus, these 2 standpoints (and there may have been others relevant to the research of Thurston and others 7 ) will have affected the way in which these researchers interpreted the experiences of the indigenous population participants and those providing their care. Another standpoint is feminist standpoint theory which, among other things, focuses on marginalized groups in society. Such theories are helpful to researchers, as they enable us to think about things from a different perspective. Being aware of the standpoints you are taking in your own research is one of the foundations of qualitative work. Without such awareness, it is easy to slip into interpreting other people’s narratives from your own viewpoint, rather than that of the participants.

To analyze the example in Appendix 1 , we will adopt a phenomenological approach because we want to understand how the participant experienced the illness and we want to try to see the experience from that person’s perspective. It is important for the researcher to reflect upon and articulate his or her starting point for such analysis; for example, in the example, the coder could reflect upon her own experience as a female of a majority ethnocultural group who has lived within middle class and upper middle class settings. This personal history therefore forms the filter through which the data will be examined. This filter does not diminish the quality or significance of the analysis, since every researcher has his or her own filters; however, by explicitly stating and acknowledging what these filters are, the researcher makes it easer for readers to contextualize the work.

Transcribing and Checking

For the purposes of this paper it is assumed that interviews or focus groups have been audio-recorded. As mentioned above, transcribing is an arduous process, even for the most experienced transcribers, but it must be done to convert the spoken word to the written word to facilitate analysis. For anyone new to conducting qualitative research, it is beneficial to transcribe at least one interview and one focus group. It is only by doing this that researchers realize how difficult the task is, and this realization affects their expectations when asking others to transcribe. If the research project has sufficient funding, then a professional transcriber can be hired to do the work. If this is the case, then it is a good idea to sit down with the transcriber, if possible, and talk through the research and what the participants were talking about. This background knowledge for the transcriber is especially important in research in which people are using jargon or medical terms (as in pharmacy practice). Involving your transcriber in this way makes the work both easier and more rewarding, as he or she will feel part of the team. Transcription editing software is also available, but it is expensive. For example, ELAN (more formally known as EUDICO Linguistic Annotator, developed at the Technical University of Berlin) 8 is a tool that can help keep data organized by linking media and data files (particularly valuable if, for example, video-taping of interviews is complemented by transcriptions). It can also be helpful in searching complex data sets. Products such as ELAN do not actually automatically transcribe interviews or complete analyses, and they do require some time and effort to learn; nonetheless, for some research applications, it may be a valuable to consider such software tools.

All audio recordings should be transcribed verbatim, regardless of how intelligible the transcript may be when it is read back. Lines of text should be numbered. Once the transcription is complete, the researcher should read it while listening to the recording and do the following: correct any spelling or other errors; anonymize the transcript so that the participant cannot be identified from anything that is said (e.g., names, places, significant events); insert notations for pauses, laughter, looks of discomfort; insert any punctuation, such as commas and full stops (periods) (see Appendix 1 for examples of inserted punctuation), and include any other contextual information that might have affected the participant (e.g., temperature or comfort of the room).

Dealing with the transcription of a focus group is slightly more difficult, as multiple voices are involved. One way of transcribing such data is to “tag” each voice (e.g., Voice A, Voice B). In addition, the focus group will usually have 2 facilitators, whose respective roles will help in making sense of the data. While one facilitator guides participants through the topic, the other can make notes about context and group dynamics. More information about group dynamics and focus groups can be found in resources listed in the “Further Reading” section.

Reading between the Lines

During the process outlined above, the researcher can begin to get a feel for the participant’s experience of the phenomenon in question and can start to think about things that could be pursued in subsequent interviews or focus groups (if appropriate). In this way, one participant’s narrative informs the next, and the researcher can continue to interview until nothing new is being heard or, as it says in the text books, “saturation is reached”. While continuing with the processes of coding and theming (described in the next 2 sections), it is important to consider not just what the person is saying but also what they are not saying. For example, is a lengthy pause an indication that the participant is finding the subject difficult, or is the person simply deciding what to say? The aim of the whole process from data collection to presentation is to tell the participants’ stories using exemplars from their own narratives, thus grounding the research findings in the participants’ lived experiences.

Smith 9 suggested a qualitative research method known as interpretative phenomenological analysis, which has 2 basic tenets: first, that it is rooted in phenomenology, attempting to understand the meaning that individuals ascribe to their lived experiences, and second, that the researcher must attempt to interpret this meaning in the context of the research. That the researcher has some knowledge and expertise in the subject of the research means that he or she can have considerable scope in interpreting the participant’s experiences. Larkin and others 10 discussed the importance of not just providing a description of what participants say. Rather, interpretative phenomenological analysis is about getting underneath what a person is saying to try to truly understand the world from his or her perspective.

Once all of the research interviews have been transcribed and checked, it is time to begin coding. Field notes compiled during an interview can be a useful complementary source of information to facilitate this process, as the gap in time between an interview, transcribing, and coding can result in memory bias regarding nonverbal or environmental context issues that may affect interpretation of data.

Coding refers to the identification of topics, issues, similarities, and differences that are revealed through the participants’ narratives and interpreted by the researcher. This process enables the researcher to begin to understand the world from each participant’s perspective. Coding can be done by hand on a hard copy of the transcript, by making notes in the margin or by highlighting and naming sections of text. More commonly, researchers use qualitative research software (e.g., NVivo, QSR International Pty Ltd; www.qsrinternational.com/products_nvivo.aspx ) to help manage their transcriptions. It is advised that researchers undertake a formal course in the use of such software or seek supervision from a researcher experienced in these tools.

Returning to Appendix 1 and reading from lines 8–11, a code for this section might be “diagnosis of mental health condition”, but this would just be a description of what the participant is talking about at that point. If we read a little more deeply, we can ask ourselves how the participant might have come to feel that the doctor assumed he or she was aware of the diagnosis or indeed that they had only just been told the diagnosis. There are a number of pauses in the narrative that might suggest the participant is finding it difficult to recall that experience. Later in the text, the participant says “nobody asked me any questions about my life” (line 19). This could be coded simply as “health care professionals’ consultation skills”, but that would not reflect how the participant must have felt never to be asked anything about his or her personal life, about the participant as a human being. At the end of this excerpt, the participant just trails off, recalling that no-one showed any interest, which makes for very moving reading. For practitioners in pharmacy, it might also be pertinent to explore the participant’s experience of akathisia and why this was left untreated for 20 years.

One of the questions that arises about qualitative research relates to the reliability of the interpretation and representation of the participants’ narratives. There are no statistical tests that can be used to check reliability and validity as there are in quantitative research. However, work by Lincoln and Guba 11 suggests that there are other ways to “establish confidence in the ‘truth’ of the findings” (p. 218). They call this confidence “trustworthiness” and suggest that there are 4 criteria of trustworthiness: credibility (confidence in the “truth” of the findings), transferability (showing that the findings have applicability in other contexts), dependability (showing that the findings are consistent and could be repeated), and confirmability (the extent to which the findings of a study are shaped by the respondents and not researcher bias, motivation, or interest).

One way of establishing the “credibility” of the coding is to ask another researcher to code the same transcript and then to discuss any similarities and differences in the 2 resulting sets of codes. This simple act can result in revisions to the codes and can help to clarify and confirm the research findings.

Theming refers to the drawing together of codes from one or more transcripts to present the findings of qualitative research in a coherent and meaningful way. For example, there may be examples across participants’ narratives of the way in which they were treated in hospital, such as “not being listened to” or “lack of interest in personal experiences” (see Appendix 1 ). These may be drawn together as a theme running through the narratives that could be named “the patient’s experience of hospital care”. The importance of going through this process is that at its conclusion, it will be possible to present the data from the interviews using quotations from the individual transcripts to illustrate the source of the researchers’ interpretations. Thus, when the findings are organized for presentation, each theme can become the heading of a section in the report or presentation. Underneath each theme will be the codes, examples from the transcripts, and the researcher’s own interpretation of what the themes mean. Implications for real life (e.g., the treatment of people with chronic mental health problems) should also be given.

DATA SYNTHESIS

In this final section of this paper, we describe some ways of drawing together or “synthesizing” research findings to represent, as faithfully as possible, the meaning that participants ascribe to their life experiences. This synthesis is the aim of the final stage of qualitative research. For most readers, the synthesis of data presented by the researcher is of crucial significance—this is usually where “the story” of the participants can be distilled, summarized, and told in a manner that is both respectful to those participants and meaningful to readers. There are a number of ways in which researchers can synthesize and present their findings, but any conclusions drawn by the researchers must be supported by direct quotations from the participants. In this way, it is made clear to the reader that the themes under discussion have emerged from the participants’ interviews and not the mind of the researcher. The work of Latif and others 12 gives an example of how qualitative research findings might be presented.

Planning and Writing the Report

As has been suggested above, if researchers code and theme their material appropriately, they will naturally find the headings for sections of their report. Qualitative researchers tend to report “findings” rather than “results”, as the latter term typically implies that the data have come from a quantitative source. The final presentation of the research will usually be in the form of a report or a paper and so should follow accepted academic guidelines. In particular, the article should begin with an introduction, including a literature review and rationale for the research. There should be a section on the chosen methodology and a brief discussion about why qualitative methodology was most appropriate for the study question and why one particular methodology (e.g., interpretative phenomenological analysis rather than grounded theory) was selected to guide the research. The method itself should then be described, including ethics approval, choice of participants, mode of recruitment, and method of data collection (e.g., semistructured interviews or focus groups), followed by the research findings, which will be the main body of the report or paper. The findings should be written as if a story is being told; as such, it is not necessary to have a lengthy discussion section at the end. This is because much of the discussion will take place around the participants’ quotes, such that all that is needed to close the report or paper is a summary, limitations of the research, and the implications that the research has for practice. As stated earlier, it is not the intention of qualitative research to allow the findings to be generalized, and therefore this is not, in itself, a limitation.

Planning out the way that findings are to be presented is helpful. It is useful to insert the headings of the sections (the themes) and then make a note of the codes that exemplify the thoughts and feelings of your participants. It is generally advisable to put in the quotations that you want to use for each theme, using each quotation only once. After all this is done, the telling of the story can begin as you give your voice to the experiences of the participants, writing around their quotations. Do not be afraid to draw assumptions from the participants’ narratives, as this is necessary to give an in-depth account of the phenomena in question. Discuss these assumptions, drawing on your participants’ words to support you as you move from one code to another and from one theme to the next. Finally, as appropriate, it is possible to include examples from literature or policy documents that add support for your findings. As an exercise, you may wish to code and theme the sample excerpt in Appendix 1 and tell the participant’s story in your own way. Further reading about “doing” qualitative research can be found at the end of this paper.

CONCLUSIONS

Qualitative research can help researchers to access the thoughts and feelings of research participants, which can enable development of an understanding of the meaning that people ascribe to their experiences. It can be used in pharmacy practice research to explore how patients feel about their health and their treatment. Qualitative research has been used by pharmacists to explore a variety of questions and problems (see the “Further Reading” section for examples). An understanding of these issues can help pharmacists and other health care professionals to tailor health care to match the individual needs of patients and to develop a concordant relationship. Doing qualitative research is not easy and may require a complete rethink of how research is conducted, particularly for researchers who are more familiar with quantitative approaches. There are many ways of conducting qualitative research, and this paper has covered some of the practical issues regarding data collection, analysis, and management. Further reading around the subject will be essential to truly understand this method of accessing peoples’ thoughts and feelings to enable researchers to tell participants’ stories.

Appendix 1. Excerpt from a sample transcript

The participant (age late 50s) had suffered from a chronic mental health illness for 30 years. The participant had become a “revolving door patient,” someone who is frequently in and out of hospital. As the participant talked about past experiences, the researcher asked:

  • What was treatment like 30 years ago?
  • Umm—well it was pretty much they could do what they wanted with you because I was put into the er, the er kind of system er, I was just on
  • endless section threes.
  • Really…
  • But what I didn’t realize until later was that if you haven’t actually posed a threat to someone or yourself they can’t really do that but I didn’t know
  • that. So wh-when I first went into hospital they put me on the forensic ward ’cause they said, “We don’t think you’ll stay here we think you’ll just
  • run-run away.” So they put me then onto the acute admissions ward and – er – I can remember one of the first things I recall when I got onto that
  • ward was sitting down with a er a Dr XXX. He had a book this thick [gestures] and on each page it was like three questions and he went through
  • all these questions and I answered all these questions. So we’re there for I don’t maybe two hours doing all that and he asked me he said “well
  • when did somebody tell you then that you have schizophrenia” I said “well nobody’s told me that” so he seemed very surprised but nobody had
  • actually [pause] whe-when I first went up there under police escort erm the senior kind of consultants people I’d been to where I was staying and
  • ermm so er [pause] I . . . the, I can remember the very first night that I was there and given this injection in this muscle here [gestures] and just
  • having dreadful side effects the next day I woke up [pause]
  • . . . and I suffered that akathesia I swear to you, every minute of every day for about 20 years.
  • Oh how awful.
  • And that side of it just makes life impossible so the care on the wards [pause] umm I don’t know it’s kind of, it’s kind of hard to put into words
  • [pause]. Because I’m not saying they were sort of like not friendly or interested but then nobody ever seemed to want to talk about your life [pause]
  • nobody asked me any questions about my life. The only questions that came into was they asked me if I’d be a volunteer for these student exams
  • and things and I said “yeah” so all the questions were like “oh what jobs have you done,” er about your relationships and things and er but
  • nobody actually sat down and had a talk and showed some interest in you as a person you were just there basically [pause] um labelled and you
  • know there was there was [pause] but umm [pause] yeah . . .

This article is the 10th in the CJHP Research Primer Series, an initiative of the CJHP Editorial Board and the CSHP Research Committee. The planned 2-year series is intended to appeal to relatively inexperienced researchers, with the goal of building research capacity among practising pharmacists. The articles, presenting simple but rigorous guidance to encourage and support novice researchers, are being solicited from authors with appropriate expertise.

Previous articles in this series:

Bond CM. The research jigsaw: how to get started. Can J Hosp Pharm . 2014;67(1):28–30.

Tully MP. Research: articulating questions, generating hypotheses, and choosing study designs. Can J Hosp Pharm . 2014;67(1):31–4.

Loewen P. Ethical issues in pharmacy practice research: an introductory guide. Can J Hosp Pharm. 2014;67(2):133–7.

Tsuyuki RT. Designing pharmacy practice research trials. Can J Hosp Pharm . 2014;67(3):226–9.

Bresee LC. An introduction to developing surveys for pharmacy practice research. Can J Hosp Pharm . 2014;67(4):286–91.

Gamble JM. An introduction to the fundamentals of cohort and case–control studies. Can J Hosp Pharm . 2014;67(5):366–72.

Austin Z, Sutton J. Qualitative research: getting started. C an J Hosp Pharm . 2014;67(6):436–40.

Houle S. An introduction to the fundamentals of randomized controlled trials in pharmacy research. Can J Hosp Pharm . 2014; 68(1):28–32.

Charrois TL. Systematic reviews: What do you need to know to get started? Can J Hosp Pharm . 2014;68(2):144–8.

Competing interests: None declared.

Further Reading

Examples of qualitative research in pharmacy practice.

  • Farrell B, Pottie K, Woodend K, Yao V, Dolovich L, Kennie N, et al. Shifts in expectations: evaluating physicians’ perceptions as pharmacists integrated into family practice. J Interprof Care. 2010; 24 (1):80–9. [ PubMed ] [ Google Scholar ]
  • Gregory P, Austin Z. Postgraduation employment experiences of new pharmacists in Ontario in 2012–2013. Can Pharm J. 2014; 147 (5):290–9. [ PMC free article ] [ PubMed ] [ Google Scholar ]
  • Marks PZ, Jennnings B, Farrell B, Kennie-Kaulbach N, Jorgenson D, Pearson-Sharpe J, et al. “I gained a skill and a change in attitude”: a case study describing how an online continuing professional education course for pharmacists supported achievement of its transfer to practice outcomes. Can J Univ Contin Educ. 2014; 40 (2):1–18. [ Google Scholar ]
  • Nair KM, Dolovich L, Brazil K, Raina P. It’s all about relationships: a qualitative study of health researchers’ perspectives on interdisciplinary research. BMC Health Serv Res. 2008; 8 :110. [ PMC free article ] [ PubMed ] [ Google Scholar ]
  • Pojskic N, MacKeigan L, Boon H, Austin Z. Initial perceptions of key stakeholders in Ontario regarding independent prescriptive authority for pharmacists. Res Soc Adm Pharm. 2014; 10 (2):341–54. [ PubMed ] [ Google Scholar ]

Qualitative Research in General

  • Breakwell GM, Hammond S, Fife-Schaw C. Research methods in psychology. Thousand Oaks (CA): Sage Publications; 1995. [ Google Scholar ]
  • Given LM. 100 questions (and answers) about qualitative research. Thousand Oaks (CA): Sage Publications; 2015. [ Google Scholar ]
  • Miles B, Huberman AM. Qualitative data analysis. Thousand Oaks (CA): Sage Publications; 2009. [ Google Scholar ]
  • Patton M. Qualitative research and evaluation methods. Thousand Oaks (CA): Sage Publications; 2002. [ Google Scholar ]
  • Willig C. Introducing qualitative research in psychology. Buckingham (UK): Open University Press; 2001. [ Google Scholar ]

Group Dynamics in Focus Groups

  • Farnsworth J, Boon B. Analysing group dynamics within the focus group. Qual Res. 2010; 10 (5):605–24. [ Google Scholar ]

Social Constructivism

  • Social constructivism. Berkeley (CA): University of California, Berkeley, Berkeley Graduate Division, Graduate Student Instruction Teaching & Resource Center; [cited 2015 June 4]. Available from: http://gsi.berkeley.edu/gsi-guide-contents/learning-theory-research/social-constructivism/ [ Google Scholar ]

Mixed Methods

  • Creswell J. Research design: qualitative, quantitative, and mixed methods approaches. Thousand Oaks (CA): Sage Publications; 2009. [ Google Scholar ]

Collecting Qualitative Data

  • Arksey H, Knight P. Interviewing for social scientists: an introductory resource with examples. Thousand Oaks (CA): Sage Publications; 1999. [ Google Scholar ]
  • Guest G, Namey EE, Mitchel ML. Collecting qualitative data: a field manual for applied research. Thousand Oaks (CA): Sage Publications; 2013. [ Google Scholar ]

Constructivist Grounded Theory

  • Charmaz K. Grounded theory: objectivist and constructivist methods. In: Denzin N, Lincoln Y, editors. Handbook of qualitative research. 2nd ed. Thousand Oaks (CA): Sage Publications; 2000. pp. 509–35. [ Google Scholar ]

Research-Methodology

Data Analysis

Methodology chapter of your dissertation should include discussions about the methods of data analysis. You have to explain in a brief manner how you are going to analyze the primary data you will collect employing the methods explained in this chapter.

There are differences between qualitative data analysis and quantitative data analysis . In qualitative researches using interviews, focus groups, experiments etc. data analysis is going to involve identifying common patterns within the responses and critically analyzing them in order to achieve research aims and objectives.

Data analysis for quantitative studies, on the other hand, involves critical analysis and interpretation of figures and numbers, and attempts to find rationale behind the emergence of main findings. Comparisons of primary research findings to the findings of the literature review are critically important for both types of studies – qualitative and quantitative.

Data analysis methods in the absence of primary data collection can involve discussing common patterns, as well as, controversies within secondary data directly related to the research area.

Data analysis

John Dudovskiy

  • Request Info

Introduction to Research Methods and Analysis

HLTH4205 - Introduction to Research Methods and Analysis

Undergraduate | 5 Quarter Credits |

  • Explore Our Offerings  / Nursing and Healthcare
  • Introduction to Research Methods and Analysis

Please Select a Start Date

  • Undergraduate

Course Description

In this course, students examine the basic components required for the conduct of health-related research and provides students with the analytic tools needed to understand and assess research methods described in the scientific literature. Basic research methods are described, including surveys, observational studies, experimental and quasi-experimental design, use of primary and secondary data, and statistical techniques for analyzing and interpreting data.

Related Programs

  • BS in Health Studies
  • 5 Quarter Credits

Recommended Courses

Environmental Health

PUBH3000 - Environmental Health

Undergraduate | 5 Quarter Credits | 6 Weeks

November 25, 2024

Concepts of Health Promotion

HLTH1000 - Concepts of Health Promotion

October 7, 2024

Please sign in or create an account to manage your favorites.

Walden Logo

Thank You! Walden is ready to support your goals

Thank you for requesting information about Walden University School of Lifelong Learning offerings. Our courses provide actionable learning that you can apply immediately. We are ready to help make you more marketable to employers and take your career to the next level.

Your dedicated Enrollment Specialist will be reaching out to you via email or through our 866-492-5336 phone number to provide more information and answer any questions you may have about our offerings.

Request For Information

Remove product.

Numbers, Facts and Trends Shaping Your World

Read our research on:

Full Topic List

Regions & Countries

  • Publications
  • Our Methods
  • Short Reads
  • Tools & Resources

Read Our Research On:

Are you in the American middle class? Find out with our income calculator

About half of U.S. adults (52%) lived in middle-income households in 2022, according to a Pew Research Center analysis of the most recent available government data. Roughly three-in-ten (28%) were in lower-income households and 19% were in upper-income households.

Our calculator below, updated with 2022 data, lets you find out which group you are in, and compares you with:

  • Other adults in your metropolitan area
  • U.S. adults overall
  • U.S. adults similar to you in education, age, race or ethnicity, and marital status

Find more research about the U.S. middle class on our topic page .

Our latest analysis shows that the estimated share of adults who live in middle-income households varies widely across the 254 metropolitan areas we examined, from 42% in San Jose-Sunnyvale-Santa Clara, California, to 66% in Olympia-Lacey-Tumwater, Washington. The share of adults who live in lower-income households ranges from 16% in Bismarck, North Dakota, to 46% in Laredo, Texas. The share living in upper-income households is smallest in Muskegon-Norton Shores, Michigan (8%), and greatest in San Jose-Sunnyvale-Santa Clara, California (41%).

How the income calculator works

The calculator takes your household income and adjusts it for the size of your household. The income is revised upward for households that are below average in size and downward for those of above-average size. This way, each household’s income is made equivalent to the income of a three-person household. (Three is the whole number nearest to the  average size of a U.S. household , which was 2.5 people in 2023.)

Pew Research Center does not store or share any of the information you enter.

We use your size-adjusted household income and the cost of living in your area to determine your income tier. Middle-income households – those with an income that is two-thirds to double the U.S. median household income – had incomes ranging from about $56,600 to $169,800 in 2022. Lower-income households had incomes less than $56,600, and upper-income households had incomes greater than $169,800. (All figures are computed for three-person households, adjusted for the cost of living in a metropolitan area, and expressed in 2022 dollars.)

The following example illustrates how cost-of-living adjustment for a given area was calculated: Jackson, Tennessee, is a relatively inexpensive area, with a  price level in 2022 that was 13.0% less than the national average. The San Francisco-Oakland-Berkeley metropolitan area in California is one of the most expensive, with a price level that was 17.9% higher than the national average. Thus, to step over the national middle-class threshold of $56,600, a household in Jackson needs an income of only about $49,200, or 13.0% less than the national threshold. But a household in the San Francisco area needs an income of about $66,700, or 17.9% more than the U.S. threshold, to be considered middle class.

The income calculator encompasses 254 of 387 metropolitan areas in the United States, as defined by the Office of Management and Budget  . If you live outside of one of these 254 areas, the calculator reports the estimates for your state.

The second part of our calculator asks about your education, age, race or ethnicity, and marital status. This allows you to see how other adults who are similar to you demographically are distributed across lower-, middle- and upper-income tiers in the U.S. overall. It does not recompute your economic tier.

Note: This post and interactive calculator were originally published Dec. 9, 2015, and have been updated to reflect the Center’s new analysis.   Former Senior Researcher Rakesh Kochhar and former Research Analyst Jesse Bennett also contributed to this analysis.

The Center recently published an analysis of the distribution of the  American population across income tiers . In that analysis, the estimates of the overall shares in each income tier are slightly different, because it relies on a separate government data source and includes children as well as adults.

Pew Research Center designed this calculator as a way for users to find out, based on our analysis, where they appear in the distribution of U.S. adults by income tier, as well as how they compare with others who match their demographic profile.

The data underlying the calculator come from the 2022 American Community Survey (ACS). The ACS contains approximately 3 million records, or about 1% of the U.S. population.

In our analysis, “middle-income” Americans are adults whose annual household income is two-thirds to double the national median, after incomes have been adjusted for household size. Lower-income households have incomes less than two-thirds of the median, and upper-income households have incomes more than double the median. American adults refers to those ages 18 and older who reside in a household (as opposed to group quarters).

In 2022, the  national  middle-income range was about $56,600 to $169,800 annually for a household of three. Lower-income households had incomes less than $56,600, and upper-income households had incomes greater than $169,800. (Incomes are calculated in 2022 dollars.) The median adjusted household income used to derive this middle-income range is based on household heads, regardless of their age.

These income ranges vary with the cost of living in metropolitan areas and with household size. A household in a metropolitan area with a higher-than-average cost of living, or one with more than three people, needs more than $56,600 to be included in the middle-income tier. Households in less expensive areas or with fewer than three people need less than $56,600 to be considered middle income. Additional details on the methodology are available in our  earlier analyses .

  • Income & Wages
  • Middle Class

Download Richard Fry's photo

Richard Fry is a senior researcher focusing on economics and education at Pew Research Center .

Income inequality is greater among Chinese Americans than any other Asian origin group in the U.S.

Is college worth it, 7 facts about americans and taxes, methodology: 2023 focus groups of asian americans, 1 in 10: redefining the asian american dream (short film), most popular.

901 E St. NW, Suite 300 Washington, DC 20004 USA (+1) 202-419-4300 | Main (+1) 202-857-8562 | Fax (+1) 202-419-4372 |  Media Inquiries

Research Topics

  • Email Newsletters

ABOUT PEW RESEARCH CENTER  Pew Research Center is a nonpartisan, nonadvocacy fact tank that informs the public about the issues, attitudes and trends shaping the world. It does not take policy positions. The Center conducts public opinion polling, demographic research, computational social science research and other data-driven research. Pew Research Center is a subsidiary of The Pew Charitable Trusts , its primary funder.

© 2024 Pew Research Center

Global burden of 288 causes of death and life expectancy decomposition in 204 countries and territories and 811 subnational locations, 1990–2021

Published April 3, 2024, in The Lancet (opens in a new window)

In 2021, COVID-19 replaced stroke as the second-leading age-standardised cause of death, with 94 deaths per 100,000 population.

Regular, detailed reporting on population health by underlying cause of death is fundamental for public health decision making. Cause-specific estimates of mortality and the subsequent effects on life expectancy worldwide are valuable metrics to gauge progress in reducing mortality rates. 

These estimates are particularly important following large-scale mortality spikes, such as the COVID-19 pandemic. When systematically analysed, mortality rates and life expectancy allow comparisons of the consequences of causes of death globally and over time, providing a nuanced understanding of the effect of these causes on global populations.

Read cause-specific methods appendices

The Global Burden of Diseases, Injuries, and Risk Factors Study (GBD) 2021 cause-of-death analysis estimated mortality and years of life lost (YLLs) from 288 causes of death by age-sex-location-year in 204 countries and territories and 811 subnational locations for each year from 1990 until 2021. The analysis used 56 604 data sources, including data from vital registration and verbal autopsy as well as surveys, censuses, surveillance systems, and cancer registries, among others. 

As with previous GBD rounds, cause-specific death rates for most causes were estimated using the Cause of Death Ensemble model—a modelling tool developed for GBD to assess the out-of-sample predictive validity of different statistical models and covariate permutations and combine those results to produce cause-specific mortality estimates—with alternative strategies adapted to model causes with insufficient data, substantial changes in reporting over the study period, or unusual epidemiology. 

YLLs were computed as the product of the number of deaths for each cause-age-sex-location-year and the standard life expectancy at each age. As part of the modelling process, uncertainty intervals (UIs) were generated using the 2·5th and 97·5th percentiles from a 1000-draw distribution for each metric. We decomposed life expectancy by cause of death, location, and year to show cause-specific effects on life expectancy from 1990 to 2021. We also used the coefficient of variation and the fraction of population affected by 90% of deaths to highlight concentrations of mortality. 

Findings are reported in counts and age-standardised rates. Methodological improvements for cause-of-death estimates in GBD 2021 include the expansion of under-5-years age group to include four new age groups, enhanced methods to account for stochastic variation of sparse data, and the inclusion of COVID-19 and other pandemic-related mortality—which includes excess mortality associated with the pandemic, excluding COVID-19, lower respiratory infections, measles, malaria, and pertussis. 

For this analysis, 199 new country-years of vital registration cause-of-death data, 5 country-years of surveillance data, 21 country-years of verbal autopsy data, and 94 country-years of other data types were added to those used in previous GBD rounds.

The leading causes of age-standardised deaths globally were the same in 2019 as they were in 1990; in descending order, these were, ischaemic heart disease, stroke, chronic obstructive pulmonary disease, and lower respiratory infections. In 2021, however, COVID-19 replaced stroke as the second-leading age-standardised cause of death, with 94·0 deaths (95% UI 89·2–100·0) per 100 000 population. 

The COVID-19 pandemic shifted the rankings of the leading five causes, lowering stroke to the third-leading and chronic obstructive pulmonary disease to the fourth-leading position. In 2021, the highest age-standardised death rates from COVID-19 occurred in sub-Saharan Africa (271·0 deaths [250·1–290·7] per 100 000 population) and Latin America and the Caribbean (195·4 deaths [182·1–211·4] per 100 000 population). The lowest age-standardised death rates from COVID-19 were in the high-income super-region (48·1 deaths [47·4–48·8] per 100 000 population) and southeast Asia, east Asia, and Oceania (23·2 deaths [16·3–37·2] per 100 000 population). 

Globally, life expectancy steadily improved between 1990 and 2019 for 18 of the 22 investigated causes. Decomposition of global and regional life expectancy showed the positive effect that reductions in deaths from enteric infections, lower respiratory infections, stroke, and neonatal deaths, among others have contributed to improved survival over the study period. However, a net reduction of 1·6 years occurred in global life expectancy between 2019 and 2021, primarily due to increased death rates from COVID-19 and other pandemic-related mortality. 

Life expectancy was highly variable between super-regions over the study period, with southeast Asia, east Asia, and Oceania gaining 8·3 years (6·7–9·9) overall, while having the smallest reduction in life expectancy due to COVID-19 (0·4 years). The largest reduction in life expectancy due to COVID-19 occurred in Latin America and the Caribbean (3·6 years). 

Additionally, 53 of the 288 causes of death were highly concentrated in locations with less than 50% of the global population as of 2021, and these causes of death became progressively more concentrated since 1990, when only 44 causes showed this pattern. The concentration phenomenon is discussed heuristically with respect to enteric and lower respiratory infections, malaria, HIV/AIDS, neonatal disorders, tuberculosis, and measles.

Interpretation

Long-standing gains in life expectancy and reductions in many of the leading causes of death have been disrupted by the COVID-19 pandemic, the adverse effects of which were spread unevenly among populations. Despite the pandemic, there has been continued progress in combatting several notable causes of death, leading to improved global life expectancy over the study period. 

Each of the seven GBD super-regions showed an overall improvement from 1990 and 2021, obscuring the negative effect in the years of the pandemic. Additionally, our findings regarding regional variation in causes of death driving increases in life expectancy hold clear policy utility. Analyses of shifting mortality trends reveal that several causes, once widespread globally, are now increasingly concentrated geographically. 

These changes in mortality concentration, alongside further investigation of changing risks, interventions, and relevant policy, present an important opportunity to deepen our understanding of mortality-reduction strategies. Examining patterns in mortality concentration might reveal areas where successful public health interventions have been implemented. Translating these successes to locations where certain causes of death remain entrenched can inform policies that work to improve life expectancy for people everywhere.

Bill & Melinda Gates Foundation.

GBD 2021 Causes of Death Collaborators. Global burden of 288 causes of death and life expectancy decomposition in 204 countries and territories and 811 subnational locations, 1990–2021: a systematic analysis for the Global Burden of Disease Study 2021. The Lancet. 3 April 2024. doi: 10.1016/S0140-6736(24)00367-2.

  • Mohsen Naghavi ,
  • Liane Ong ,
  • Christopher J.L. Murray ,
  • Cat Antony,
  • Aleksandr Aravkin ,
  • Michael Arndt,
  • Ann Basting,
  • Rose Bender,
  • Betyna Berice,
  • Greg Bertolacci,
  • Kayleigh Bhangdia,
  • Jessica Bishai,
  • Catherine Bisignano,
  • Michael Brauer ,
  • Kelly Cercy,
  • Catherine Chen,
  • Eunice Chung,
  • Global burden of 288 causes of death and life expectancy decomposition in 204 countries and territories and 811 subnational locations, 1990–2021,
  • Kaleb Coberly,
  • Rebecca Cogen,
  • Haley Comfort,
  • Ewerton Cousin,
  • Garland Culbreth,
  • Matthew Cunningham,
  • Xiaochen Dai,
  • Nicole Weaver,
  • Louisa Degenhardt ,
  • Lee Deitesfeld,
  • Maegan Dirac ,
  • Regina-Mae Dominguez,
  • Laura Dwyer-Lindgren ,
  • Valery Feigin ,
  • Alize Ferrari ,
  • John Fuller,
  • Emmanuela Gakidou ,
  • Natalie Galles,
  • William Gardner,
  • Sama Ghoba,
  • Erin Hamilton,
  • Simon Hay ,
  • Rebecca Hsu,
  • Kevin Ikuta ,
  • Nicholas Kassebaum ,
  • Molly Kassel,
  • Cathleen Keller,
  • Kasey Kinzel,
  • Jonathan Kocarnik,
  • Kris Krohn,
  • Hmwe Hmwe Kyu ,
  • Jorge Ledesma,
  • Kate LeGrand,
  • Stephen Lim ,
  • Megan Lindstrom,
  • Rafael Lozano ,
  • Kelsey Maass,
  • Anna Laura McKowen,
  • Susan McLaughlin,
  • Tomislav Mestrovic ,
  • Madeline Moberg,
  • Ali Mokdad ,
  • Jon Mosser ,
  • Vincent Mougin,
  • Amanda Novotney,
  • Sam Ostroff,
  • Maja Pašović,
  • Spencer Pease,
  • David Pigott ,
  • Natalie Pritchett,
  • Quinn Rafferty,
  • Christian Razo ,
  • Bobby Reiner ,
  • Hannah Robinson-Oden,
  • Gregory Roth ,
  • Tilleye Runghien,
  • Damian Santomauro ,
  • Austin Schumacher ,
  • Sarah Sirota,
  • Reed Sorensen,
  • Sandra Spearman,
  • Jeff Stanaway ,
  • Benjamin Stark,
  • Caroline Stein,
  • Caitlyn Steiner ,
  • Azalea Thomson,
  • Anna Torre,
  • Nicholas Verghese,
  • Stein Emil Vollset ,
  • Avina Vongpradith,
  • Stefanie Watson,
  • Joanna Whisnant,
  • Caroline Wilkerson,
  • Shadrach Wilson,

All our datasets are housed in our data catalog, the Global Health Data Exchange (GHDx). Visit the GHDx to download data from this article.

GBD 2021 Cause-Specific Mortality 1990-2021

Life expectancy increased as world addressed major killers including diarrhea, lower respiratory infections, and stroke, causes of death: examining effects on life expectancy, gbd 2021 methods appendices, how long will we live in future, subscribe to our newsletter.

Information

  • Author Services

Initiatives

You are accessing a machine-readable page. In order to be human-readable, please install an RSS reader.

All articles published by MDPI are made immediately available worldwide under an open access license. No special permission is required to reuse all or part of the article published by MDPI, including figures and tables. For articles published under an open access Creative Common CC BY license, any part of the article may be reused without permission provided that the original article is clearly cited. For more information, please refer to https://www.mdpi.com/openaccess .

Feature papers represent the most advanced research with significant potential for high impact in the field. A Feature Paper should be a substantial original Article that involves several techniques or approaches, provides an outlook for future research directions and describes possible research applications.

Feature papers are submitted upon individual invitation or recommendation by the scientific editors and must receive positive feedback from the reviewers.

Editor’s Choice articles are based on recommendations by the scientific editors of MDPI journals from around the world. Editors select a small number of articles recently published in the journal that they believe will be particularly interesting to readers, or important in the respective research area. The aim is to provide a snapshot of some of the most exciting work published in the various research areas of the journal.

Original Submission Date Received: .

  • Active Journals
  • Find a Journal
  • Journal Proposal
  • Proceedings Series
  • For Authors
  • For Reviewers
  • For Editors
  • For Librarians
  • For Publishers
  • For Societies
  • For Conference Organizers
  • Open Access Policy
  • Institutional Open Access Program
  • Special Issues Guidelines
  • Editorial Process
  • Research and Publication Ethics
  • Article Processing Charges
  • Testimonials
  • Preprints.org
  • SciProfiles
  • Encyclopedia

mathematics-logo

Article Menu

research methodology and data analysis

  • Subscribe SciFeed
  • Recommended Articles
  • Google Scholar
  • on Google Scholar
  • Table of Contents

Find support for a specific problem in the support section of our website.

Please let us know what you think of our products and services.

Visit our dedicated information section to learn more about MDPI.

JSmol Viewer

A combined ocba–aic method for stochastic variable selection in data envelopment analysis.

research methodology and data analysis

1. Introduction

2. literature review.

ReferenceModelKey FindingTypeYear
[ ]Ranking scalesVariable selection in DEA contextStatic1998
[ ]Statistical approachReduces variables in DEAStatic2003
[ ]Stepwise selectionProcedures for variable selectionStatic2007
[ ]Variable-selection techniquesGuidelines for selection methodsStatic2011
[ ]Multicriteria selectionClassifies production batchesStatic2012
[ ]cross-efficiencyInterval DEAStochastic2012
[ ]Alternative approachesAlternative performance measuresStatic2015
[ ]cross-efficiencysupplier selectionStatic2015
[ ]Akaike’s criteriaVariable selection via AICstochastic2017
[ ]Stepwise selectionPort efficiency assessmentStatic2020
[ ]Variable selectionOverview of variable selectionStatic2020
[ ]Relevance measuresCritical values methodologyStatic2020
[ ]Virtual frontierEfficiency modelingStatic2021
[ ]Stepwise methodNew method for variable selectionStatic2021
[ ]Entropy measuresNovel selection methodStatic2021
[ ]Machine learningVariable selection in DEAStatic2023

3. Our Proposed Method

3.1. stochastic dea, 3.2. the model of ocba.

  • γ : Total number of simulation replications.
  • B ¯ i : Sample mean of the AIC for the i -th variable combination.
  • σ i 2 : Variance of the AIC for the i -th variable combination.
  • c 1 : The variable combination with the smallest estimated AIC.
  • c 2 : The variable combination with the second smallest estimated AIC.
  • T i : Number of replications allocated to the i -th variable combination.
  • δ i , j : The difference between the sample means of the AIC for combinations i and j , calculated as δ i , j = B ¯ i − B ¯ j .

4. Numerical Example with Stochastic Data

5. real case study, 5.1. supplier selection, 5.2. tourism technology jobs ranking, 6. conclusions, data availability statement, acknowledgments, conflicts of interest.

  • Emrouznejad, A.; Yang, G. A survey and analysis of the first 40 years of scholarly literature in DEA: 1978–2016. Socio-Econ. Plan. Sci. 2017 , 61 , 4–8. [ Google Scholar ] [ CrossRef ]
  • Wagner, J.M.; Shimshak, D.G. Stepwise selection of variables in data envelopment analysis: Procedures and managerial perspectives. Eur. J. Oper. Res. 2007 , 180 , 57–67. [ Google Scholar ] [ CrossRef ]
  • Dyson, R.G.; Shale, E.A. Data envelopment analysis, operational research and uncertainty. J. Oper. Res. Soc. 2010 , 61 , 25–34. [ Google Scholar ] [ CrossRef ]
  • Nataraja, N.R.; Johnson, A.L. Guidelines for using variable selection techniques in data envelopment analysis. Eur. J. Oper. Res. 2011 , 215 , 662–669. [ Google Scholar ] [ CrossRef ]
  • Toloo, M.; Tichỳ, T. Two alternative approaches for selecting performance measures in data envelopment analysis. Measurement 2015 , 65 , 29–40. [ Google Scholar ] [ CrossRef ]
  • Duras, T.; Javed, F.; Kristofer, M.; Pär, S.; Magnus, S. Using machine learning to select variables in data envelopment analysis: Simulations and application using electricity distribution data. Energy Econ. 2023 , 120 , 106621. [ Google Scholar ] [ CrossRef ]
  • Kannan, P.M.; Govindan, M.; Rathimala, K. Modelling Efficiency of Electric Utilities Using Three Stage Virtual Frontier Data Envelopment Analysis with Variable Selection by Loads Method. Energies 2021 , 14 , 3436. [ Google Scholar ] [ CrossRef ]
  • Benítez-Peña, S.; Bogetoft, P.; Romero Morales, D. Feature Selection in Data Envelopment Analysis: A Mathematical Optimization approach. Omega 2020 , 96 , 102068. [ Google Scholar ] [ CrossRef ]
  • Sonal, S.; Qianmei, F. Assessment of port efficiency using stepwise selection and window analysis in data envelopment analysis. Marit. Econ. Logist. 2020 , 22 , 536–561. [ Google Scholar ]
  • Emrouznejad, A.; Parker, B.R.; Tavares, G. Evaluation of research in efficiency and productivity: A survey and analysis of the first 30 years of scholarly literature in DEA. Socio-Econ. Plan. Sci. 2008 , 42 , 151–157. [ Google Scholar ] [ CrossRef ]
  • Li, Y.; Shi, X.; Yang, M.; Liang, L. Variable selection in data envelopment analysis via Akaike’s information criteria. Ann. Oper. Res. 2017 , 253 , 453–476. [ Google Scholar ] [ CrossRef ]
  • Boudreau, J.W.; Ramstad, P.M. Using advanced analytics to improve talent management decisions: Examples and lessons from leading firms. J. Bus. Psychol. 2013 , 28 , 445–459. [ Google Scholar ]
  • Burnham, K.P.; Anderson, D.R. Model Selection and Multimodel Inference: A Practical Information-Theoretic Approach ; Springer Texts in Statistics; Springer: New York, NY, USA, 2002. [ Google Scholar ]
  • Rynes, S.L.; Gerhart, B.; Parks, L. E-Recruitment: A Review and Future Research Agenda. Hum. Resour. Manag. Rev. 2015 , 25 , 302–316. [ Google Scholar ]
  • Lepak, D.P.; Smith, K.G.; Taylor, M.S. HR strategy and firm performance: When is HR strategic and why does it impact firm performance? Acad. Manag. Ann. 2016 , 10 , 107–148. [ Google Scholar ]
  • Chen, C.H.; Lee, L.H. Stochastic Simulation Optimization: An Optimal Computing Budget Allocation ; World Scientific: Singapore, 2011; Volume 1. [ Google Scholar ]
  • Jenkins, L.; Anderson, M. A multivariate statistical approach to reducing the number of variables in data envelopment analysis. Eur. J. Oper. Res. 2003 , 147 , 51–61. [ Google Scholar ] [ CrossRef ]
  • Wong, W.P.; Jaruphongsa, W.; Lee, L.H. Budget allocation for effective data collection in predicting an accurate DEA efficiency score. IEEE Trans. Auto. Control 2011 , 56 , 1235–1246. [ Google Scholar ] [ CrossRef ]
  • Gholami, R.; Higón, D.A.; Emrouznejad, A. Hospital performance: Efficiency or quality? Can we have both with IT? Expert Syst. Appl. 2015 , 42 , 5390–5400. [ Google Scholar ] [ CrossRef ]
  • Charnes, A.; Cooper, W.W.; Rhodes, E. Measuring the efficiency of decision making units. Eur. J. Oper. Res. 1978 , 2 , 429–444. [ Google Scholar ] [ CrossRef ]
  • Liang, L.; Wu, J.; Cook, W.D.; Zhu, J. The DEA game cross-efficiency model and its Nash equilibrium. Oper. Res. 2008 , 56 , 1278–1288. [ Google Scholar ] [ CrossRef ]
  • Cooper, W.W. Data envelopment analysis. In Encyclopedia of Operations Research and Management Science ; Springer: Cham, Switzerland, 2013; pp. 349–358. [ Google Scholar ]
  • Friedman, L.; Sinuany-Stern, Z. Combining ranking scales and selecting variables in the DEA context: The case of industrial branches. Comp. Oper. Res. 1998 , 25 , 781–791. [ Google Scholar ] [ CrossRef ]
  • Norman, M.; Stoker, B. Data Envelopment Analysis: The Assessment of Performance ; John Wiley & Sons, Inc.: Hoboken, NJ, USA, 1991. [ Google Scholar ]
  • Adler, N.; Golany, B. Evaluation of deregulated airline networks using data envelopment analysis combined with principal component analysis with an application to Western Europe. Eur. J. Oper. Res. 2001 , 132 , 260–273. [ Google Scholar ] [ CrossRef ]
  • Anzanello, M.J.; Albin, S.L.; Chaovalitwongse, W.A. Multicriteria variable selection for classification of production batches. Eur. J. Oper. Res. 2012 , 218 , 97–105. [ Google Scholar ] [ CrossRef ]
  • Yang, F.; Ang, S.; Xia, Q.; Yang, C. Ranking DMUs by using interval DEA cross efficiency matrix with acceptability analysis. Eur. J. Oper. Res. 2012 , 223 , 483–488. [ Google Scholar ] [ CrossRef ]
  • Dotoli, M.; Epicoco, N.; Falagario, M.; Sciancalepore, F. A stochastic cross-efficiency data envelopment analysis approach for supplier selection under uncertainty. Int. Trans. Oper. Res. 2016 , 23 , 725–748. [ Google Scholar ] [ CrossRef ]
  • Peyrache, A.; Rose, C.; Sicilia, G. Variable selection in Data Envelopment Analysis. Eur. J. Oper. Res. 2020 , 282 , 644–659. [ Google Scholar ] [ CrossRef ]
  • Jeyms, V.C.; Manuel, M.M. Methodology for calculating critical values of relevance measures in variable selection methods in data envelopment analysis. Eur. J. Oper. Res. 2020 , 290 , 657–670. [ Google Scholar ]
  • Subramanyam, T.; Donthi, R.; Satish Kumar, V.; Amalanathan, S.; Zalki, M. A new stepwise method for selection of input and output variables in data envelopment analysis. J. Math. Comput. Sci. 2021 , 11 , 703–715. [ Google Scholar ]
  • Deng, Q.; Fu, Q.; Lian, Z. A Novel Method of Variable Selection in Data Envelopment Analysis with Entropy Measures. Int. Trans. Oper. Res. 2021 , 41 , 514. [ Google Scholar ] [ CrossRef ]
  • Despotis, D.K.; Smirlis, Y.G. Data envelopment analysis with imprecise data. Eur. J. Oper. Res. 2002 , 140 , 24–36. [ Google Scholar ] [ CrossRef ]
  • Jahanshahloo, G.R.; Lotfi, F.H.; Izadikhah, M. An algorithmic method to extend TOPSIS for decision-making problems with interval data. App. Math. Comp. 2006 , 175 , 1375–1384. [ Google Scholar ] [ CrossRef ]
  • Kao, C.; Liu, S.T. Stochastic data envelopment analysis in measuring the efficiency of Taiwan commercial banks. Eur. J. Oper. Res. 2009 , 196 , 312–322. [ Google Scholar ] [ CrossRef ]
  • Morita, H.; Seiford, L.M. Characteristics on stochastic DEA efficiency: Reliability and probability being efficient. J. Oper. Res. Soc. Jpn. 1999 , 42 , 389–404. [ Google Scholar ]
  • Lu, T.; Liu, S.T. Ranking DMUs by Comparing DEA Cross-Efficiency Intervals Using Entropy Measures. Entropy 2016 , 18 , 452. [ Google Scholar ] [ CrossRef ]
  • Halkos, G.E.; Salamouris, D.S. Efficiency measurement of the Greek commercial banks with the use of financial ratios: A data envelopment analysis approach. Manag. Account. Res. 2004 , 15 , 201–224. [ Google Scholar ] [ CrossRef ]
  • Hamilton, L.C. Statistics with Stata: Version 12 ; Cengage Learning: Boston, MA, USA, 2012. [ Google Scholar ]
  • Yang, H.; Pollitt, M. Incorporating both undesirable outputs and uncontrollable variables into DEA: The performance of Chinese coal-fired power plants. Eur. J. Oper. Res. 2009 , 197 , 1095–1105. [ Google Scholar ] [ CrossRef ]
DMU12n
1[ , ][ , ][ , ]
2[ , ][ , ][ , ]
n[ , ][ , ][ , ]
Average[ ][ ][
DMUX1X2Y1Y2Y3
143253
285341
357594
428132
513671
641264
No.X1X2Y1Y2Y3Our MethodRef. [ ]Ref. [ ]
Reps. AIC1 Reps. AIC2 Imp. Reps AIC3 Imp.
110100466−3.94100−1.9899%100−2.3865%
210110157−0.981000.66248%1001.16184%
30100197−0.911001.24173%1002.44137%
410010102.111003.1132%1004.2851%
501011342.021003.9549%1005.5964%
601010102.751004.3437%1005.6151%
711010104.381005.0513%1006.0528%
801101103.381005.5039%1007.0752%
901110123.851005.6231%1006.8344%
1011110106.341007.1211%1008.5526%
1101100323.931005.6530%1007.2746%
1201111105.221006.8924%1008.3137%
1310101314.441005.9125%1007.2138%
1410011104.561006.0925%1007.6240%
1510001105.411005.888%1007.1424%
1611100105.541007.2624%1009.5042%
1710111195.701007.3222%1009.3539%
1811011106.621007.9917%10010.5337%
1911101107.341008.2311%10010.3829%
2011001126.311007.0410%1009.1031%
2111111108.121009.6616%10010.3021%
Total 980 2100 2100
Average 47 100 45% 52%
X1X2X3X4Y1Y2Y3
DMU1(152;153;229)(280;282;421)299(272;274;410)(13.0;14.5;15.9)(98.1;109.0;120.0)(8.5;9.5;10.4)
DMU2(118;118;132)(242;242;272)160(235;235;264)(9.4;10.5;11.5)(66.6;74.0;81.4)(5.6;6.2;6.8)
DMU3(109;109;122)(245;245;270)205(238;238;268)(9.0;10.0;11.0)(67.6;75.1;82.6)(7.9;8.8;9.6)
DMU4(34;34;39)(93;93;107)120(101;101;128)(4.7;5.2;5.8)(31.2;34.7;38.1)(2.1;2.3;2.6)
DMU5(92;106;123)(176;203;237)276(180;207;242)(7.0;7.8;8.6)(51.3;57.0;62.7)(9.8;10.9;12.0)
DMU6(143;157;182)(198;217;252)233(251;275;318)(13.0;14.5;15.9)(85.9;95.4;105.0)(10.7;11.9;13.1)
DMU7(122;122;194)(215;215;339)257(157;157;248)(8.3;9.2;10.2)(60.39;67.1;73.8)(11.2;12.5;13.7)
DMU8(162;162;249)(338;338;518)635(169;169;259)(6.9;7.7;8.5)(55.1;61.3;67.4)(9.7;10.7;11.8)
DMU9(185;185;304)(376;376;625)481(336;336;557)(18.7;20.7;22.8)(145.8;162.0;178.2)(23.4;25.9;28.5)
DMU10(106;106;134)(296;296;379)339(173;173;235)(7.3;8.2;9.0)(53.4;59.4;65.3)(5.2;5.7;6.3)
DMU11(933;933;1033)(1782;1782;1955)1762(1392;1392;1532)(48.9;54.4;59.8)(366.5;407.2;447.9)(94.6;105.1;115.6)
DMU12(69;86;71)(124;155;129)123(104;130;130)(4.4;4.9;5.4)(19.2;21.4;23.5)(8.3;9.3;10.2)
DMU13(57;68;57)(94;113;94)123(101;122;122)(3.7;4.1;4.5)(26.8;29.8;32.7)(2.4;2.7;3.0)
DMU14(34;35;35)(88;90;89)150(226;230;230)(2.8;3.2;3.4)(69.0;76.7;84.4)(0.0;0.0;0.0)
DMU15(240;240;243)(668;668;672)439(571;571;600)(21.8;24.3;26.7)(155.9;173.2;190.5)(37.3;41.4;45.6)
No.X1X2X3X4Y1Y2Y3Total VariablesRep. TimesAIC
K1100001024100.3397
K2110001035103.6151
K31000011351715.4299
K41110010466214.8139
K51000111446722.0476
K61010011454323.1462
K71110110550426.9565
K81010111547928.0382
K91111010551928.9712
SupplierK1K2K3K4K5K6K7K8K9Borda ScoreRanking
DMU141111212211231
DMU4235819118972
DMU15392432537973
DMU6729374453914
DMU9444745872905
DMU268112103344846
DMU3578566669777
DMU186126118785648
DMU5910612811111012469
DMU12141431457109154410
DMU712111091212121164011
DMU1310513101313912133712
DMU1113127119101313143313
DMU101113141314141414101814
DMU8151515151515151511415
VariableNMinimumMaximumMeanStandard Deviation
Experience (X1)2200.310.02.9572.0802
Education (X2)22014.022.015.5821.0103
Skill (X3)22015.045.029.5275.3143
Environment (Y1)2202.020.07.8915.2861
Welfare (Y2)2201.076.027.33619.3373
Salary (Y3)2200.445.017.2097.7670
IDX1X2X3Y1Y2Y3Total VariablesRep. TimesAIC
K11000102462−2.291644
K21100103391−1.665309
K310001133252.252313
K411101045715.488801
K510011143616.319803
K610101145139.650136
K711101043759.866833
K810111156529.952996
K9111011545710.024476
K10111111655910.066365
JobK1K2K3K4K5K6K7K8K9K10Borda ScoreRanking
DMU921251912153621551
DMU2124161167231221462
DMU1991810153144106521243
DMU2195119213131281121164
DMU27452438172341821125
DMU7461671212115251221126
DMU169139652937251321087
DMU148141912161358111121008
DMU19791511177214149420989
DMU1468682251818978209110
DMU38209205201210208218203217208217104211
DMU16320721620021621221220821221520795212
DMU12621220820521721420921021320921093213
DMU15021321421120921519721520821820991214
DMU18420521121321520021619821921321991215
DMU17521720921821819921520920621221285216
DMU3021821821221221320521621020720584217
DMU7321921721420621120021721620321681218
DMU1021522021722021922022021822022011219
DMU352202192202192202192192202172189220
The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

Deng, Q. A Combined OCBA–AIC Method for Stochastic Variable Selection in Data Envelopment Analysis. Mathematics 2024 , 12 , 2913. https://doi.org/10.3390/math12182913

Deng Q. A Combined OCBA–AIC Method for Stochastic Variable Selection in Data Envelopment Analysis. Mathematics . 2024; 12(18):2913. https://doi.org/10.3390/math12182913

Deng, Qiang. 2024. "A Combined OCBA–AIC Method for Stochastic Variable Selection in Data Envelopment Analysis" Mathematics 12, no. 18: 2913. https://doi.org/10.3390/math12182913

Article Metrics

Further information, mdpi initiatives, follow mdpi.

MDPI

Subscribe to receive issue release notifications and newsletters from MDPI journals

  • Open access
  • Published: 18 September 2024

Physical activity from the perspective of older adults: a convergent mixed-method study

  • Anna Nilstomt   ORCID: orcid.org/0009-0000-9061-7669 1 ,
  • Johanna Gustavsson 2 , 3 ,
  • Linda Beckman 4 , 5 ,
  • Charlotte Bäccman 1 ,
  • Finn Nilson 2 , 3 ,
  • Stefan Wagnsson 6 &
  • Erik Wästlund 1  

BMC Geriatrics volume  24 , Article number:  768 ( 2024 ) Cite this article

Metrics details

Older adults are insufficiently physically active, despite its importance for healthy aging. To develop appropriate physical activity interventions, it is necessary to understand their physical activity. This study applies a theoretical perspective, the COM-B model, and a mixed-method design to examine what influences older adults’ physical activity levels with three questions: (1) What individual and external factors predict older adults’ physical activity levels? (2) What do older adults perceive as influencing their levels of physical activity? (3) To what extent do the quantitative results on older adults’ physical activity levels agree and disagree with the qualitative findings on older adults’ physical activity levels?

A convergent mixed-method design was used with questionnaire ( n  = 334) and interview ( n  = 14) data from adults 65 years and older. Regression analyses were used for quantitative measurements: physical activity, age, subjective socioeconomic status, health status, capability, opportunity, motivation, and depression. Content analysis was applied to the qualitative data. The two forms of data were then integrated to provide greater insights than would be obtained by either dataset separately.

The regression analyses showed that previous physical activity, current motivation, health status, and age significantly predicted older adults’ physical activity levels. The content analysis revealed that participants addressed all subcomponents of the COM-B model, indicating its pertinence in understanding how older adults discuss their current physical activity levels. The integrated findings showed convergent and divergent results. Overall results indicated that previous physical activity engagement, present motivation, capability, and opportunity influenced older adults’ physical activity levels.

Conclusions

This study is the first to use this mixed-methods design to examine factors influencing physical activity levels among older adults living in rental apartments with community hosts. The integrated result reveals convergence for findings on motivation and physical capability but divergence on psychological capability, opportunity, and previous physical activity. The findings underscore a complex interplay of factors influencing older adults’ physical activity levels and indicate relevance for the COM-B model. The results can guide future research on theoretically informed interventions to promote physical activity and healthy aging. Future research should clarify the role of opportunity for older adults’ physical activity.

Peer Review reports

Physical activity (PA) is vital for healthy aging as it promotes mobility and independence [ 1 ] and reduces declines in health and functioning [ 2 ]. The World Health Organization (3, p. vii) defines PA as “any bodily movement produced by skeletal muscles that requires energy expenditure”, which implies various activities, such as leisure-time activities like walking and gardening, transportation by bike, household chores, occupational tasks if the person still works, or planned exercise.

Adults aged 65 years or over are recommended to engage in at least 150 min of aerobic activities per week, perform muscle-strengthening exercises twice a week, and practice their balance three days a week [ 3 ], or at a minimum be as active as their abilities and conditions allow [ 3 , 4 ]. Previous research has shown that older adults have a more flexible attitude towards PA, ranging from any activity moving the body and mind to strictly planned activities outside the home [ 5 ]. In Sweden, fewer than 60 percent of older adults aged 65–84 meet the aerobic recommendations [ 6 ], meaning many older adults are insufficiently active. Also, the oldest adults are less likely to engage in PA than those of younger ages [ 7 ], which raises the question of what influences older adults’ PA and how it can be promoted in society? The answer to these questions could improve healthy aging and support the Agenda 2030 sustainable development goal of good health and well-being [ 8 ].

Previous research has identified several individual and external factors that influence older adults´ levels of PA, for example, being physically active earlier in life [ 9 ], wanting to continue with PA [ 10 ], and perceiving their health status as good [ 11 , 12 ]. Also, participating in group activities [ 5 ], having a supportive social network [ 13 ], access to facilities, and favorable weather conditions seem to promote older adults’ level of PA [ 13 ]. Factors that seem to decrease the likelihood of PA engagement among older adults are, for example, physical deterioration of the body [ 14 ], pain, fatigue, fear of falling [ 15 ], low self-confidence [ 14 ], low subjective socioeconomic status (SES) [ 16 , 17 ] and depressive symptoms [ 18 ]. However, to gain a deeper understanding of older adults’ PA engagement, applying a theoretical perspective, such as the COM-B model, is valuable [ 19 ] and increases the chances of designing effective interventions [ 20 ]. Thus, the different individual and external factors linked to older adults’ PA levels can be understood through the COM-B model [ 19 ].

The COM-B model refers to three components – capability, opportunity, and motivation – that must be present to generate a specific behavior, such as PA [ 19 , 21 ]. Capability and motivation relate to intrinsic factors, whilst opportunity relates to external factors. More specifically, capability concerns the individual’s physical and psychological capacity to engage in a behavior, for example, skills, knowledge, and thought processes; opportunity can be social or physical possibilities that allow a behavior to occur; and motivation is a mental process that can be either reflective or emotional and that energizes the targeted behavior as well as including goal-setting, decision-making, habits, and emotional responses [ 21 ]. Each component can directly impact behavior, and opportunity and capability can also indirectly impact behavior through motivation. The behavior can also impact capability, opportunity, and motivation [ 21 ].

The COM-B model is commonly related to PA [ 22 , 23 , 24 ] but has only to a limited degree been used for understanding older adults’ levels of PA. A review by Meredith et al. [ 25 ], which included qualitative studies on older adults, mapped the results to the COM-B model to better understand what influences older adults’ PA engagement. This secondary analysis revealed that all the COM-B components interacted and affected older adults’ PA engagement and that opportunity was the most frequently identified component [ 25 ]. Although reviews can be valuable for summarizing research results, they rely solely on secondary data instead of primary data, which may limit understanding nuances and the ability to establish relationships in the data [ 23 ]. Therefore, it is important to conduct more studies on the levels of PA in older adults, using the COM-B model. These studies should preferably use a mixed-methods approach since the convergence of two forms of data brings greater insights than would be obtained by either type of data separately [ 26 ]. Qualitative methods can provide deeper insights and a more nuanced view of a phenomenon, while quantitative methods enable statistical generalizability [ 27 ].

By understanding what influences older adults’ PA in relation to the COM-B model and two datasets, we can better understand the phenomenon of PA in this population. The COM-B model is useful for analyzing behavior as it is part of the theoretical framework for behavior change interventions [ 19 ]. Thus, we can acquire valuable knowledge that can be used to create and improve interventions that support older adults’ PA engagement. Ultimately, this can contribute to healthier aging.

This study aimed to examine what influences older adults’ levels of PA and how it could be understood in terms of the COM-B model and by means of a convergent mixed-method design. It is a design where quantitative and qualitative data are collected approximately simultaneously and analyzed separately before integration [ 26 ]. In this study, the two datasets were equally emphasized [ 26 ]. This study used quantitative observational data from a cross-sectional study to understand older adults’ PA levels through the COM-B model. The qualitative data from interviews explored PA among older adults. The reason for collecting both quantitative and qualitative data was to converge the two forms of data to bring greater insights into the research problem than would be obtained by either type of data separately. Qualitative methods can provide deeper insights and a more nuanced view of a phenomenon, while quantitative methods enable statistical generalizability to a greater degree [ 27 ]. A pragmatic approach was applied, as this allows using both quantitative and qualitative research methods to collect the data needed to address the study’s research questions [ 26 ]. The National Research Ethics Committee approved the study before recruitment.

The aim was operationalized into three research questions:

What individual and external factors predict older adults’ PA levels?

What do older adults perceive as influencing their levels of PA?

To what extent do the quantitative results on older adults’ PA levels agree and disagree with the qualitative findings on older adults’ PA levels?

Study setting

The recruited participants were 65 years or older and lived in rental apartments owned by the municipality. The apartments are specifically designed to provide independent living facilities for older adults and include a so-called ‘Trygghetsvärd’ (community host) who functions as a social support. Aside from arranging social activities like café meetings and walking groups, the community hosts clean the common areas and assist with simple tasks in the residents’ homes, such as changing light bulbs. The fact that the participants lived in rented apartments means that they differ from the Swedish average. In Sweden, owning a house is the most common housing arrangement, followed by renting an apartment and then owning an apartment [ 28 ]. The difference between the study group and the general population should be considered when interpreting the results.

The quantitative method

A cross-sectional study addressed the first research question: What individual and external factors predict older adults’ PA levels?

Participants

The sample consisted of 334 community-dwelling older adults (71.3% women) in a middle-sized town in Sweden. The majority of participants (65.2%) lived alone. Age was measured as a five-year categorical variable ranging from 65–70 years to 95–100 years. The mode age category was 76–80 years. Almost half of all participants (49.4%) could be classified as sufficiently active, 17.4 percent as moderately active, and 33.3 percent as insufficiently active, according to the Godin-Shepard leisure-time exercise questionnaire [ 29 ]. The participants reported a low degree of depressive symptoms ( M  = 3.98, SD  = 3.84, Range = 0–19). However, 24.4 percent of the sample reported six or more symptoms of depression, which, according to the cut-off score of the Geriatric Depression Scale [ 30 ], might indicate depression.

Procedure and response rate

A pilot survey was tested among a sample of self-recruited older adults ( n  = 6) from the local residential areas. This resulted in a shortened survey that was distributed to a total of 700 older adults in February 2022. Participants returned the survey either by pre-paid postage or through a sealed envelope that the community hosts mailed. One reminder was sent to their postbox and notes were posted on one occasion at the apartment complexes’ entrance to boost the response rate. Upon request from the older adults, the community hosts were available to assist with the questionnaires. This approach aimed to minimize dropouts caused, for example, by visual impairments. The community hosts assisted a total of three older adults. In total, 334 older adults provided informed consent to participate, resulting in a response rate of 48 percent.

Instruments

The Godin-Shepard leisure-time exercise questionnaire (GSLTEQ), adjusted by Godin [ 29 ], assessed the individual’s weekly PA. The participants self-reported on three items their frequencies of at least 15 min of mild, moderate, and strenuous PA per week on an eight-point scale ranging from 0 to 7 days . The standard procedure described by Godin [ 29 ] was used to calculate the total PA score and classify individuals as either sufficiently, moderately, or insufficiently active. The total PA scale index ranges from 0–119, where a higher score indicates an increased PA level. The Cronbach’s alpha was 0.65, and the inter-item correlation was 0.39 in this study.

An adjusted version of the COM-B instrument, constructed by Bäccman, Bergkvist, and Wästlund [ 31 ], was used to assess the COM-B model [ 21 ]. It contained 12 items designed to measure capability (four items, such as “I know why it is important to be physically active”), opportunity (four items; for example, “I have access to facilities and equipment required to be physically active”), and motivation (four items, such as “I really want to be physically active”). Each item was answered on a five-point scale ranging from strongly disagree to strongly agree. The mean for each index, capability, opportunity, and motivation was used for the analysis. In this study, Cronbach’s alphas for the capability, opportunity, and motivation index were 0.80, 0.78, and 0.92, respectively.

Health-related quality of life was assessed by EQ-5D-3L [ 32 ]. The participants classified their health on five dimensions (mobility, self-care, usual activities, pain/discomfort, anxiety/depression) and three severity levels ( no, moderate, or severe problems ). The health levels were transformed into an index value by applying the Swedish value set by Burström et al. [ 33 ]. The index values ranged between 0.3402 and 0.9694, where the former represents the lowest self-rated health-related quality of life and the latter the highest. In this study, the Cronbach’s alpha was 0.60, with an inter-item correlation of 0.24.

Symptoms of depression were assessed with the Swedish version of the Geriatric Depression Scale (GDS) developed by Gottfries, Noltorp, and Nørgaard [ 30 ]. The scale is a screening instrument consisting of 20 items to be answered with a yes or no . Five items were reversed before summing a total score that ranged from 0–20, with a higher score indicating more symptoms of depression. A score of six or more indicates that depression might be suspected, whereas a score of five or lower suggests that depression is unlikely [ 30 ]. The Cronbach’s alpha was 0.86 in this study.

The individual’s perception of their subjective socioeconomic status (subjective SES) was assessed with a single item, asking the participants to rank their socioeconomic status compared to other older adults in the society on a 10-point scale ranging from lowest to highest [ 34 ].

Previous PA experiences were assessed with a single item, asking the person to “Rate the average degree to which you have been physically active (exercised/worked out) in your life up until today.” The answers were given on a five-point scale ranging from a very high degree of physical activity to a very low degree of physical activity .

The questionnaire also included demographic information (age, gender, and cohabitation). Age was measured as a categorical variable with an interval of five years (e.g., 65–70). The lowest age category was 65–70 years and the highest was 106 years and older .

Data analysis

The data were analyzed using descriptive statistics and multiple regression. The data did not indicate any outliers. Only two cases had an age of 96 years or above. Therefore, the two age categories 91–95 and 96–100 were merged into an age category of 91–100.

A hierarchical regression analysis was applied using bootstrapping to obviate skewed data. First, a hierarchical regression with three steps was completed, using forced entry within each step. The reason for this procedure was to test the COM-B model initially before adding characteristics. In the first step, opportunity and capability were included, as these variables are theorized to generate behavior and contribute to motivation. In the second step, motivation was added. The additional individual factors (previous PA, EQ-5D-3L, subjective SES, age, and GDS) were included in the third step. After that, a trimmed regression model with forced entry was completed, containing only the significant variables retained from the hierarchical regression. For both regression models, GSLTEQ was the dependent variable. The significance level was kept at an alpha level of 0.05 and cases were excluded listwise. Statistical analysis was completed with SPSS version 28.

The qualitative method

Semi-structured interviews addressed the second research question: What do older adults perceive as influencing their PA levels?

The sample ( N  = 14) is a subset of the quantitative sample. Among the participants in the qualitative sample, seven were women, 12 lived alone, and nine reported age 80 or younger. The mode age category was 86–90 years. Among the participants, four were classified as sufficiently active, four were deemed moderately active, and six were classified as insufficiently active.

Researcher description

The first and second authors conducted the interviews ( n  = 10 and n  = 4, respectively). The initial coding was done by the first, second, and third authors ( n  = 8, n  = 2, n  = 4, respectively). The coding was calibrated through discussions among the coders, and when uncertainty remained, all the authors were consulted. After that, the first author iteratively refined the coding for all 14 interviews. Our preunderstanding was managed by self-reflection through notes. The research team has experience in quantitative and qualitative studies regarding older adults and behavior change, and its members are from psychology, nursing, public health, sports science, and physical therapy.

Procedure and data collection

The older adults who had participated in the quantitative data collection and had agreed to be contacted for future research were approached. The participants were selected based on their gender (male, female), age (65 to 80 years, 80 years and older), and level of PA (insufficient, moderate, sufficient) derived from their survey answers for sample diversity. A four-cycle purposive sequential sampling procedure was used to contact 35 potential participants, 14 of whom consented in writing to be interviewed. Hence, the included sample is based on the maximum number of consented participants. There was no researcher-participant interaction before the data collection.

One-on-one semi-structured interviews were conducted in March and April 2023. The interviews took place either in the participant’s home or in a secluded area at the community hosts’ office, without the presence of non-participants. The participants chose the option that was most comfortable for them. The participants were informed about the purpose of the study, their right to refrain from answering questions, and that their participation was voluntary.

An interview guide with open-ended questions, organized into four themes, was developed, and follow-up questions were tailored during each interview based on the participant’s response. The themes included the participants’ definition of PA, their experience with PA today and previously, and their thoughts on maintaining PA. While the core of the interview guide remained consistent, the questions were nuanced and refined throughout the interview process in response to participants’ answers. The interviews were audio-recorded and transcribed verbatim. The average duration for the interviews was 70 min. The range of interview length was 31–105 min.

To ensure trustworthiness, we spent lengthy time with the specific population, made notes throughout the research process, and applied investigators’ triangulation by being multiple interviewers and coders discussing the results within the research team.

Entire transcripts were analyzed using qualitative content analysis, a method for systematically interpreting text content through coding and identifying patterns [ 35 ]. The method described by Graneheim and Lundman [ 36 ] was applied to the transcripts with the adjustment of not condensing the meaning units. The text was initially read as a whole, then coded and sorted into categories using abductive reasoning. Through this process, the COM-B model was identified as a relevant framework for categorizing the codes based on the entire sample’s quotes. The analysis was completed in NVivo 14.

The mixed-method

The quantitative results were merged with the qualitative findings to address the third research question: To what extent do the quantitative results on older adults’ PA levels agree and disagree with the qualitative findings on older adults’ PA levels?

Before integrating the two datasets through methodological triangulation, each part was independently completed according to its methodological quality standards to ensure trustworthiness [ 26 ]. After identifying quantitative and qualitative findings, the research team compared the results, discussing convergence and divergence in content by reviewing constructs, scale items, and verbal statements. We also reflected upon discrepancies in findings. Those reflections can be located in the discussion section of this paper. A joint table was created to array the results. The table includes only the statistically significant predictors and the main categories identified in each dataset.

The quantitative results

To investigate the first research question – What individual or external factors predict older adults’ PA levels? – a hierarchical regression model was used, with capability, opportunity, motivation, previous PA, EQ-5D-3L, subjective SES, age, and GDS as predictors, and GSLTEQ as the dependent variable. Descriptive statistics (sample size, mean, standard deviation, median, and range) of the variables are shown in Table  1 .

The hierarchical regression model was significant, R 2  = 0.38, F [8, 294] = 24.52, p  < 0.001, although only the variables motivation, EQ-5D-3L, age, and previous PA were significant (see Table 2 ). The participants’ previous PA had a higher semipartial correlation value ( sr  = 0.30, p  < 0.001) than EQ-5D-3L ( sr  = 0.17, p  < 0.001), motivation ( sr  = 0.12, p  = 0.008) and age ( sr  = -0.12, p  = 0.01).

Therefore, a trimmed regression model that only included the significant variables was completed (see Table 3 ). This model remained significant and continued to explain a total variance of 39 percent, F [4, 305] = 49.32, p  < 0.001. The previous PA still had the higher semipartial correlation value ( sr  = 0.29, p  < 0.001) compared to motivation ( sr  = 0.19, p  < 0.001), EQ-5D-3L ( sr  = 0.18, p  < 0.001) and age ( sr  = -0.13, p  = 0.004).

The qualitative results

We used content analysis to explore the second research question: What do older adults perceive as influencing their levels of PA? The analysis confirmed that all the COM-B subcomponents—physical capability, psychological capability, physical opportunity, social opportunity, reflective motivation, and automatic motivation—were relevant to older adults' PA levels based on the entire sample’s quotes (see Table  4 ).

Physical capability

Physical capability concerns how a long life may take a toll on a person’s body and cause mobility issues and focused on strength and stamina . The participants described different illnesses and ailments, restrictions in mobility, and a general decreased fitness as reasons for reduced strength and stamina. Common ailments were stiffness, tiredness, and pain, frequently leading to avoidance of actions like running or even walking. Restrictions in mobility caused a range of issues, from being unable to stand up by oneself to not being able to walk at all or only walking with a walker: “/…/ Right now, I do nothing because my back hurts so bad, it's not possible … you see … I can barely get out of the kitchen " (Participant 6). In a seemingly downward spiral, decreased general fitness was related to reduced cardio-fitness, strength, and balance. A decreased strength and increased stiffness made lifting items difficult, and stiffness combined with poor balance prevented most physical activities such as biking, dancing, or going to the gym.

Psychological capability

Psychological capability describes a person’s mental functioning and understanding; three subcategories were identified: attention , knowledge , and acceptance . The first concerns decreased attention in traffic or not noticing body signals (such as low blood sugar levels). The second, mainly referred to a lack of knowledge, not knowing how to get to an activity (for example, being unfamiliar with the bus system), or not knowing what appropriate or available activities would be for their health conditions.

But I want to do something: move my body. In every newspaper and on TV, they say that you should move, but how and where? Who should I turn to? Because I can’t go to a gym since I can't stand up right there. (Participant 7)

The third subcategory, acceptance, was a permissive approach that some participants adopted when faced with lost abilities. This strategy seemed to boost their well-being.

Physical opportunities

Physical opportunities concern the external living conditions related to time and inanimate aspects of an environmental system. Aspects that benefited PA levels included time, financial means, activities to choose between, and access to music, equipment, nature, and facilities. The COVID-19 pandemic, unpleasant weather conditions, and physical surroundings sometimes prevented mobility. For example, the stairs into busses hindered rides or the pavement levels prevented getting on and off a bus everywhere. Dark and slippery roads, as well as long distances, were additional barriers: “/…/ When I walk, it’s to go and play bingo, but then I have to stop several times, and I only wish there were more benches along the pathway for me to sit down for a little while …” (Participant 5).

Social opportunities

Social opportunities relate to people and cultural elements of an environmental system. These concerned interpersonal influences, belonging to a community, and the presence of others who encouraged, pushed, or guided the participants facilitated engagement in PA: “I have actually done [some physical activity]. And if I had someone close by who … liked the training, we could motivate each other, and then it would surely be even more [activity]” (Participant 4). Others’ opinions and actions directed older adults’ behavior. When one partner did not engage in PA, the other often reduced or stopped their activities too. Reasons were feelings of guilt or lack of time due to an increased need for them to do the household chores.

Automatic motivation

Automatic motivation concerns emotions and impulses that energize behavior. Three subcategories were identified: impulses and inhibitions, emotions , and motives . The first, impulses and inhibition, concerned relatively unreflective internal forces that propel or restrain actions. Habits of regular exercise or walks were automatic behaviors that facilitated PA. At the same time, a struggle to stop exercising before pain or exhaustion prevented future PA. The second subcategory, emotions, concerns positive and negative feelings for PA. Positive feelings like fun, play, and enjoyment facilitated PA, whereas negative emotions like sadness, fear, and guilt limited activity levels. The fear that restricted behavior was mainly about hurting oneself and losing body functions and abilities.

Like I said, I try to be active ... within my limits. I no longer expose myself to anything extreme, although I feel I should. /…/ This stiffness I have, I wish I didn’t, but to a certain degree, you have to accept it as well when you have passed the age of 80, I think. Perhaps not endangering the body too much. (Participant 2). 

The third subcategory, motives, involved wants and needs that elicited PA and facilitated it as far as mobility issues did not pose a hindrance. The desire to do activities other than PA increased older adults’ motives to be physically active.

Everything comes down to one’s will ... if I don’t want something, I don't want it ... you can’t force someone, it’s quite simple when you come to an understanding ... so ... but sometimes I think about the fact that ... I’ll be 72 years old now, I don’t have damn long left on Earth, and the time I have left I actually want to be a little active. I have no desire to use an electric wheelchair or become a vegetable. (Participant 8)

Reflective motivation

Reflective motivation involves conscious thinking that can ignite behavior, and two subcategories were identified: beliefs, and goals and plans . Beliefs concerned ideas about PA, age and age-appropriate manners, and the self. If one believed participating in PA was valuable and worthwhile, it facilitated their engagement. Statements of PA were often accompanied by imperative thoughts, such as ‘must’ or ‘should.’ Regarding beliefs about age, those participants who identified themselves as old tended to perceive PA as too late to engage in, which often restricted their range of activities. Ideas about the self as an active or inactive person influenced the level of PA, where the former facilitated and the latter hindered. “I don't move my body because I have no interest in it” (Participant 5). The subcategory, goals and plans, involved cognitive representations of desired outcomes and intentions for PA. PA helped participants achieve health and independence by reducing illnesses and ailments, lowering restrictions in mobility, improving general fitness, or boosting well-being.

Yes. No, [the goal of these activities] it’s to keep the body going to remain able to cope as I age. I mean, the muscles disappear, the older they get. Of course, you want to try to remain strong and live a long life. I think it’s super-important to exercise because you stay healthier. If you exercise, you have more stamina, and you have fun while exercising. So no, I think it is very important. (Participant 4)

Other goals were to save money and experience nature. A more or less conscious objective with the PA concerned structure and meaning of the days. For example, getting outside the home allowed the participants to explore and be stimulated. Intentions to be active facilitate an active lifestyle and committing to oneself or others further prompted PA. “/… /I take care of dogs and sometimes when I go out, the weather is miserable, but since I’ve promised her, I’ll go out anyway … /…/even if it’s not very tempting to go out when it rains” (Participant 2).

The mixed-method results

The quantitative results were integrated with the qualitative findings to address the third research question: To what extent do the quantitative results on older adults’ PA levels agree and disagree with the qualitative findings on older adults’ PA levels? This revealed both convergence and divergence (see Table  5 ).

This mixed-method study utilized quantitative and qualitative methods to examine the factors influencing older adults’ PA levels. This allowed a greater insight than would be obtained by either dataset separately, as quantitative methods enable statistical generalizability to a greater degree and qualitative methods can provide deeper insights and a more nuanced view of a phenomenon [ 27 ]. The quantitative analysis of standardized questionnaires was used to identify individual and external factors predicting older adults’ PA levels, and the qualitative analysis of semi-structured interviews was used to gain a better understanding of nuances of what older adults perceive as influencing their PA levels. The mixed-method analysis assessed to what extent the quantitative results on older adults’ PA levels agree and disagree with the qualitative findings on older adults’ PA levels. The discussion will follow the two research questions, answered by quantitative and qualitative methods. After that, an integrated discussion will address the third research question.

Predictors of PA levels in older adults

The findings revealed that the best predictor for older adults’ PA levels was their previous PA engagement, followed by their current motivation, health status, and age, which aligns with prior research [ 7 , 9 , 11 , 23 , 24 , 25 , 37 ]. Non-significant variables in the regression analysis were capability, opportunity, subjective SES, and depressive symptoms. Capability changed to a non-significant predictor when health status and age were included, indicating that body-related capacity matters for older adults’ PA levels, which Jancey et al. [ 14 ] also postulated. This indicates that physical- rather than psychological capability matters for older adults’ PA levels. Our results contradict previous research by not identifying opportunity as a significant predictor [ 22 , 23 , 24 , 25 ]. This may be due to multicollinearity and other factors that matter more to older adults’ PA levels. Additionally, the availability of walking groups in their residential areas may have impacted participants’ perceptions of opportunities, influencing the results. Another possibility is range restriction in the participants’ questionnaire answers in this study or that other research studies have used different items to measure the COM-B variables.

Despite previous negative associations between subjective SES with physical inactivity [ 16 ] and mobility issues [ 17 ], subjective SES was not a significant predictor in our study. The reason could be the homogeneity in the sample, with all participants living in rented apartments reporting fairly similar subjective SES.

Many older adults suffer from depression and in Sweden approximately 10 percent of individuals aged 65–74 use antidepressants, increasing to nearly 20 percent for those aged 85–94 [ 38 ]. The prevalence of depression in our sample (potentially 24.4%) is higher than that of Sweden; however, it is lower than the global equivalent (28.4%) [ 39 ]. Depressive symptoms were not a significant predictor of PA levels, although previous research has shown that depressive symptoms may hinder PA engagement [ 18 ]. The discrepancy in findings may be due to different study designs and sample sizes. Our quantitative study included 334 participants in a single measurement, whereas Lindwall et al. [ 18 ] employed a repeated measurement design with a sample of almost 18,000 participants. In summary, early engagement in PA seems to be a precursor for maintaining an active lifestyle later in life. Still, other factors like the current motivation and age-related health status are also important.

What do older adults perceive as influencing their PA levels?

All subcomponents of the COM-B model were identified as relevant to help understand what influences PA levels among older adults, which validates the results from previous reviews [ 22 , 25 ]. As with all models, the COM-B model is a simplified representation of reality, and the subjectivity in data interpretation might account for differences in findings among researchers. Meredith et al. [ 25 ], who reviewed qualitative studies on older adults’ PA participation and mapped their findings to the COM-B model, perceived fear as an individual vulnerability related to capability. However, we interpreted it as an emotion related to automatic motivation. Nonetheless, our findings and Meredith et al. [ 25 ] suggest that fear of falling or losing physical abilities can limit PA.

It is well known that older adults can face challenges in PA due to reduced strength and stamina, especially if the social and physical environments do not support their health condition. Our results show that all the COM-B model’s subcomponents are relevant and have complex interactions. This has also been acknowledged by Meredith et al. [ 25 ], who reported that a greater portion of their findings was related to social and physical opportunities. Other researchers, who did not only include older adults in their samples, have emphasized physical opportunity while downplaying physical capability and social opportunity [ 22 , 24 ]. This suggests that social environment and physical capabilities become more significant later in life and implicates the importance of analyzing age groups separately, as differences can become camouflaged when combined. In summary, the subcomponents of the COM-B model help explain older adults’ engagement or withstanding of PA. The importance of age-disaggregated analysis is also revealed when comparing our results on older adults to those of other researchers that include samples with not only older adults.

Integrative discussion

The quantitative and qualitative results were both convergent and divergent (see Table  5 ). In the quantitative results, opportunity was not a significant predictor, but it was identified as an influential category in the qualitative findings. This discrepancy may be due to how opportunity was measured in the quantitative analysis. The qualitative analysis revealed additional nuances of opportunity that were not assessed in the survey. For example, the qualitative analysis suggested that PA levels are influenced by the existence of PA facilities as well as the physical surroundings that govern how a person can access the facility. Therefore, participants may have responded in the survey that facilities for exercising existed without considering if the physical surroundings allowed them to access them. Similarly, the quantitative survey did not include any item on how the ability of a partner to engage in PA affected the participant's opportunity to be active, which was reported as influential in the qualitative analysis. Nonetheless, this divergence calls for more research regarding the role of opportunity for older adults’ PA levels. Regarding capability, the quantitative analysis only reveals physical capability as core to older adults’ PA levels, while both physical- and psychological capability are identified in the qualitative analysis. Motivation was recognized as central in both analyses. The qualitative findings nuanced the quantitative results by indicating both reflective and automatic motivation as relevant. This life-span perspective concerns a distinction between the two analyses, as previous PA was a statistically significant predictor in the quantitative results but did not surface as a category in the qualitative findings. During the interviews, some participants talked about being active in the past, but they did not associate it with their current PA levels. This divergence between the datasets may be related to the qualitative data collection’s focus on individual experiences, while the quantitative approach emphasizes patterns in large groups.

Comparing the quantitative and qualitative data can improve our understanding of what influences PA in older adults. Our findings show that, as people age, their behavior and cognition change, as does their motivation to engage in PA. Pleasurable, meaningful, and social activities reinforce older adults’ PA positively. However, as shown in this study, aging deteriorates a person’s body and can restrict their current PA levels. In these situations, the surrounding opportunities and the individuals’ knowledge of safely engaging in PA matters. Also, people’s previous experience of PA influences their present behavior. In other words, our study indicates that many factors influence older adults’ PA levels in a complex manner. The COM-B model and its subcomponents seem like a relevant model for understanding older adults’ PA levels. In summary, these findings suggest that applying a life-span perspective and considering the COM-B model's subcomponents can help explain why older adults engage in PA or not.

Implications and practical significance

Stakeholders may promote healthy aging and contribute to the 2030 Agenda’s sustainable developmental goal of health and well-being [ 8 ] by utilizing knowledge of factors influencing PA levels in older adults. It is important to recognize that aging can look very different from one person to another, and that this heterogeneity tends to increase with age, peaking at approximately 70 years for various health characteristics [ 40 ]. Our findings reveal both hetero- and homogeneity among the participants. For example, in the qualitative analysis, they all reported reduced strength and stamina, but the reasons varied from biological to behavioral. The individual differences must be considered, but common features allow for PA promotions for healthy aging.

Firstly, this result implies the importance of prioritizing PA at early life stages, as this positively affects PA levels in older adults. However, this alone will not suffice to increase PA levels in the aging population, as their past cannot be changed. Hence, interventions to target PA in older adults are necessary.

Secondly, interventions to increase PA among older adults should review all subcomponents of the COM-B model. This knowledge is valuable since the COM-B model is the hub of The Behavior Change Wheel (BCW) and can be used to develop and evaluate interventions [ 19 ]. According to the BCW, interventions should be developed systematically, and the first stage is understanding the behavior through the COM-B lens [ 19 ]. This can be considered to have been achieved by the present study and the previous study of Meredith et al. [ 25 ]. In the second stage of intervention development; stakeholders can consult the BCW [ 19 ] while using the results from this study to create interventions that are specific to their settings to promote PA among older adults. For example, our research indicates that in addition to providing PA facilities, successful PA intervention may also require attention to the physical surroundings or the person’s knowledge of how to use the facility.

Limitations, strengths, and future research

The mixed-method design is a strength, as two data sets allow a thorough assessment of what influences older adults’ PA levels. For example, the quantitative analysis identified previous PA as an important influencer to older adults’ PA, which is not emphasized in our qualitative analysis nor by Meredith et al. [ 25 ]. Our results regarding the COM-B model validate Meredith et al. [ 25 ] findings. It can, therefore, be concluded that the COM-B model is useful for understanding older adults’ PA. This is valuable information since the COM-B related results can conveniently be transformed into interventions due to its connection with the framework of (BCW) [ 19 ]. Another strength of this study is that our analysis sheds light on the importance of age-disaggregated research when our findings, related to an older population, are compared to prior studies [ 22 , 24 ] that not only include older adults. Future research should preferably age disaggregate their analysis by the recommended five-year age brackets [ 41 ].

A limitation concerns the findings' generalizability or transferability since the drop-out rate was rather large in the quantitative sample from which the qualitative sample was recruited. Additionally, not everyone invited to the qualitative study consented to participate, meaning that those who chose to participate in this mixed-method study may differ from non-participants. However, the sample size in both datasets was relatively sizable, which boosts the statistical power of the quantitative analysis and nuances in the verbal statements in the qualitative analysis. The living conditions of the older adults, included in our study, may differ slightly from other samples used in prior research, challenging result comparisons. Future studies are recommended to include a more diverse population of older adults. Additionally, our sampling resulted in different mode ages in the two samples, which is a limitation as it may mirror different realities among older adults, thereby potentially influencing the result integration. Another weakness within the quantitative dataset concerns the precision of measurement, multicollinearity, and range restriction for the included variables in the regression analysis. A potential limitation of the qualitative study was the lack of member checks. However, to compensate for this, the participants were invited to contact the researchers with adjustments or additional comments on their interviews. To further ensure credibility and trustworthiness, all interviews were conducted within two months. Additionally, multiple interviewers and coders from different disciplines (i.e., investigator triangulation) helped minimize research bias, enhancing the qualitative analysis. Some talkative participants occasionally strayed off-topic, which may have influenced the data collection and the qualitative findings.

The qualitative analysis indicated the importance of PA as a pleasurable activity and previous research has associated subjective well-being [ 42 ] and morale [ 43 ] with older adults’ PA. However, the quantitative analysis did not include these emotional aspects as variables. We suggest that future studies review and clarify the value of positive and negative emotions for older adults’ PA levels. Additionally, longitudinal experimental study designs are needed to clarify the role of physical- and social opportunities for older adults’ PA.

To our knowledge, this study is the first to use a convergent mixed-method design to examine factors influencing PA levels in older adults aged 65 and above who rent apartments from the municipality with access to community hosts, providing a more comprehensive understanding of the topic. It seems that many factors influence older adults’ PA levels in a complex manner, with the integrated result showing convergence regarding motivation and physical capability but divergence in psychological capability, opportunity, and previous PA engagement. The findings also indicate relevance to the COM-B model as a framework for understanding older adults’ PA levels. Overall, we suggest that it is important to consider all the COM-B model’s subcomponents when designing a PA program for older adults and to apply a life-span perspective, as previous PA engagement seems to influence the current level of PA in older adults. However, it is also central to consider their current motivation, capability, and opportunities to understand what influences their PA levels. More research is needed to clarify the role of emotions and opportunities for older adults’ PA levels since the findings are inconsistent. Furthermore, the value of age-disaggregated data is revealed when our findings from samples of only older adults are compared to previous research that does not only include older people.

From a public health perspective, prioritizing PA early in life appears important, as this can positively impact older adults’ PA engagement. Based on our findings, we would make the following recommendations for promoting PA among older adults. Since the findings can be related to the intervention framework of BCW, stakeholders are encouraged to use these results while also seeking further guidance from the BCW to design interventions to improve PA levels and promote healthier aging among older adults. For instance, our findings suggest that it is important to consider the targeted population's physical abilities and offer appropriate options for their health condition when designing an intervention. Also, to alleviate the fear of injury that can hinder motivation for PA, it is central to address older adults’ concerns and provide them with the necessary knowledge to engage in PA safely. The findings also indicate the importance of PA to be fun, playful, and meaningful. This knowledge can be used to frame and present PA options to participants to motivate PA engagement. Challenging age stereotypes and emphasizing that it is never too late to start exercising appears also important. Additionally, to ensure the success of PA interventions, it is also important to consider the physical surroundings and social settings at macro and meso levels. For example, long walking distances with no resting spots or a partner's physical ability may prevent PA engagement. Addressing such issues can help individuals to partake in activities without limitations.

Availability of data and materials

The datasets used and analyzed during the current study are available from the corresponding author upon reasonable request.

Abbreviations

  • Physical activity

Subjective socioeconomic status

The Behavior Change Wheel

Eckstrom E, Neukam S, Kalin L, Wright J. Physical activity and healthy aging. Clin Geriatr Med. 2020;36(4):671–83.

Article   PubMed   Google Scholar  

Moreno-Agostino D, Daskalopoulou C, Wu Y-T, Koukounari A, Haro JM, Tyrovolas S, et al. The impact of physical activity on healthy ageing trajectories: Evidence from eight cohort studies. Int J Behav Nutr Phys Act. 2020;17(1):92.

WHO. WHO guidelines on physical activity and sedentary behaviour 2020 [Available from: https://iris.who.int/bitstream/handle/10665/336656/9789240015128-eng.pdf?sequence=1andisAllowed=y .

WHO. Global recommendations on physical activity for health 2010 [Available from: https://iris.who.int/bitstream/handle/10665/44399/9789241599979_eng.pdf?sequence=1&isAllowed=y .

Wang H, King B. Understanding community-dwelling Chinese older adults’ engagement in physical activity: a grounded theory study. Gerontologist. 2022;62(3):342–51.

Public Health Agency of Sweden. Riktlinjer för fysisk aktivitet och stillasittande: Kunskapsstöd för främjande av fysisk aktivitet och minskat stillasittande 2021 [Available from: https://www.folkhalsomyndigheten.se/contentassets/106a679e1f6047eca88262bfdcbeb145/riktlinjer-fysisk-aktivitet-stillasittande.pdf .

Sun F, Norman IJ, While AE. Physical activity in older people: a systematic review. BMC Public Health. 2013;13(1):449.

UN General Assembly. Transforming our world: The 2030 agenda for sustainable development 2015 [Available from: https://sustainabledevelopment.un.org/post2015/transformingourworld/publication .

Castillo JMD, Navarro JEJ-B, Sanz JLG, Rodríguez MM, Izquierdo AC, Pinés DDH. Being physically active in old age: Relationships with being active earlier in life, social status and agents of socialization. Ageing and Society. 2010;30(7):1097–113.

Article   Google Scholar  

Barbaccia V, Bravi L, Murmura F, Savelli E, Viganò E. Mature and older adults’ perception of active ageing and the need for supporting services: Insights from a qualitative study. Int J Environ Res Public Health. 2022;19(13):7660.

Sansano-Nadal O, Giné-Garriga M, Rodríguez-Roca B, Guerra-Balic M, Ferri K, Wilson JJ, et al. Association of self-reported and device-measured sedentary behaviour and physical activity with health-related quality of life among european older adults. Int J Environ Res Public Health. 2021;18(24):13252.

Olivares PR, Gusi N, Prieto J, Hernandez-Mocholi MA. Fitness and health-related quality of life dimensions in community-dwelling middle aged and older adults. Health Qual Life Outcomes. 2011;9:117.

Huffman MK, Amireault S. What keeps them going, and what gets them back? Older adults’ beliefs about physical activity maintenance. Gerontologist. 2021;61(3):392–402.

Jancey JM, Clarke A, Howat P, Maycock B, Lee AH. Perceptions of physical activity by older adults: A qualitative study. Health Educ J. 2009;68(3):196–206.

Janssen SL, Stube JE. Older adults’ perceptions of physical activity: a qualitative study. Occup Ther Int. 2014;21(2):53–62.

Shankar A, McMunn A, Steptoe A. Health-related behaviors in older adults relationships with socioeconomic status. Am J Prev Med. 2010;38(1):39–46.

Hu P, Adler NE, Goldman N, Weinstein M, Seeman TE. Relationship between subjective social status and measures of health in older Taiwanese persons. J Am Geriatr Soc. 2005;53(3):483–8.

Lindwall M, Larsman P, Hagger MS. The reciprocal relationship between physical activity and depression in older European adults: a prospective cross-lagged panel design using SHARE data. Health Psychol. 2011;30(4):453–62.

Michie S, Atkins L, West R. The behaviour change wheel: A guide to designing interventions. Sutton: Silverback Publishing; 2014.

Campbell M, Fitzpatrick R, Haines A, Kinmonth AL, Sandercock P, Spiegelhalter D, et al. Framework for design and evaluation of complex interventions to improve health. BMJ. 2000;321(7262):694–6.

Article   CAS   PubMed   PubMed Central   Google Scholar  

Michie S, van Stralen MM, West R. The behaviour change wheel: A new method for characterising and designing behaviour change interventions. Implementation Science. 2011;6(1):42.

Knight RL, McNarry MA, Sheeran L, Runacres AW, Thatcher R, Shelley J, et al. Moving forward: Understanding correlates of physical activity and sedentary behaviour during covid-19—an integrative review and socioecological approach. Int J Environ Res Public Health. 2021;18(20):10910.

Roche C, Fisher A, Fancourt D, Burton A. Exploring barriers and facilitators to physical activity during the COVID-19 pandemic: A qualitative study. Int J Environ Res Public Health. 2022;19(15):9169.

Pelletier C, Cornish K, Amyot T, Pousette A, Fox G, Snadden D, et al. Physical activity promotion in rural health care settings: a rapid realist review. Prevent Med Rep. 2022;29:29.

Google Scholar  

Meredith SJ, Cox NJ, Ibrahim K, Higson J, McNiff J, Mitchell S, et al. Factors that influence older adults' participation in physical activity: A systematic review of qualitative studies. Age and Ageing. 2023;52(8).

Creswell JW, Plano Clark VL. Designing and conducting mixed methods research. 3 ed. Thousand Oaks, CA: SAGE; 2017.

Cohen L, Manion L, Morrison K. Research methods in education. 6: ed. Routledge; 2007.

Book   Google Scholar  

Statistics Sweden. Boende i Sverige 2023 [Available from: https://www.scb.se/hitta-statistik/sverige-i-siffror/manniskorna-i-sverige/boende-i-sverige/ .

Godin G. The godin-shephard leisure-time physical activity questionnaire. Health Fitness J Can. 2011;4(1):18–22.

Gottfries GG, Noltorp S, Nørgaard N. Experience with a Swedish version of the geriatric depression scale in primary care centres. Int J Geriatr Psychiatry. 1997;12(10):1029–34.

Article   CAS   PubMed   Google Scholar  

Bäccman C, Bergkvist L, Wästlund E. Personalized coaching via texting for behavior change to understand a healthy lifestyle intervention in a naturalistic setting: Mixed methods study. JMIR Format Res. 2023;7:e47312.

EuroQol Research Foundation. EQ-5D-3L User Guide: Basic information on how to use the EQ-5D-3L instrument. Version 6.0.; 2018.

Burström K, Sun S, Gerdtham U-G, Henriksson M, Johannesson M, Levin L-Å, et al. Swedish experience-based value sets for EQ-5D health states. Qual Life Res. 2014;23(2):431–42.

Adler NE, Epel ES, Castellazzo G, Ickovics JR. Relationship of subjective and objective social status with psychological and physiological functioning: reliminary data in healthy. White Women Health Psychol. 2000;19(6):586–92.

Hsieh H-F, Shannon SE. Three approaches to qualitative content analysis. Qual Health Res. 2005;15(9):1277–88.

Graneheim UH, Lundman B. Qualitative content analysis in nursing research: concepts, procedures and measures to achieve trustworthiness. Nurse Educ Today. 2004;24(2):105–12.

Zhang J, Bloom I, Dennison EM, Ward KA, Robinson SM, Barker M, et al. Understanding influences on physical activity participation by older adults: A qualitative study of community-dwelling older adults from the Hertfordshire Cohort Study, UK. PLoS ONE. 2022;17(1):e0263050.

The National Board of Health and Welfare. Kartläggning och analys av föreskrivning av antidepressiva läkemedel till personer 65 år och äldre 2023 [Available from: https://www.socialstyrelsen.se/globalassets/sharepoint-dokument/artikelkatalog/ovrigt/2023-11-8828.pdf .

Hu T, Zhao X, Wu M, Li Z, Luo L, Yang C, et al. Prevalence of depression in older adults: A systematic review and meta-analysis. Psychiatry Res. 2022;311:114511.

Nguyen QD, Moodie EM, Forget M, Desmarais P, Keezer MR, Wolfson C. Health heterogeneity in older adults: exploration in the Canadian longitudinal study on aging. J Am Geriatr Soc. 2021;69(3):678–87.

WHO. UN decade of healthy ageing: Plan of action (2021–2030) 2020 [Available from: https://cdn.who.int/media/docs/default-source/decade-of-healthy-ageing/decade-proposal-final-apr2020-en.pdf .

Chen S, Calderón-Larrañaga A, Saadeh M, Dohrn I-M, Welmer A-K. Correlations of subjective and social well-being with sedentary behavior and physical activity in older adults-A population-Based study. J Gerontol A Biol Sci Med Sci. 2021;76(10):1789–95.

Article   PubMed   PubMed Central   Google Scholar  

Almevall AD, Wennberg P, Zingmark K, Öhlin J, Söderberg S, Olofsson B, et al. Associations between everyday physical activity and morale in older adults. Geriatr Nurs. 2022;48:37–42.

Download references

Acknowledgements

We thank the Community hosts for their assistance during the quantitative and qualitative data collection.

Open access funding provided by Karlstad University. Our work was supported by Grant No. 20210102 from the Kamprad Family Foundation for Entrepreneurship, Research & Charity.

Author information

Authors and affiliations.

Service Research Center (CTF), Karlstad University, Karlstad, Sweden

Anna Nilstomt, Charlotte Bäccman & Erik Wästlund

Department of Political, Historical, Religious and Cultural Studies, Karlstad University, Karlstad, Sweden

Johanna Gustavsson & Finn Nilson

Center for Societal Risk Research, Karlstad University, Karlstad, Sweden

Department of Public Health, Karlstad University, Karlstad, Sweden

Linda Beckman

Department of Health Services Research, Management & Policy, University of Florida, Gainesville, FL, USA

Department of Educational Studies, Karlstad University, Karlstad, Sweden

Stefan Wagnsson

You can also search for this author in PubMed   Google Scholar

Contributions

AN made substantial contributions to the design of the work, the acquisition, analysis, and interpretation of data, and has drafted and revised the work. JG made substantial contributions to the design of the work, the acquisition, analysis, and interpretation of data, and has drafted and revised the work. LB made substantial contributions to the design of the work, the acquisition, analysis, and interpretation of data, and has drafted and revised the work. CB made substantial contributions to the design of the work, the analysis and interpretation of data and has drafted and revised the work. FN made substantial contributions to the design of the work and the analysis and interpretation of data and has drafted and revised the work. SW made substantial contributions to the design of the work and the data acquisition and has drafted and revised the work. EW made substantial contributions to the design of the work, the acquisition, analysis, and interpretation of data, and has drafted and revised the work. All authors have read and approved the final manuscript.

Authors’ information

AN is a licensed psychologist and a PhD student researching the maintenance of behavior with a focus on PA among older adults. JG is a senior researcher in public health with expertise in injury prevention for older adults and is also a licensed nurse. LB is a senior researcher in public health with expertise in young and older people’s mental health. CB is a senior researcher in psychology with expertise in health and well-being, behavior change, and digitalization. FN is a professor in risk management and a licensed physical therapist. SW is a senior researcher in sport science with expertise in PA and motivation. EW is a senior researcher in psychology with expertise in decision-making, behavior change, and digitalization.

Corresponding author

Correspondence to Anna Nilstomt .

Ethics declarations

Ethics approval and consent to participate.

The participants gave their informed consent before participating in the study. For the quantitative study, participants provided consent by filling out the questionnaire. For the qualitative study, participants provided written consent. The Swedish Ethical Authority approved this study (No. 2020–00950 and No. 2021–05133).

Consent for publication

Not applicable.

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note.

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ .

Reprints and permissions

About this article

Cite this article.

Nilstomt, A., Gustavsson, J., Beckman, L. et al. Physical activity from the perspective of older adults: a convergent mixed-method study. BMC Geriatr 24 , 768 (2024). https://doi.org/10.1186/s12877-024-05362-x

Download citation

Received : 05 April 2024

Accepted : 06 September 2024

Published : 18 September 2024

DOI : https://doi.org/10.1186/s12877-024-05362-x

Share this article

Anyone you share the following link with will be able to read this content:

Sorry, a shareable link is not currently available for this article.

Provided by the Springer Nature SharedIt content-sharing initiative

  • Active aging
  • Community-dwelling older adults
  • COM-B model
  • Convergent mixed-methods
  • SDG 3: Good health and well-being

BMC Geriatrics

ISSN: 1471-2318

research methodology and data analysis

COMMENTS

  1. Data Analysis in Research: Types & Methods

    Methods used for data analysis in qualitative research. There are several techniques to analyze the data in qualitative research, but here are some commonly used methods, Content Analysis: It is widely accepted and the most frequently employed technique for data analysis in research methodology. It can be used to analyze the documented ...

  2. Data Analysis

    Data Analysis. Definition: Data analysis refers to the process of inspecting, cleaning, transforming, and modeling data with the goal of discovering useful information, drawing conclusions, and supporting decision-making. It involves applying various statistical and computational techniques to interpret and derive insights from large datasets.

  3. Research Methods

    For quantitative data, you can use statistical analysis methods to test relationships between variables. For qualitative data, you can use methods such as thematic analysis to interpret patterns and meanings in the data. Note AI tools can be useful when brainstorming research method ideas and designing certain data collection methods. However ...

  4. Data Analysis Techniques in Research

    Data Analysis Methods. Data analysis methods refer to the techniques and procedures used to analyze, interpret, and draw conclusions from data. These methods are essential for transforming raw data into meaningful insights, facilitating decision-making processes, and driving strategies across various fields. Here are some common data analysis ...

  5. A practical guide to data analysis in general literature reviews

    This article is a practical guide to conducting data analysis in general literature reviews. The general literature review is a synthesis and analysis of published research on a relevant clinical issue, and is a common format for academic theses at the bachelor's and master's levels in nursing, physiotherapy, occupational therapy, public health and other related fields.

  6. Introduction to Research Statistical Analysis: An Overview of the

    Introduction. Statistical analysis is necessary for any research project seeking to make quantitative conclusions. The following is a primer for research-based statistical analysis. It is intended to be a high-level overview of appropriate statistical testing, while not diving too deep into any specific methodology.

  7. The Beginner's Guide to Statistical Analysis

    This article is a practical introduction to statistical analysis for students and researchers. We'll walk you through the steps using two research examples. The first investigates a potential cause-and-effect relationship, while the second investigates a potential correlation between variables. Example: Causal research question.

  8. Research Methods Guide: Data Analysis

    Data Analysis and Presentation Techniques that Apply to both Survey and Interview Research. Create a documentation of the data and the process of data collection. Analyze the data rather than just describing it - use it to tell a story that focuses on answering the research question. Use charts or tables to help the reader understand the data ...

  9. What Is a Research Design

    A research design is a strategy for answering your research question using empirical data. Creating a research design means making decisions about: Your overall research objectives and approach. Whether you'll rely on primary research or secondary research. Your sampling methods or criteria for selecting subjects. Your data collection methods.

  10. Sage Research Methods

    This is a book that will rapidly be recognized as the bible for social researchers. It provides a first-class, reliable guide to the basic issues in data analysis, such as the construction of variables, the characterization of distributions and the notions of inference. Scholars and students can turn to it for teaching and applied needs with ...

  11. Research Methodology and Data Analysis Second Edition

    The book also guides in research methodology such as the methods of designing a questionnaire, methods of sampling, methods of data collection and methods of data analysis. The data analysis covers data mining, descriptive analysis, factor analysis, and reliability analysis. Besides this, the book assesses the normality distribution of data ...

  12. Data analysis

    data analysis, the process of systematically collecting, cleaning, transforming, describing, modeling, and interpreting data, generally employing statistical techniques. Data analysis is an important part of both scientific research and business, where demand has grown in recent years for data-driven decision making.

  13. What is Data Analysis? (Types, Methods, and Tools)

    Couchbase Product Marketing. December 17, 2023. Data analysis is the process of cleaning, transforming, and interpreting data to uncover insights, patterns, and trends. It plays a crucial role in decision making, problem solving, and driving innovation across various domains. In addition to further exploring the role data analysis plays this ...

  14. Basic statistical tools in research and data analysis

    Statistical methods involved in carrying out a study include planning, designing, collecting data, analysing, drawing meaningful interpretation and reporting of the research findings. The statistical analysis gives meaning to the meaningless numbers, thereby breathing life into a lifeless data. The results and inferences are precise only if ...

  15. Research Methodology

    Qualitative Research Methodology. This is a research methodology that involves the collection and analysis of non-numerical data such as words, images, and observations. This type of research is often used to explore complex phenomena, to gain an in-depth understanding of a particular topic, and to generate hypotheses.

  16. LibGuides: Research Methods: Data Analysis & Interpretation

    For more information on data analysis look at Sage Research Methods database on the library website. *Creswell, John W.(2009) Research design: qualitative, and mixed methods approaches. Sage, Los Angeles, pp 183 **Bergin, T (2018), Data analysis: quantitative, qualitative and mixed methods. Sage, Los Angeles, pp182

  17. Understanding Research

    SOC 211: Research Methods and Data Analysis. Understanding Research. The Research Process; Literature Review; Mind Mapping; Writing Guide; Search Techniques; Finding Books & Background Information; Finding Articles; Statistics and Data; Citing Sources; Social Sciences Librarian. Elizabeth Young-Miller

  18. (PDF) Different Types of Data Analysis; Data Analysis Methods and

    Data analysis is simply the process of converting the gathered data to meanin gf ul information. Different techniques such as modeling to reach trends, relatio nships, and therefore conclusions to ...

  19. Qualitative vs Quantitative Research Methods & Data Analysis

    Olivia Guy-Evans, MSc. The main difference between quantitative and qualitative research is the type of data they collect and analyze. Quantitative data is information about quantities, and therefore numbers, and qualitative data is descriptive, and regards phenomenon which can be observed but not measured, such as language.

  20. The 7 Most Useful Data Analysis Methods and Techniques

    Often, qualitative analysis will organize the data into themes—a process which, fortunately, can be automated. Data analysts work with both quantitative and qualitative data, so it's important to be familiar with a variety of analysis methods. Let's take a look at some of the most useful techniques now. 3. Data analysis techniques

  21. Qualitative Research: Data Collection, Analysis, and Management

    There should be a section on the chosen methodology and a brief discussion about why qualitative methodology was most appropriate for the study question and why one particular methodology (e.g., interpretative phenomenological analysis rather than grounded theory) was selected to guide the research. The method itself should then be described ...

  22. Qualitative Data Analysis

    This method of qualitative data analysis starts with an analysis of a single case to formulate a theory. Then, additional cases are examined to see if they contribute to the theory. Qualitative data analysis can be conducted through the following three steps: Step 1: Developing and Applying Codes. Coding can be explained as categorization of data.

  23. Data Analysis

    Data analysis methods in the absence of primary data collection can involve discussing common patterns, as well as, controversies within secondary data directly related to the research area. My e-book, The Ultimate Guide to Writing a Dissertation in Business Studies: a step by step assistance offers practical assistance to complete a ...

  24. Qualitative research

    Qualitative research is a type of research that aims to gather and analyse non-numerical (descriptive) data in order to gain an understanding of individuals' social reality, including understanding their attitudes, beliefs, and motivation.This type of research typically involves in-depth interviews, focus groups, or field observations in order to collect data that is rich in detail and context.

  25. Introduction to Research Methods and Analysis

    Learn Introduction to Research Methods and Analysis at Walden University. Explore HLTH4205 course description and additional information. ... and statistical techniques for analyzing and interpreting data. Related Programs. BS in Health Studies; Please Select a Start Date. Please Select a Start Date. Oct 07, 2024. Nov 25, 2024. Jan 06, 2025 ...

  26. Market research and competitive analysis

    Here are a few methods you can use to do direct research: Surveys; Questionnaires; Focus groups; In-depth interviews; For guidance on deciding which methods are worthwhile for your small business, the U.S. Small Business Administration (SBA) provides counseling services through our resource partner network.

  27. Are you in the U.S. middle class? Try our income calculator

    About half of U.S. adults (52%) lived in middle-income households in 2022, according to a Pew Research Center analysis of the most recent available government data. Roughly three-in-ten (28%) were in lower-income households and 19% were in upper-income households. Our calculator below, updated with ...

  28. Global burden of 288 causes of death and life expectancy decomposition

    Methods. Read cause-specific methods appendices. The Global Burden of Diseases, Injuries, and Risk Factors Study (GBD) 2021 cause-of-death analysis estimated mortality and years of life lost (YLLs) from 288 causes of death by age-sex-location-year in 204 countries and territories and 811 subnational locations for each year from 1990 until 2021.

  29. A Combined OCBA-AIC Method for Stochastic Variable Selection in Data

    This study introduces a novel approach to enhance variable selection in Data Envelopment Analysis (DEA), especially in stochastic environments where efficiency estimation is inherently complex. To address these challenges, we propose a game cross-DEA model to refine efficiency estimation. Additionally, we integrate the Akaike Information Criterion (AIC) with the Optimal Computing Budget ...

  30. Physical activity from the perspective of older adults: a convergent

    A pragmatic approach was applied, as this allows using both quantitative and qualitative research methods to collect the data needed to address the study's research questions . The National Research Ethics Committee approved the study before recruitment. The aim was operationalized into three research questions: 1.