:
by Darrell Huff. favorite deceptions. For example, you might choose as one of your favorite deceptions the hypothetical real estate agent’s deceptive use of a neighborhood’s “average” income in Chapter 2. favorite deceptions to other people. ). . . and make a new Discussion Board post.:
.” .” Seven Sins of Statistical Misinterpretation” to scientific articles you have read. that you found and read in Unit #5 and that you synthesized in Unit #6. .” . _PSY-225_Gernsbacher_StatsCheck_Fillable.pdf. In other words, add your last name to the beginning of the filename. you open the unfilled PDF from your computer. and attach your filled-in PDF.:
.” .” .” of the following five topics for which Gallup has conducted a public opinion poll. Then, within each of the three topics you’ve chosen, read of the listed reports. ” ” ” ” ” ” ” ” ” ” ” ” ” ” ” ” ” ” ” ” and make a new post of at least in which you provide the following information for you chose to read (three topics x one report per topic). It will be easiest if you write a separate paragraph for .:
and read all the other students’ posts. other students; each of your three responses should be . that you also wrote about. . (besides the two students you responded to in 1. and 2. above).:
. If you are in a Chat Group with two other students, that means you will read four essays; if you are in a Chat Group with only one other student, that means you will read two essays. . Note that you will again be answering 12 questions about each member’s essays. text-based Chat. . ” images. More than one Chat Group member can indicate the same image if that’s how they are feeling, and please refer to each image by its number , , that summarizes your Group Chat in . , save the Chat transcript, as described in the (under the topic, “How To Save and Attach a Chat Transcript”), and attach the Chat transcript, in PDF, to a post on the . , that states the name of the assignment (Unit 7: Assignment #6), the full name of your Chat Group, the first and last names of each Chat Group member who participated in the Group Chat, the day (e.g., Sunday) and date of this Group Chat (e.g., June 13), the start and stop time of this Group Chat (e.g., 1pm to 2pm). ” images. More than one Chat Group member can indicate the same image if that’s how they are feeling, and please refer to each image by its number. : The “How Are You Feeling at the of Today’s Group Chat” grid of images differs from the “How Are You Feeling at the of Today’s Group Chat” grid of images.Congratulations, you have finished Unit 7! Onward to !
Table of contents.
Professionals across diverse fields such as science, mathematics, marketing, and technology rely on statistics to extract meaningful insights and draw conclusions from extensive datasets. The study of statistics is divided into two primary branches: descriptive and inferential statistics. These branches play distinct roles in uncovering various facets of data, enabling professionals to make informed decisions based on analytical findings.
Understanding different types of statistics, such as descriptive and inferential statistics, is crucial for developing a robust understanding of data management and choosing the most suitable analytical approaches. While certain measurement techniques may share similarities, the fundamental objectives of descriptive and inferential statistics diverge significantly. This blog explores the contrasting features of descriptive and inferential statistics, highlighting their unique roles and impacts in data analysis. By distinguishing between these statistical methods, professionals can effectively leverage them to extract valuable insights from complex datasets.
*geeksforgeeks.org
Descriptive statistics is a field of statistics focused on summarizing and explaining the key aspects of a dataset. It involves techniques for organizing, visualizing, and presenting data in a meaningful and straightforward manner. Descriptive statistics characterize the properties of the dataset being analyzed without making broader assumptions beyond the data itself.
Here are some different types of method for descriptive statistics:
1. Central Tendency: Central tendency is a method of summarizing data by identifying a central value to which all other data points are related. This central value can be represented by the median or the mean. In a normal distribution, these central measures often coincide.
2. Dispersion: Dispersion refers to how spread out the data points are within a dataset. Statisticians commonly use metrics such as variance and standard deviation to quantify the extent of spread among the dataset’s points.
3. Skewness: Skewness in statistics characterizes the distribution shape of a dataset when plotted. Normally distributed datasets exhibit little to no skew, with data points evenly distributed around the central point. In contrast, other datasets may show strong skewness either to the left or right of the center, depending on where the majority of data points are concentrated.
In contrast, inferential statistics involves drawing conclusions, making predictions, or forming generalizations about a larger population based on data collected from a representative sample of that population. It extends the insights gained from a sample to the entire population from which the sample was taken. Inferential statistics enable researchers to reach conclusions, test hypotheses, and predict outcomes for populations, even when it is impractical or impossible to study the entire population directly.
Below are important inferential methods commonly used in statistical analysis:
1. Hypothesis Testing: Hypothesis testing involves a statistician formulating a hypothesis about a population or sample and then collecting data from sample groups to test this hypothesis.
2. Regression Analysis: Regression analysis utilizes a dataset with confirmed data points to estimate the relationship between two variables, such as height and weight. This analysis enables statisticians and other professionals to make predictions for values beyond the measured data range.
3. Confidence Intervals: In research, statisticians often establish confidence intervals, which represent ranges of certain key values associated with specific probabilities. These intervals provide insights into the precision and reliability of statistical estimates.
Descriptive statistics examples.
For instance, Imagine a company wants to analyze the performance of its sales team over the past year. They gather data on monthly sales figures for each salesperson. To understand the distribution of sales and identify key insights, they use descriptive statistics. Here are the steps company need to follow for the analysis:
1. Measures of Central Tendency: The company calculates the mean (average) monthly sales for the entire team. They find that the mean sales per month is $50,000, indicating the typical performance level.
2. Measures of Dispersion: To understand how sales vary among team members, they calculate the standard deviation of monthly sales. They discover that the standard deviation is $15,000, suggesting a moderate level of variation in sales performance across the team.
3. Visualization: Using histograms or box plots, they visually represent the distribution of monthly sales. This visualization highlights whether sales data are symmetrically distributed or skewed towards certain values.
4. Percentiles: They calculate the 25th, 50th (median), and 75th percentiles of monthly sales. This helps identify sales levels at which specific percentages of salespeople perform, such as the median sales or the top quartile of performers.
5. Correlation Analysis: They explore correlations between sales performance and other factors like tenure, training hours, or geographic region. This analysis reveals if any factors are associated with higher or lower sales figures.
Suppose a company is interested in understanding whether coffee consumption affects employee productivity. They decide to conduct a study where they gather data on coffee consumption and corresponding productivity levels among employees. Here is how step by step analysis process will look like:
Descriptive statistics serve the purpose of summarizing and describing the characteristics of a dataset that is directly available to the statistician. This includes providing measures like central tendency (e.g., mean, median), dispersion (e.g., variance, standard deviation), and graphical representations (e.g., histograms, box plots) that depict the distribution of the data. For instance, in the scenario of collecting birth weight data from a hospital, descriptive statistics would be used to calculate the average birth weight of the children born within that specific hospital during a year.
On the other hand, inferential statistics are employed to make inferences or generalizations about a larger population based on a sample of data. The purpose of inferential statistics is to draw conclusions that extend beyond the specific dataset being analyzed. Using inferential methods, a statistician can estimate parameters, test hypotheses, and make predictions about the population from which the sample was drawn. In the given example, inferential statistics would be applied to estimate or predict the average birth weight of children nationwide, based on the sample data collected from the hospital. This allows for broader insights and conclusions to be drawn beyond the immediate dataset.
Descriptive statistics involves analyzing data from a complete set, where every data point is collected and summarized. For instance, when determining the average height of students in a class, each student’s height is measured, providing a comprehensive view of that specific group.
The methodology of inferential statistics focuses on making broader conclusions or predictions about a larger population based on a representative sample. In this methodology, the statistician selects a sample that is representative of the entire population of interest. For example, if the teacher aims to infer the average height of all students in the school, the class data alone may not suffice as a representative sample. To utilize inferential statistics effectively, the teacher would need to construct a sample that includes students from different year levels across the school to ensure accurate representation of the overall student body.
The precision of calculating descriptive statistics relies on the availability of complete data for the analysis. When using descriptive statistical methods, such as calculating the mean test score of a class, statisticians can achieve a better accuracy and precision because they have access to all necessary data points within the defined group or sample. For instance, a teacher calculating the exact average test score from a class of 20 students can achieve a precise measurement as they have all individual scores at their disposal. Descriptive statistics excel in precision when the entire dataset is known and can be directly analyzed, ensuring accurate measurements and insights into the specific group or sample.
The precision of inferential statistics is inherently influenced by the sampling process and the potential for variability and error when making predictions about a larger population based on a sample subset. Inferential statistics involve drawing conclusions or making estimations about a broader population from a representative sample. For example, using the average test score of a class to estimate the average test score of all classes taking the same test introduces uncertainty due to potential differences between the sample and the entire population. While inferential statistics provide valuable insights beyond the observed data, they carry the risk of error and require careful consideration of sampling methods and assumptions to ensure the reliability and accuracy of the inferences made.
Descriptive and inferential statistics, although distinct in their purposes and approaches, exhibit some similarities:
1. Data Utilization: Both descriptive and inferential statistics utilize the same dataset. Descriptive statistics summarize this data, whereas inferential statistics use it to draw broader conclusions about a larger population.
2. Statistical Measures : They commonly utilize similar statistical measures like mean and standard deviation to describe datasets or infer information about populations based on samples.
3. Graphical Representations: Both types of statistics can employ graphical representations like histograms, box plots, and scatterplots to visually represent data trends and patterns.
4. Summary Statistics: Summary statistics are essential in both descriptive and inferential statistics to provide a concise overview of the data, including measures of central tendency and dispersion.
5. Analysis Foundation: Descriptive statistics form the basis for inferential statistics. Properly summarizing and understanding sample data is crucial before making accurate inferences about the broader population.
Descriptive and inferential statistics are important categories in statistics. Descriptive statistics focus on summarizing data to reveal its characteristics and patterns. Meanwhile, inferential statistics use sample data to make predictions and draw conclusions about a larger population.
Both types of statistics are essential for data analysis, complementing each other to provide a complete understanding of datasets. This blog has explained these concepts clearly, highlighting their differences with practical examples. Understanding descriptive and inferential statistics helps analysts and researchers make informed decisions in their work across different fields.
Fill The Form To Get More Information
Professional certificate programme in data science for business decisions – iim kozhikode, post graduate certificate programme in data science for business excellence and innovation – iim nagpur, advanced data science certificate program – rotman school of management (uoft) & iitm pravartak technology innovation hub of iit madras, recent blogs.
Trending blogs.
Save my name, email, and website in this browser for the next time I comment.
Copyright © 2022 Jaro Education. All rights reserved.
Want to create or adapt books like this? Learn more about how Pressbooks supports open publishing practices.
Learning objectives.
In the previous section, we looked at some of the research designs psychologists use. In this section, we will provide an overview of some of the statistical approaches researchers take to understanding the results that are obtained in research. Descriptive statistics are the first step in understanding how to interpret the data you have collected. They are called descriptive because they organize and summarize some important properties of the data set. Keep in mind that researchers are often collecting data from hundreds of participants; descriptive statistics allow them to make some basic interpretations about the results without having to eyeball each result individually.
Let’s work through a hypothetical example to show how descriptive statistics help researchers to understand their data. Let’s assume that we have asked 40 people to report how many hours of moderate-to-vigorous physical activity they get each week. Let’s begin by constructing a frequency distribution of our hypothetical data that will show quickly and graphically what scores we have obtained.
1 | 7 |
2 | 6 |
6 | 5 |
8 | 4 |
7 | 3 |
8 | 2 |
5 | 1 |
3 | 0 |
We can now construct a histogram that will show the same thing on a graph (see Figure 2.5 ). Note how easy it is to see the shape of the frequency distribution of scores.
Many variables that psychologists are interested in have distributions where most of the scores are located near the centre of the distribution, the distribution is symmetrical, and it is bell-shaped (see Figure 2.6 ). A data distribution that is shaped like a bell is known as a normal distribution . Normal distributions are common in human traits because of the tendency for variability; traits like intelligence, wealth, shoe size, and so on, are distributed such that relatively few people are either extremely high or low scorers, and most people fall somewhere near the middle.
A distribution can be described in terms of its central tendency — that is, the point in the distribution around which the data are centred — and its dispersion or spread . The arithmetic average, or arithmetic mean , symbolized by the letter M , is the most commonly used measure of central tendency. It is computed by calculating the sum of all the scores of the variable and dividing this sum by the number of participants in the distribution, denoted by the letter N . In the data presented in Figure 2.6, the mean height of the students is 67.12 inches (170.48 cm). The sample mean is usually indicated by the letter M .
In some cases, however, the data distribution is not symmetrical. This occurs when there are one or more extreme scores, known as outliers , at one end of the distribution. Consider, for instance, the variable of family income (see Figure 2.7 ), which includes an outlier at a value of $3,800,000. In this case, the mean is not a good measure of central tendency. Although it appears from Figure 2.7 that the central tendency of the family income variable should be around $70,000, the mean family income is actually $223,960. The single very extreme income has a disproportionate impact on the mean, resulting in a value that does not well represent the central tendency.
The median is used as an alternative measure of central tendency when distributions are not symmetrical. The median is the score in the centre of the distribution, meaning that 50% of the scores are greater than the median and 50% of the scores are less than the median. In our case, the median household income of $73,000 is a much better indication of central tendency than is the mean household income of $223,960.
A final measure of central tendency, known as the mode , represents the value that occurs most frequently in the distribution. You can see from Figure 2.7 that the mode for the family income variable is $93,000; it occurs four times.
In addition to summarizing the central tendency of a distribution, descriptive statistics convey information about how the scores of the variable are spread around the central tendency. Dispersion refers to the extent to which the scores are all tightly clustered around the central tendency (see Figure 2.8 ). Here, there are many scores close to the middle of the distribution.
In other instances, they may be more spread out away from it (see Figure 2.9 ). Here, the scores are further away from the middle of the distribution.
One simple measure of dispersion is to find the largest (i.e., the maximum) and the smallest (i.e., the minimum) observed values of the variable and to compute the range of the variable as the maximum observed score minus the minimum observed score. You can check that the range of the height variable shown in Figure 2.6 above is 72 – 62 = 10.
The standard deviation , symbolized as s , is the most commonly used measure of variability around the mean. Distributions with a larger standard deviation have more spread. Those with small deviations have scores that do not stray very far from the average score. Thus, standard deviation is a good measure of the average deviation from the mean in a set of scores. In the examples above, the standard deviation of height is s = 2.74, and the standard deviation of family income is s = $745,337. These standard deviations would be more informative if we had others to compare them to. For example, suppose we obtained a different sample of adult heights and compared it to those shown in Figure 2.6 above. If the standard deviation was very different, that would tell us something important about the variability in the second sample as compared to the first. A more relatable example might be student grades: a professor could keep track of student grades over many semesters. If the standard deviations were relatively similar from semester to semester, this would indicate that the amount of variability in student performance is fairly constant. If the standard deviation suddenly went up, that would indicate that there are more students with very low scores, very high scores, or both. It’s useful to see how standard deviation is calculated: a good demonstration can be found at Khan Academy .
The standard deviation in the normal distribution has some interesting properties (see Figure 2.10 ). Approximately 68% of the data fall within 1 standard deviation above or below the mean score: 34% fall above the mean, and 34% fall below. In other words, about 2/3 of the population are within 1 standard deviation of the mean. Therefore, if some variable is normally distributed (e.g., height, IQ, etc.), you can quickly work out where approximately 2/3 of the population fall by knowing the mean and standard deviation.
We have seen that descriptive statistics are useful in providing an initial way to describe, summarize, and interpret a set of data. They are limited in usefulness because they tell us nothing about how meaningful the data are. The second step in analyzing data requires inferential statistics . Inferential statistics provide researchers with the tools to make inferences about the meaning of the results. Specifically, they allow researchers to generalize from the sample they used in their research to the greater population, which the sample represents. Keep in mind that psychologists, like other scientists, rely on relatively small samples to try to understand populations.
This is not a textbook about statistics, so we will limit the discussion of inferential statistics. However, all students of psychology should become familiar with one very important inferential statistic: the significance test. In the simplest, non-mathematical terms, the significance test is the researcher’s estimate of how likely it is that their results were simply the result of chance. Significance testing is not the same thing as estimating how meaningful or large the results are. For example, you might find a very small difference between two experimental conditions that is statistically significant.
Typically, most researchers use the convention that if significance testing shows that a result has a less than 5% probability of being due to chance alone, the result is considered to be real and to generalize to the population. If the significance test shows that the probability of chance causing the outcome is greater than 5%, it is considered to be a non-significant result and, consequently, of little value; non-significant results are more likely to be chance findings and, therefore, should not be generalized to the population. Significance tests are reported as p values , for example, p< .05 means the probability of being caused by chance is less than 5%. P values are reported by all statistical programs so students no longer need to calculate them by hand. Most often, p values are used to determine whether or not effects detected in the research are present. So, if p< .05, then we can conclude that an effect is present, and the difference between the two groups is real.
Thus, p values provide information about the presence of an effect. However, for information about how meaningful or large an effect is, significance tests are of little value. For that, we need some measure of effect size. Effect size is a measure of magnitude; for example, if there is a difference between two experimental groups, how large is the difference? There are a few different statistics for calculating effect sizes.
In summary, statistics are an important tool in helping researchers understand the data that they have collected. Once the statistics have been calculated, the researchers interpret their results. Thus, while statistics are heavily used in the analysis of data, the interpretation of the results requires a researcher’s knowledge, analysis, and expertise.
Figure 2.5. Used under a CC BY-NC-SA 4.0 license.
Figure 2.6. Used under a CC BY-NC-SA 4.0 license.
Figure 2.7. Used under a CC BY-NC-SA 4.0 license.
Figure 2.8. Used under a CC BY-NC-SA 4.0 license.
Figure 2.9. Used under a CC BY-NC-SA 4.0 license.
Figure 2.10. Empirical Rule by Dan Kernler is used under a CC BY-SA 4.0 license.
Figure 2.7. Of the 25 families, 24 families have an income between $44,000 and $111,000, and only one family has an income of $3,800,000. The mean income is $223,960, while the median income is $73,000.
[Return to Figure 2.7]
Psychology - 1st Canadian Edition Copyright © 2020 by Sally Walters is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License , except where otherwise noted.
Descriptive and inferential statistics are two fields of statistics. Descriptive statistics is used to describe data and inferential statistics is used to make predictions. Descriptive and inferential statistics have different tools that can be used to draw conclusions about the data.
In descriptive and inferential statistics, the former uses tools such as central tendency, and dispersion while the latter makes use of hypothesis testing, regression analysis, and confidence intervals. In this article, we will learn more about descriptive and inferential statistics, its differences, associated formulas and examples.
1. | |
2. | |
3. | |
4. | |
5. |
The purpose of descriptive and inferential statistics is to analyze different types of data using different tools. Descriptive statistics helps to describe and organize known data using charts, bar graphs, etc., while inferential statistics aims at making inferences and generalizations about the population data.
Descriptive statistics are a part of statistics that can be used to describe data. It is used to summarize the attributes of a sample in such a way that a pattern can be drawn from the group. It enables researchers to present data in a more meaningful way such that easy interpretations can be made. Descriptive statistics uses two tools to organize and describe data. These are given as follows:
Inferential Statistics
Inferential statistics is a branch of statistics that is used to make inferences about the population by analyzing a sample. When the population data is very large it becomes difficult to use it. In such cases, certain samples are taken that are representative of the entire population. Inferential statistics draws conclusions regarding the population using these samples. Sampling strategies such as simple random sampling, cluster sampling, stratified sampling, and systematic sampling, need to be used in order to choose correct samples from the population. Some methodologies used in inferential statistics are as follows:
Both descriptive and inferential statistics are equally important to analyze data. Descriptive statistics are used to order data and describe the sample using the mean, standard deviation, charts, etc. Inferential statistics uses this sample data to predict the trend of the population data. The differences between descriptive and inferential statistics have been outlined in the table given below:
Definition | Descriptive statistics is used to describe the characteristics of the population using a sample. | Inferential statistics uses various analytical tools to draw inferences about the population using samples. |
Tools | and measures of dispersion. | Hypothesis testing and regression analysis. |
Use | Organizes, describes and presents data in a meaningful way with the help of charts and graphs. | Tests, predicts, and compares data obtained from various samples. |
Relevance | It is used to summarize known data in a way that can be used for further predictions and analysis. | It tries to use the summarized samples to draw conclusions about the population. |
There are many statistical formulas that fall under descriptive and inferential statistics. These are given as follows:
Descriptive Statistics:
Descriptive and inferential statistics need to be used hand in hand so as to analyze the data in the best possible way. Some examples of descriptive and inferential statistics are given below:
Related Articles:
Important Notes on Descriptive and Inferential Statistics
Example 1: The scores of 2 groups of students belonging to different classes are noted. Using descriptive and inferential statistics see which group exhibits a higher variability in performance.
Group A: 56, 58, 60, 62, 64
Group B: 40, 50, 60, 70, 80
Solution: To describe the variability in performance the variance is used. Thus, descriptive statistics is used to analyze this data.
Group A mean = (56 + 58 + 60 + 62 + 64) / 5 = 60
Group A variance = ([56 - 60) 2 + (58 - 60) 2 + (60 - 60) 2 + (62 - 60) 2 + (64 - 60) 2 ] / 5 - 1 = 10
Group B mean = (40 + 50 + 60 + 70 + 80) / 5 = 60
Group B variance = ([40 - 60) 2 + (50 - 60) 2 + (60 - 60) 2 + (70 - 60) 2 + (80 - 60) 2 ] / 5 - 1 = 250
By looking at the variance it is clear that group B displays higher variance than group A
Answer: Group B is more variable.
Example 2: Find the mode of the following data using descriptive statistics. 5, 6, 2, 7, 6, 5,1, 9, 5, 8, 5, 4, 3, 12, 11, 17, 5, 5
Solution: Mode is the most frequently occurring observation. Thus, the mode is 5
Answer: Mode = 5
Example 3: Find the z score using descriptive and inferential statistics for the given data. Population mean 100, sample mean 120, population variance 49 and size 10.
Solution: Inferential statistics is used to find the z score of the data. The formula is given as follows:
z = \(\frac{x-\mu}{\sigma}\)
Standard deviation = \(\sqrt{49}\) = 7
z = (120 - 100) / 7
= 20 / 7 = 2.86
Answer: Z score = 2.86
go to slide go to slide go to slide
Book a Free Trial Class
What is the meaning of descriptive and inferential statistics.
Descriptive and inferential statistics are two branches of statistics that are used to describe data and make important inferences about the population using samples.
The tools used in descriptive and inferential statistics are measures of central tendency, measures of dispersion , hypothesis testing, and regression analysis.
The important formulas used in descriptive and inferential statistics are as follows:
Descriptive and inferential statistics are used to analyze data. Descriptive statistics is used to describe and organize data while inferential statistics draw conclusions about the population from samples by using analytical tools.
Yes, hypothesis tests such as z test, f test, ANOVA test, and t-test are a part of descriptive and inferential statistics. Hypothesis testing along with regression analysis specifically fall under inferential statistics.
The similarity between descriptive and inferential statistics is that they both rely on the same data set. Descriptive statistics describes this data set while inferential statistics uses this data set to make generalizations about a population
The main difference between descriptive and inferential statistics is that the former is used to describe the characteristics of a data set while the latter focuses on making predictions and generalizations about the data.
Check your thesis for plagiarism in 10 minutes, generate your apa citations for free.
Published on 4 November 2022 by Pritha Bhandari . Revised on 9 January 2023.
Descriptive statistics summarise and organise characteristics of a data set. A data set is a collection of responses or observations from a sample or entire population .
In quantitative research , after collecting data, the first step of statistical analysis is to describe characteristics of the responses, such as the average of one variable (e.g., age), or the relation between two variables (e.g., age and creativity).
The next step is inferential statistics , which help you decide whether your data confirms or refutes your hypothesis and whether it is generalisable to a larger population.
Types of descriptive statistics, frequency distribution, measures of central tendency, measures of variability, univariate descriptive statistics, bivariate descriptive statistics, frequently asked questions.
There are 3 main types of descriptive statistics:
You can apply these to assess only one variable at a time, in univariate analysis, or to compare two or more, in bivariate and multivariate analysis.
A data set is made up of a distribution of values, or scores. In tables or graphs, you can summarise the frequency of every possible value of a variable in numbers or percentages.
Gender | Number |
---|---|
Male | 182 |
Female | 235 |
Other | 27 |
From this table, you can see that more women than men or people with another gender identity took part in the study. In a grouped frequency distribution, you can group numerical response values and add up the number of responses for each group. You can also convert each of these numbers to percentages.
Library visits in the past year | Percent |
---|---|
0–4 | 6% |
5–8 | 20% |
9–12 | 42% |
13–16 | 24% |
17+ | 8% |
Measures of central tendency estimate the center, or average, of a data set. The mean , median and mode are 3 ways of finding the average.
Here we will demonstrate how to calculate the mean, median, and mode using the first 6 responses of our survey.
The mean , or M , is the most commonly used method for finding the average.
To find the mean, simply add up all response values and divide the sum by the total number of responses. The total number of responses or observations is called N .
Data set | 15, 3, 12, 0, 24, 3 |
---|---|
Sum of all values | 15 + 3 + 12 + 0 + 24 + 3 = 57 |
Total number of responses | = 6 |
Mean | Divide the sum of values by to find : 57/6 = |
The median is the value that’s exactly in the middle of a data set.
To find the median, order each response value from the smallest to the biggest. Then, the median is the number in the middle. If there are two numbers in the middle, find their mean.
Ordered data set | 0, 3, 3, 12, 15, 24 |
---|---|
Middle numbers | 3, 12 |
Median | Find the mean of the two middle numbers: (3 + 12)/2 = |
The mode is the simply the most popular or most frequent response value. A data set can have no mode, one mode, or more than one mode.
To find the mode, order your data set from lowest to highest and find the response that occurs most frequently.
Ordered data set | 0, 3, 3, 12, 15, 24 |
---|---|
Mode | Find the most frequently occurring response: |
Measures of variability give you a sense of how spread out the response values are. The range, standard deviation and variance each reflect different aspects of spread.
The range gives you an idea of how far apart the most extreme response scores are. To find the range , simply subtract the lowest value from the highest value.
The standard deviation ( s ) is the average amount of variability in your dataset. It tells you, on average, how far each score lies from the mean. The larger the standard deviation, the more variable the data set is.
There are six steps for finding the standard deviation:
Raw data | Deviation from mean | Squared deviation |
---|---|---|
15 | 15 – 9.5 = 5.5 | 30.25 |
3 | 3 – 9.5 = -6.5 | 42.25 |
12 | 12 – 9.5 = 2.5 | 6.25 |
0 | 0 – 9.5 = -9.5 | 90.25 |
24 | 24 – 9.5 = 14.5 | 210.25 |
3 | 3 – 9.5 = -6.5 | 42.25 |
= 9.5 | Sum = 0 | Sum of squares = 421.5 |
Step 5: 421.5/5 = 84.3
Step 6: √84.3 = 9.18
The variance is the average of squared deviations from the mean. Variance reflects the degree of spread in the data set. The more spread the data, the larger the variance is in relation to the mean.
To find the variance, simply square the standard deviation. The symbol for variance is s 2 .
Univariate descriptive statistics focus on only one variable at a time. It’s important to examine data from each variable separately using multiple measures of distribution, central tendency and spread. Programs like SPSS and Excel can be used to easily calculate these.
Visits to the library | |
---|---|
6 | |
Mean | 9.5 |
Median | 7.5 |
Mode | 3 |
Standard deviation | 9.18 |
Variance | 84.3 |
Range | 24 |
If you were to only consider the mean as a measure of central tendency, your impression of the ‘middle’ of the data set can be skewed by outliers, unlike the median or mode.
Likewise, while the range is sensitive to extreme values, you should also consider the standard deviation and variance to get easily comparable measures of spread.
If you’ve collected data on more than one variable, you can use bivariate or multivariate descriptive statistics to explore whether there are relationships between them.
In bivariate analysis, you simultaneously study the frequency and variability of two variables to see if they vary together. You can also compare the central tendency of the two variables before performing further statistical tests .
Multivariate analysis is the same as bivariate analysis but with more than two variables.
In a contingency table, each cell represents the intersection of two variables. Usually, an independent variable (e.g., gender) appears along the vertical axis and a dependent one appears along the horizontal axis (e.g., activities). You read ‘across’ the table to see how the independent and dependent variables relate to each other.
Number of visits to the library in the past year | |||||
---|---|---|---|---|---|
Group | 0–4 | 5–8 | 9–12 | 13–16 | 17+ |
Children | 32 | 68 | 37 | 23 | 22 |
Adults | 36 | 48 | 43 | 83 | 25 |
Interpreting a contingency table is easier when the raw data is converted to percentages. Percentages make each row comparable to the other by making it seem as if each group had only 100 observations or participants. When creating a percentage-based contingency table, you add the N for each independent variable on the end.
Visits to the library in the past year (Percentages) | ||||||
---|---|---|---|---|---|---|
Group | 0–4 | 5–8 | 9–12 | 13–16 | 17+ | |
Children | 18% | 37% | 20% | 13% | 12% | 182 |
Adults | 15% | 20% | 18% | 35% | 11% | 235 |
From this table, it is more clear that similar proportions of children and adults go to the library over 17 times a year. Additionally, children most commonly went to the library between 5 and 8 times, while for adults, this number was between 13 and 16.
A scatter plot is a chart that shows you the relationship between two or three variables. It’s a visual representation of the strength of a relationship.
In a scatter plot, you plot one variable along the x-axis and another one along the y-axis. Each data point is represented by a point in the chart.
From your scatter plot, you see that as the number of movies seen at movie theaters increases, the number of visits to the library decreases. Based on your visual assessment of a possible linear relationship, you perform further tests of correlation and regression.
Descriptive statistics summarise the characteristics of a data set. Inferential statistics allow you to test a hypothesis or assess whether your data is generalisable to the broader population.
The 3 main types of descriptive statistics concern the frequency distribution, central tendency, and variability of a dataset.
If you want to cite this source, you can copy and paste the citation or click the ‘Cite this Scribbr article’ button to automatically add the citation to our free Reference Generator.
Bhandari, P. (2023, January 09). Descriptive Statistics | Definitions, Types, Examples. Scribbr. Retrieved 21 August 2024, from https://www.scribbr.co.uk/stats/descriptive-statistics-explained/
Other students also liked, data collection methods | step-by-step guide & examples, variability | calculating range, iqr, variance, standard deviation, normal distribution | examples, formulas, & uses.
This course is part of Data Analysis with R Specialization
Financial aid available
117,799 already enrolled
(2,666 reviews)
Add to your LinkedIn profile
Add this credential to your LinkedIn profile, resume, or CV
Share it on social media and in your performance review
This course covers commonly used statistical inference methods for numerical and categorical data. You will learn how to set up and perform hypothesis tests, interpret p-values, and report the results of your analysis in a way that is interpretable for clients or the public. Using numerous data examples, you will learn to report estimates of quantities in a way that expresses the uncertainty of the quantity of interest. You will be guided through installing and using R and RStudio (free statistical software), and will use this software for lab exercises and a final project. The course introduces practical tools for performing data analysis and explores the fundamental concepts necessary to interpret and report results for both categorical and numerical data
This short module introduces basics about Coursera specializations and courses in general, this specialization: Statistics with R, and this course: Inferential Statistics. Please take several minutes to browse them through. Thanks for joining us in this course!
2 readings • total 20 minutes.
Welcome to Inferential Statistics! In this course we will discuss Foundations for Inference. Check out the learning objectives, start watching the videos, and finally work on the quiz and the labs of this week. In addition to videos that introduce new concepts, you will also see a few videos that walk you through application examples related to the week's topics. In the first week we will introduce Central Limit Theorem (CLT) and confidence interval.
7 videos 6 readings 3 quizzes
Welcome to Week Two! This week we will discuss formal hypothesis testing and relate testing procedures back to estimation via confidence intervals. These topics will be introduced within the context of working with a population mean, however we will also give you a brief peek at what's to come in the next two weeks by discussing how the methods we're learning can be extended to other estimators. We will also discuss crucial considerations like decision errors and statistical vs. practical significance. The labs for this week will illustrate concepts of sampling distributions and confidence levels.
7 videos 5 readings 3 quizzes
Welcome to Week Three of the course! This week we will introduce the t-distribution and comparing means as well as a simulation based method for creating a confidence interval: bootstrapping. If you have questions or discussions, please use this week's forum to ask/discuss with peers.
11 videos 5 readings 3 quizzes
Welcome to Week Four of our course! In this unit, we’ll discuss inference for categorical data. We use methods introduced this week to answer questions like “What proportion of the American public approves of the job of the Supreme Court is doing?” Also in this week you will use the data set provided to complete and report on a data analysis question. Please read the project instructions to complete this self-assessment.
11 videos 6 readings 3 quizzes
We asked all learners to give feedback on our instructors based on the quality of their teaching style.
Duke University has about 13,000 undergraduate and graduate students and a world-class faculty helping to expand the frontiers of knowledge. The university has a strong commitment to applying knowledge in service to society, both near its North Carolina campus and around the world.
Duke University
Eindhoven University of Technology
University of Amsterdam
Showing 3 of 2666
2,666 reviews
Reviewed on Feb 28, 2017
Great course. If you put in a little effort, you will come out with a lot of new knowledge. I recommend using the book after you have seen the movies. It gives a deeper picture of how it works. Great!
Reviewed on May 25, 2024
This course equips students with the knowledge and skills needed to collect, analyze, and interpret data effectively, making it a valuable tool in many fields of study and professions.
Reviewed on Jul 5, 2020
Very nicely designed course and it also progresses very well. If higher mathematics would be involved in it, the course has the ability to replace many college's statistical inference's classes.
Unlimited access to 7,000+ world-class courses, hands-on projects, and job-ready certificate programs - all included in your subscription
Earn a degree from world-class universities - 100% online
Upskill your employees to excel in the digital economy
Cost of the course.
If you want to complete the course and earn a Course Certificate by submitting assignments for a grade, you can upgrade your experience by subscribing to the course for $49/month. You can also apply for financial aid if you can't afford the course fee.
When you enroll in a course that is part of a Specialization (which this course is), you will automatically be enrolled in the entire Specialization. You can unenroll from the Specialization if you’re not interested in the other courses or cancel your subscription once you complete the single course.
To enroll in an individual course, search for the course title in the catalog.
To get full access to a course, including the option to earn grades and a Course Certificate, you'll need to subscribe. New subscribers will start with a full access subscription, which includes full access to every course in the Coursera catalog. Existing Specialization subscribers will be given the option to update to a full access subscription when enrolling in a new Specialization or course.
When you enroll in a course that is part of a Specialization, you will automatically be enrolled in the entire Specialization. You can unenroll from the Specialization if you’re not interested in the other courses.
No. Completion of a Coursera course does not earn you academic credit from Duke; therefore, Duke is not able to provide you with a university transcript. However, your electronic Certificate will be added to your Accomplishments page - from there, you can print your Certificate or add it to your LinkedIn profile.
Access to lectures and assignments depends on your type of enrollment. If you take a course in audit mode, you will be able to see most course materials for free. To access graded assignments and to earn a Certificate, you will need to purchase the Certificate experience, during or after your audit. If you don't see the audit option:
The course may not offer an audit option. You can try a Free Trial instead, or apply for Financial Aid.
The course may offer 'Full Course, No Certificate' instead. This option lets you see all course materials, submit required assessments, and get a final grade. This also means that you will not be able to purchase a Certificate experience.
When you enroll in the course, you get access to all of the courses in the Specialization, and you earn a certificate when you complete the work. Your electronic Certificate will be added to your Accomplishments page - from there, you can print your Certificate or add it to your LinkedIn profile. If you only want to read and view the course content, you can audit the course for free.
If you subscribed, you get a 7-day free trial during which you can cancel at no penalty. After that, we don’t give refunds, but you can cancel your subscription at any time. See our full refund policy Opens in a new tab .
Yes. In select learning programs, you can apply for financial aid or a scholarship if you can’t afford the enrollment fee. If fin aid or scholarship is available for your learning program selection, you’ll find a link to apply on the description page.
Unit 7: how to evaluate descriptive and inferential statistics.
Unit 7: Assignment #1 (due before 11:59 pm Central on Wednesday September 29) :
Unit 7: Assignment #2 (due before 11:59 pm Central on Thursday September 30) :
Unit 7: Assignment #3 (due before 11:59 pm Central on Friday October 1) :
Unit 7: Assignment #4 (due before 11:59 pm Central on Sunday October 3) :
Unit 7: Assignment #5 (due before 11:59 pm Central on Monday October 4) :
Unit 7: Assignment #6 (due before 11:59 pm Central on Tuesday October 5) :
Congratulations, you have finished Unit 7! Onward to Unit 8 !
Open-Access Active-Learning Research Methods Course by Morton Ann Gernsbacher, PhD is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License . The materials have been modified to add various ADA-compliant accessibility features, in some cases including alternative text-only versions. Permissions beyond the scope of this license may be available at http://www.gernsbacherlab.org .
What is descriptive statistics, what is inferential statistics, key differences between descriptive and inferential statistics , common similarities between descriptive and inferential statistics, 3 major types of descriptive statistics, 3 major types of inferential statistics, descriptive and inferential statistics tools, choose the right program, comprehensive guide to descriptive vs inferential statistics.
Statistics forms the core of data analytics, serving as the fundamental tool for identifying trends and patterns within vast numerical datasets. This mathematical discipline encompasses two main categories: Descriptive Statistics and Inferential Statistics. Here, we delve into the contrasting aspects of descriptive vs inferential statistics and their respective impacts on data analytics . While certain measurement techniques may overlap, their underlying objectives diverge significantly. Therefore, it is crucial to discern the major disparities between the two.
Descriptive statistics is a branch of statistics that deals with summarizing and describing the main features of a dataset. It provides methods for organizing, visualizing, and presenting data meaningfully and informally. Descriptive statistics describe the characteristics of the data set under study without generalizing beyond the analyzed data.
Common measures and techniques in descriptive statistics include measures of central tendency (such as mean, median, and mode), measures of dispersion (such as range, variance, and standard deviation), frequency distributions (histograms, frequency tables), and graphical representations (box plots, bar charts, pie charts, etc.). These methods help to provide a clear and concise summary of the data, facilitating easier interpretation and understanding.
Inferential statistics , on the other hand, involves making inferences, predictions, or generalizations about a larger population based on data collected from a sample of that population. It extends the findings from a sample to the population from which the sample was drawn. Inferential statistics allow researchers to draw conclusions, test hypotheses, and make predictions about populations, even when it is impractical or impossible to study the entire population directly.
Key methods in inferential statistics include hypothesis testing, where researchers test hypotheses about population parameters using sample data; regression analysis, where relationships between variables are examined and used to make predictions; and confidence intervals, which provide estimates of population parameters and their uncertainty levels.
This table summarizes the main differences between descriptive and inferential statistics, highlighting their respective purposes, scopes, objectives, examples, and statistical techniques.
|
|
|
Purpose | Summarizes and describes features of a dataset | Makes inferences, predictions, or generalizations about a population based on sample data |
Scope | Focuses on specific sample data | Extends findings to a larger population |
Objective | Describes characteristics of the data without generalizing | Generalizes findings from sample to population |
Examples | Measures of central tendency, dispersion, frequency distributions, graphical representations | Hypothesis testing, regression analysis, confidence intervals |
Data Analysis | Provides a summary and visualization of data | Draws conclusions, tests hypotheses, and makes predictions |
Population Representation | Represents features within the sample only | Represents features of the larger population |
Statistical Techniques | Mean, median, mode, range, variance, standard deviation, histograms, box plots, etc. | Hypothesis testing, regression analysis, confidence intervals |
Goal | To provide insights into the characteristics of a dataset | To make predictions or draw conclusions about a population |
Associate data engineer , publicis sapient.
The course was well structured and curated. The live classes were extremely helpful. They made learning more productive and interactive. The program helped me change my domain from a data analyst to an Associate Data Engineer.
Project manage , codethink.
After completing this course, I landed a new job & a salary hike of 30%. I now work with Zuhlke Group as a Project Manager.
Measures of central tendency represent the center or typical value of a dataset. They provide insight into where the bulk of the data points lie. The three main measures of central tendency are:
Measures of dispersion quantify the spread or variability of data points around the central tendency. They indicate how much the individual data points deviate from the average. Common measures of dispersion include:
Frequency distributions display the frequency of occurrence of different values or ranges in a dataset. They help to visualize the distribution of data across various categories. Common graphical representations used in descriptive statistics include:
Hypothesis testing is a fundamental technique in inferential statistics used to make decisions or draw conclusions about a population parameter based on sample data. It involves formulating a null hypothesis (H0) and an alternative hypothesis (Ha), collecting sample data, and using statistical tests to determine whether there is enough evidence to reject the null hypothesis in favor of the alternative hypothesis. Common statistical tests for hypothesis testing include t-tests, chi-square tests, ANOVA (Analysis of Variance), and z-tests.
Regression analysis is a statistical technique used to examine the relationship between one or more independent variables (predictors) and a dependent variable (outcome) and to make predictions based on this relationship. It helps to identify and quantify the strength and direction of the association between variables and to predict the dependent variable's value for given independent variable values. Common types of regression analysis include linear, logistic, polynomial, and multiple regression.
Confidence intervals provide a range of values within which the true population parameter is likely to lie with a certain level of confidence based on sample data. They quantify the uncertainty associated with estimating population parameters from sample data. Confidence intervals are calculated using point estimates, such as sample means or proportions, and their standard errors. The confidence level represents the probability that the interval contains the true population parameter. Commonly used confidence levels include 90%, 95%, and 99%.
Interested in building a career path within the dynamic world of data analytics? Our data analytics courses are developed to equip you with the skills and expertise to thrive in this swiftly expanding field. Led by seasoned instructors, our curriculum is enriched with hands-on projects, real-world simulations, and case studies, fostering a practical learning environment essential for your triumph. Through our courses, you'll master the art of data analysis, adeptly craft insightful reports, and harness the power of data-driven decision-making pivotal for steering business triumphs.
Program Name Data Analyst Post Graduate Program In Data Analytics Data Analytics Bootcamp Geo All Geos All Geos US University Simplilearn Purdue Caltech Course Duration 11 Months 8 Months 6 Months Coding Experience Required No Basic No Skills You Will Learn 10+ skills including Python, MySQL, Tableau, NumPy and more Data Analytics, Statistical Analysis using Excel, Data Analysis Python and R, and more Data Visualization with Tableau, Linear and Logistic Regression, Data Manipulation and more Additional Benefits Applied Learning via Capstone and 20+ industry-relevant Data Analytics projects Purdue Alumni Association Membership Free IIMJobs Pro-Membership of 6 months Access to Integrated Practical Labs Caltech CTME Circle Membership Cost $$ $$$$ $$$$ Explore Program Explore Program Explore Program
Do you want to gain an in-depth understanding of descriptive vs. inferential statistics? Do you want to master the computation of summary statistics and gain a thorough knowledge of both branches? Enrolling in the Data Analyst Masters Program by Simplilearn is a significant step for those aspiring to build a career in data analytics. This program equips you with essential statistical fundamentals, including the disparities between descriptive and inferential statistics.
Descriptive statistics summarize and describe the main features of a dataset through measures like mean, median, and standard deviation, providing a quick overview of the sample data. Inferential statistics, on the other hand, use sample data to make estimates, predictions, or other generalizations about a larger population. It involves using probability theory to infer characteristics of the population from which the sample was drawn.
An example of an inferential statistic is the calculation of a confidence interval. For instance, after sampling test scores from a group of students, a confidence interval might be used to estimate the range within which the average test score of all students in the population likely falls.
An example of a descriptive statistic is the mean (average) score of students on a test. If you have test scores for 30 students in a class, calculating the mean score provides a summary of the performance of the class on that test.
Data Science & Business Analytics programs typically range from a few weeks to several months, with fees varying based on program and institution.
Program Name | Duration | Fees |
---|---|---|
Cohort Starts: | 32 weeks | $ 3,850 |
Cohort Starts: | 11 Months | $ 4,500 |
Cohort Starts: | 6 Months | $ 8,500 |
Cohort Starts: | 8 months | $ 3,500 |
Cohort Starts: | 11 months | $ 3,800 |
Cohort Starts: | 3 Months | $ 2,624 |
11 months | $ 1,449 | |
11 months | $ 1,449 |
Machine Learning using Python
Artificial Intelligence Beginners Guide: What is AI?
Data science & business analytics.
Navigate the Future of Data Analytics with Gen AI & Prompt Engineering
Data Storytelling: Transform Data into Business Solutions with Power BI in 60 Minutes
Career Masterclass: AI Engineer vs. Data Scientist: Skills, Roles, and Opportunities
Machine Learning Career Guide: A Playbook to Becoming a Machine Learning Engineer
Inferential Statistics Explained: From Basics to Advanced!
A Comprehensive Look at Percentile in Statistics
Free eBook: Top Programming Languages For A Data Scientist
The Difference Between Data Mining and Statistics
A One-Stop Guide to Statistics for Machine Learning
Probability and Statistics > Descriptive Statistics
Descriptive statistics are one of the fundamental “must knows” with any set of data. It gives you a general idea of trends in your data including:
Descriptive statistics is useful because it allows you to take a large amount of data and summarize it. For example, let’s say you had data on the incomes of one million people. No one is going to want to read a million pieces of data; if they did, they wouldn’t be able to glean any useful information from it. On the other hand, if you summarize it, it becomes useful: an average wage, or a median income, is much easier to understand than reams of data.
Descriptive statistics can be further broken down into several sub-areas, like:
The charts, graphs and plots site index is below . For definitions and information on how to find measures of spread and central tendency, see: Basic statistics (which covers the basic terms you’ll find in descriptive statistics like interquartile range , outliers and standard deviation ).
Statistics can be broken down into two areas:
Descriptive statistics just describes data. For example, descriptive statistics about a college could include: the average SAT score for incoming freshmen; the median income of parents; racial makeup of the student body. It says nothing about why the data might exist, or what trends you might be able to see from the data. When you take your data and start to make predictions about future behavior or trends, that’s inferential statistics. Inferential statistics also allows you to take sample data (e.g. from one university) and apply it to a larger population (e.g. all universities in the country).
Step 1: Type your data into Excel, in a single column. For example, if you have ten items in your data set, type them into cells A1 through A10.
Step 2: Click the “Data” tab and then click “Data Analysis” in the Analysis group.
Step 3: Highlight “Descriptive Statistics” in the pop-up Data Analysis window.
Step 4: Type an input range into the “Input Range” text box. For this example, type “A1:A10” into the box.
Step 5: Check the “Labels in first row” check box if you have titled the column in row 1, otherwise leave the box unchecked.
Step 6: Type a cell location into the “Output Range” box. For example, type “C1.” Make sure that two adjacent columns do not have data in them.
Step 7: Click the “Summary Statistics” check box and then click “OK” to display Excel descriptive statistics. A list of descriptive statistics will be returned in the column you selected as the Output Range.
There are literally dozens of charts and graphs you can make from data. which one you choose depends upon what kind of data you have and what you want to display. For example, if you wanted to display relationships between data in categories, you could make a bar graph.
A pie chart would show you how categories in your data relate to the whole set.
Scatter plots are a good way to display data points.
Less common, but useful in some cases, include dot plots and box and whisker charts :
Dodge, Y. (2008). The Concise Encyclopedia of Statistics . Springer. Everitt, B. S.; Skrondal, A. (2010), The Cambridge Dictionary of Statistics , Cambridge University Press. Gonick, L. (1993). The Cartoon Guide to Statistics . HarperPerennial. Salkind, N. (2016). Statistics for People Who (Think They) Hate Statistics: Using Microsoft Excel 4th Edition.
Introduction.
The first step of any data-related process is the collection of data. Once we have collected the data, what do we do with it? Data can be sorted, analyzed, and used in various methods and formats, depending on the project’s needs. While analyzing a dataset, We use statistical methods to arrive at a conclusion. Data-driven decision-making also depends on how efficiently we use these methods. Two types of statistical methods are widely used in data analysis: descriptive and inferential. This article will focus more on descriptive statistics, its types, calculations, examples,percentages etc.
This article was published as a part of the Data Science Blogathon .
What is descriptive statistics, types of statistics, what is inferential statistics, types of descriptive statistics, descriptive statistics based on the central tendency of data, descriptive statistics based on the dispersion of data, descriptive statistics based on the shape of the data, univariate data vs. bivariate data in descriptive statistics, what are the 10 commonly used descriptive statistics, can descriptive statistics be used to make inferences or predictions, frequently asked questions.
Descriptive statistics serves as the initial step in understanding and summarizing data . It involves organizing, visualizing, and summarizing raw data to create a coherent picture. The primary goal of descriptive statistics is to provide a clear and concise overview of the data’s main features. This helps us identify patterns, trends, and characteristics within the data set without making broader inferences.
Key Aspects of Descriptive Statistics
When you delve into the world of statistics, you’ll encounter two fundamental branches: descriptive statistics and inferential statistics. These two distinct approaches help us make sense of data and draw conclusions. Let’s look at the differences between these two branches to shed light on their roles in the realm of statistical analysis and their total number of branches.
Aspect | Descriptive Statistics | Inferential Statistics |
---|---|---|
Purpose | Summarize and describe data | Draw conclusions or predictions |
Data Sample | Analyzes the entire dataset | Analyzes a sample of the data |
Examples | Mean, Median, Range, Variance | Hypothesis testing, Regression |
Scope | Focuses on data characteristics | Makes inferences about populations |
Goal | Provides insights and simplifies data | Generalizes findings to a larger population |
Assumptions | No assumptions about populations | Requires assumptions about populations |
Common Use Cases | Data visualization, data exploration | Scientific research, hypothesis testing |
Inferential statistics takes data analysis to the next level by drawing conclusions about populations based on a sample. It involves making predictions, generalizations, and hypotheses about a larger group using a smaller subset of data. Inferential statistics bridges the gap between our data and the conclusions we want to reach. This is particularly useful when obtaining data from an entire population is impractical or impossible.
Key Aspects of Inferential Statistics
Now we will look at descriptive statistics in detail.
There are various dimensions in which this data can be described. The three main dimensions used for describing data are the central tendency, dispersion, and the shape of the data. Now, let’s look at them in detail, one by one.
The central tendency of data is the center of the distribution of data. It describes the location of data and concentrates on where the data is located. The three most widely used measures of the “center” of the data are Mean, Median, and Mode.
The “Mean” is the average of the data. The average can be identified by summing up all the numbers and then dividing them by the number of observations.
Mean = X 1 + X 2 + X 3 +… + X n / n
Data – 10,20,30,40,50 and Number of observations = 5 Mean = [ 10+20+30+40+50 ] / 5 Mean = 30
The central tendency of the data may be influenced by outliers. You may now ask, ‘ What are outliers? ‘ Well, outliers are extreme behaviors. An outlier is a data point that differs significantly from other observations. It can cause serious problems in analysis.
Data – 10,20,30,40,200 Mean = [ 10+20+30+40+200 ] / 5 Mean = 60
Solution for the outliers problem: Removing the outliers while taking averages will give us better results.
It is the 50th percentile of the data. In other words, it is exactly the center point of the data. The median can be identified by ordering the data, splitting it into two equal parts, and then finding the number in the middle. It is the best way to find the center of the data.
Note that, in this case, the central tendency of the data is not affected by outliers.
Odd number of Data – 10,20,30,40,50 Median is 30. Even the number of data – 10,20,30,40,50,60
Find the middle 2 data and take the mean of those two values. Here, 30 and 40 are middle values. Now, add them and divide the result by 2 30+40 / 2 =35 Median is 35
The mode of the data is the most frequently occurring data or elements in a dataset. If an element occurs the highest number of times, it is the mode of that data. If no number in the data is repeated, then that data has no mode. There can be more than one mode in a dataset if two values have the same frequency, which is also the highest frequency.
Outliers don’t influence the data in this case. The mode can be calculated for both quantitative and qualitative data.
Data – 1,3,4,6,7,3,3,5,10, 3 Mode is 3, because 3 has the highest frequency (4 times)
The dispersion is the “spread of the data”. It measures how far the data is spread. In most of the dataset, the data values are closely located near the mean. The values are widely spread out of the mean on some other datasets. These dispersions of data can be measured by the Inter Quartile Range (IQR), range, standard deviation, and variance of the data.
Let us see these measures in detail.
Quartiles are special percentiles. 1st Quartile Q1 is the same as the 25th percentile. 2nd Quartile Q2 is the same as 50th percentile. 3rd Quratile Q3 is same as 75th percentile
Steps to find quartile and percentile
The Inter Quartile Range is the difference between the third quartile (Q3) and the first quartile (Q1)
IQR = Q3 – Q1
In this example, the Inter Quartile range is the spread of the middle half (50%) of the data.
The range is the difference between the largest and the smallest value in the data.
The most common measure of spread is the standard deviation. The Standard deviation measures how far the data deviates from the mean value. The standard deviation formula varies for population and and highest value of sample. Both formulas are similar but not the same.
Symbol used for Sample Standard Deviation – “s” (lowercase) Symbol used for Population Standard Deviation – “ σ” (sigma, lower case)
Steps to find the Standard Deviation
If x is a number, then the difference “x – mean” is its deviation. The deviations are used to calculate the standard deviation.
Sample Standard Deviation, s = Square root of sample variance Sample Standard Deviation, s = Square root of [Σ(x − x ¯ ) 2 / n-1] where x ¯ is average and n is no. of samples
Population Standard Deviation, σ = Square root of population variance Population Standard Deviation, σ = Square root of [ Σ(x − μ) 2 / N ] where μ is Mean and N is no.of population.
The standard deviation is always positive or zero. It will be large when the data values are spread out from the mean.
The variance is a measure of variability. It is the average squared deviation from the mean. The symbol σ 2 represents the population variance, and the symbol for s 2 represents sample variance.
The shape of the data is important because deciding the probability of data is based on its shape. The shape describes the type of the graph.
The shape of the data can be measured by three methodologies: symmetric, skewness, kurtosis
In the symmetric shape of the graph, the data is distributed the same on both sides. In symmetric data, the mean and median are located close together. The curve formed by this symmetric graph is called a normal curve.
Skewness is the measure of the asymmetry of the distribution of data. The data is not symmetrical (i.e.) it is skewed towards one side. Skewness is classified into two types: positive skew and negative skew.
Kurtosis is the measure of describing the distribution of data. This data is distributed in three different ways: platykurtic, mesokurtic, and leptokurtic.
When it comes to delving into the world of data analysis, two key terms you’re likely to encounter are “ Univariate ” and “ Bivariate .” These terms are crucial in descriptive statistics, as they help us categorize and understand the data types we’re working with. Whether you’re deciphering the properties of individual data points or unraveling the intricate dance between two variables, the concepts of univariate and bivariate data provide the foundation for insightful data analysis.
the key difference between univariate and bivariate data lies in the focus of analysis. Univariate analysis centers on understanding the characteristics of a single variable, while bivariate analysis explores connections and interactions between two variables. Let’s break down the differences between univariate and bivariate data to better grasp their significance.
Univariate data focuses on a single variable, essentially spotlighting one aspect of your data. In this scenario, you’re interested in studying the distribution, central tendency, and dispersion of a single set of values. For instance, if you’re analyzing the heights of a group of individuals, you’re dealing with univariate data. Here, the variable of interest is height, and you aim to uncover insights about that specific characteristic.
In univariate analysis, you’re often looking at measures like:
Bivariate data, on the other hand, adds an extra layer of complexity to your analysis by involving two variables. Here, you’re not just interested in understanding individual characteristics; you’re also keen on uncovering relationships and patterns between two different variables. For example, if you’re examining the relationship between hours of study and exam scores, you’re working with bivariate data. The goal is to determine whether changes in one variable (study hours) have an impact on another (exam scores).
Bivariate analysis often involves techniques such as:
There are actually many useful descriptive statistics, but here are 5 of the most commonly used:
Descriptive statistics themselves are not used for predictions, but they can lay the groundwork for them. Here’s the key difference:
Descriptive statistics summarize the data you have. They use measures like mean, median, and standard deviation to give you a general idea of what the data looks like. This process often involves exploratory data analysis, where open exploration of the data can reveal patterns and insights. For instance, calculating mean scores is a common part of this analysis.
Inferential statistics use the data you have to draw conclusions about a larger population. This allows you to make predictions about things you haven’t observed yet. Here, you would identify the dependent variable and independent variable in your study, which are crucial for making these inferences.
Think of it like this: Descriptive statistics describe your apartment, while inferential statistics use the features of your apartment to guess about the entire apartment building.
So, while descriptive statistics can’t directly predict the future, they can help you understand the data and prepare it for inferential statistics, which can then be used for predictions. Summary statistics from your exploratory data analysis can provide the foundation for these predictive models.
In a world flooded with data, understanding, interpreting, and communicating information is paramount. Descriptive statistics doesn’t just crunch numbers; it crafts narratives, constructs visualizations, and empowers us to make informed decisions. Hope this article has given you a brief introduction to descriptive statistics. In this article, we have seen how the various measures of descriptive statistics, such as central tendency, dispersion, and shape of the data curve, help decipher the numbers. We have also bridged the gap between individual characteristics and the dance between variables by learning about univariate and bivariate data.
Also, this article will help you with the standard deviation of these statistics and statisticians. Not only Multivariate analysis measures of spread the sample size of the shape of the distribution of these statistics.
Ans. The methods used to summarize and describe the main features of a dataset are called descriptive statistics. Measures of central tendencies, measures of variability, etc., which give information about the typical values in a dataset, are all examples of descriptive statistics.
Ans. The 5 descriptive statistics include standard deviation, minimum and maximum variables, variance, kurtosis, and skewness.
Ans. The frequency distribution, central tendency, and variability of a dataset are the 3 main types of descriptive statistics.
Ans. Descriptive statistics are of 3 types: frequency distribution, central tendency, and variability.
The media shown in this article are not owned by Analytics Vidhya and are used at the Author’s discretion.
Descriptive vs Inferential Statistics: WhatR...
Get Started with Statistics for Data Science
Data Types in Statistics for Data Science
A Guide To Complete Statistics For Data Science...
End to End Statistics for Data Science
15 Basic Statistics Concepts Every Data Science...
Various Uses of Python Statistics Module &...
An Introduction to Statistics For Data Science:...
Mathematics for Data Science
Top 40 Data Science Statistics Interview Questions
Lorem ipsum dolor sit amet, consectetur adipiscing elit,
Clear Submit reply
I have seen so many websites/videos. But did not understood few concepts, after this page - I understood very clearly without any doubts. Kudos to those who prepared this tutorial. Thanking you very much...!!!!
Write, captivate, and earn accolades and rewards for your work
Sion Chakrabarti
CHIRAG GOYAL
Barney Darlington
Suvojit Hore
Arnab Mondal
Prateek Majumder
Revolutionizing ai learning & development.
Continue your learning for free, enter email address to continue, enter otp sent to.
Resend OTP in 45s
selected template will load here
This action is not available.
\( \newcommand{\vecs}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)
\( \newcommand{\vecd}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash {#1}}} \)
\( \newcommand{\id}{\mathrm{id}}\) \( \newcommand{\Span}{\mathrm{span}}\)
( \newcommand{\kernel}{\mathrm{null}\,}\) \( \newcommand{\range}{\mathrm{range}\,}\)
\( \newcommand{\RealPart}{\mathrm{Re}}\) \( \newcommand{\ImaginaryPart}{\mathrm{Im}}\)
\( \newcommand{\Argument}{\mathrm{Arg}}\) \( \newcommand{\norm}[1]{\| #1 \|}\)
\( \newcommand{\inner}[2]{\langle #1, #2 \rangle}\)
\( \newcommand{\Span}{\mathrm{span}}\)
\( \newcommand{\id}{\mathrm{id}}\)
\( \newcommand{\kernel}{\mathrm{null}\,}\)
\( \newcommand{\range}{\mathrm{range}\,}\)
\( \newcommand{\RealPart}{\mathrm{Re}}\)
\( \newcommand{\ImaginaryPart}{\mathrm{Im}}\)
\( \newcommand{\Argument}{\mathrm{Arg}}\)
\( \newcommand{\norm}[1]{\| #1 \|}\)
\( \newcommand{\Span}{\mathrm{span}}\) \( \newcommand{\AA}{\unicode[.8,0]{x212B}}\)
\( \newcommand{\vectorA}[1]{\vec{#1}} % arrow\)
\( \newcommand{\vectorAt}[1]{\vec{\text{#1}}} % arrow\)
\( \newcommand{\vectorB}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)
\( \newcommand{\vectorC}[1]{\textbf{#1}} \)
\( \newcommand{\vectorD}[1]{\overrightarrow{#1}} \)
\( \newcommand{\vectorDt}[1]{\overrightarrow{\text{#1}}} \)
\( \newcommand{\vectE}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash{\mathbf {#1}}}} \)
These are homework exercises to accompany the Textmap created for "Introductory Statistics" by Shafer and Zhang.
the store is not to run out of stock by the end of a week for more than \(15\%\) of the weeks; and
the store is not to run out of stock by the end of a week for more than \(5\%\) of the weeks.
\[\begin{array}{c|ccccc}6 & & & & & \\ 7 & & & & & \\ 8 & & & & & \\ 9 & & & & & \\ 10 & & & & & \\ 11 & & & & & \\ 12 & & & & & \\ 13 & 3 & 7 & 8 & 8 & 9 \\ 14 & 0 & 2 & 5 & & \\ 15 & 2 & & & & \\ 16 & 0 & & & &\end{array}\]
\[\begin{array}{c|ccccccc}6 & 0 & & & & \\ 7 & 4 & 9 & & & \\ 8 & 0 & 0 & 2 & 2 & 2 & 2 & 3 \\ 9 & & & & & \\ 10 & & & & & \\ 11 & & & & & \\ 12 & & & & & \\ 13 & & & & & \\ 14 & & & & & \\ 15 & & & & & \\ 16 & & & & &\end{array}\]
The frequency tables are given below in the same order:
\[\begin{array}{c|ccc}Length\hspace{0.167em} & 80 \sim 89 & 90 \sim 99 & 100 \sim 109 \\ \hline f & 1 & 1 & 5\end{array}\]
\[\begin{array}{c|cc}Length\hspace{0.167em} & 110 \sim 119 & 120 \sim 129 \\ \hline f & 2 & 1\end{array}\]
\[\begin{array}{c|ccc}Length\hspace{0.167em} & 130 \sim 139 & 140 \sim 149 & 150 \sim 159 \\ \hline f & 5 & 3 & 1\end{array}\]
\[\begin{array}{c|ccc}Length\hspace{0.167em} & 160 \sim 169 \\ \hline f & 1\end{array}\]
\[\begin{array}{c|ccc}Length\hspace{0.167em} & 60 \sim 69 & 70 \sim 79 & 80 \sim 89 \\ \hline f & 1 & 2 & 7\end{array}\]
The relative frequency tables are also given below in the same order:
Note: For Large Data Set Exercises below, all of the data sets associated with these questions are missing, but the questions themselves are included here for reference.
\[1\; 2\; 3\; 4\]
\[2\; -3\; 6\; 0\; 3\; 1\]
\[2\; 1\; 2\; 7\]
\[-1\; 0\; 1\; 4\; 1\; 1\]
\[\begin{array}{c|c c c} x & 1 & 2 & 7 \\ \hline f &1 &2 &1\\ \end{array}\]
\[\begin{array}{c|c c c c} x & -1 & 0 & 1 & 4 \\ \hline f &1 &1 &3 &1\\ \end{array}\]
\[\begin{matrix} 132 & 162 & 133 & 145 & 148\\ 139 & 147 & 160 & 150 & 153 \end{matrix}\]
\[\begin{matrix} 142 & 152 & 138 & 145 & 148\\ 139 & 147 & 155 & 150 & 153 \end{matrix}\]
Consider the data set represented by the table \[\begin{array}{c|c c c c c c c} x & 26 & 27 & 28 & 29 & 30 & 31 & 32 \\ \hline f &3 &4 &16 &12 &6 &2 &1\\ \end{array}\]
\[\begin{array}{c|c c c c c} x & 1 & 2 & 3 & 4 & 5 \\ \hline f &384 &208 &98 &56 &28 \\ \end{array}\]
\[\begin{array}{c|c c c c c} x & 6 & 7 & 8 & 9 & 10 \\ \hline f &12 &8 &2 &3 &1 \\ \end{array}\]
A random sample of \(49\) invoices for repairs at an automotive body shop is taken. The data are arrayed in the stem and leaf diagram shown. (Stems are thousands of dollars, leaves are hundreds, so that for example the largest observation is \(3,800\).)
\[\begin{array}{c|c c c c c c c c c c c} 3 & 5 & 6 & 8 \\ 3 &0 &0 &1 &1 &2 &4 \\ 2 &5 &6 &6 &7 &7 &8 &8 &9 &9 \\ 2 &0 &0 &0 &0 &1 &2 &2 &4 \\ 1 &5 &5 &5 &6 &6 &7 &7 &7 &8 &8 &9 \\ 1 &0 &0 &1 &3 &4 &4 &4 \\ 0 &5 &6 &8 &8 \\ 0 &4 \end{array}\]
For these data, \(\sum x=101\), \(\sum x^2=244,830,000\).
What must be true of a data set if its standard deviation is \(0\)?
A data set consisting of \(25\) measurements has standard deviation \(0\). One of the measurements has value \(17\). What are the other \(24\) measurements?
Create a sample data set of size \(n=3\) for which the range is \(0\) and the sample mean is \(2\).
Create a sample data set of size \(n=3\) for which the sample variance is \(0\) and the sample mean is \(1\).
The sample \(\{-1,0,1\}\) has mean \(\bar{x}=0\) and standard deviation \(\bar{x}=0\). Create a sample data set of size \(n=3\) for which \(\bar{x}=0\) and \(s\) is greater than \(1\).
The sample \(\{-1,0,1\}\) has mean \(\bar{x}=0\) and standard deviation \(\bar{x}=0\). Create a sample data set of size \(n=3\) for which \(\bar{x}=0\) and the standard deviation \(s\) is less than \(1\).
\[5\; -2\; 6\; 1\; 4\; -3\; 0\; 1\; 4\; 3\; 2\; 5\]
\(\text{Large Data Set 1}\) lists the SAT scores and GPAs of \(1,000\) students.
\(\text{Large Data Set 1}\) lists the SAT scores of \(1,000\) students.
\(\text{Large Data Set 7, 7A, and 7B }\) list the survival times in days of \(140\) laboratory mice with thymic leukemia from onset to death.
Statistics is a vital discipline that empowers us to make sense of data by providing tools for collection, analysis, interpretation, and presentation. In every field, from engineering to social sciences, understanding data is crucial for making informed decisions and drawing accurate conclusions . This understanding is facilitated by two key branches of statistics: descriptive and inferential.
Table of Content
Types of statistics:.
Descriptive Statistics
Measures of central tendency, graphical representation, measures of dispersion, applications of descriptive statistics, 2. inferential statistics, uses cases of inferential statistics, hypothesis testing, regression analysis, applications of inferential statistics.
Statistics is a branch of mathematics dealing with the collection, analysis, interpretation, and presentation of masses of numerical data . It is basically a collection of quantitative data.
Statistics is a fundamental branch of mathematics that involves the collection, analysis, interpretation, presentation, and organization of data. statistics is divided into two main branches: descriptive statistics and inferential statistics . These two branches serve different purposes and are used in various fields, including engineering, social sciences, business, and healthcare. This article explores the definitions, characteristics, and applications of both descriptive and inferential statistics.
Descriptive Statistics | Inferential Statistics |
---|---|
It which describes the data in some manner. | It using data drawn from the population. |
It . | It |
It is used to describe a situation. | It is . |
It explains already known data and is limited to a sample or population having a small size. | It attempts to reach the conclusion about the population. |
It etc. | It can be achieved by |
Descriptive statistics is a term given to the analysis of data that helps to describe, show and summarize data in a meaningful way . It is a simple way to describe our data. Descriptive statistics is very important to present our raw data in effective/meaningful way using numerical calculations or graphs or tables . This type of statistics is applied to already known data.
Descriptive statistics involves summarizing and organizing data to describe the main features of a dataset . It provides simple summaries about the sample and the measures. Descriptive statistics is primarily concerned with the presentation of data in a meaningful way, which includes graphical representation and numerical analysis.
Inferential statistics is used to make predictions by taking any group of data in which you are interested . It can be defined as a random sample of data taken from a population to describe and make inferences about the population. Any group of data that includes all the data you are interested in is known as population. It basically allows you to make predictions by taking a small sample instead of working on the whole population.
Descriptive and inferential statistics are essential tools in the field of statistics, each serving distinct but complementary purposes. Descriptive statistics focuses on summarizing and presenting data to highlight its main features, while inferential statistics aims to make predictions and generalizations about a population based on sample data. Understanding and applying these two branches of statistics enables researchers, analysts, and engineers to make informed decisions, draw meaningful conclusions , and advance knowledge in their respective fields.
What is statistics used for.
Statistics is used to analyze data, make informed decisions, predict outcomes, and ensure quality and consistency in various fields such as business, healthcare, and scientific research
hypothesis testing and regression analysis are two main types of inferential statistics
Measures of Central Tendency, Graphical Representation, Measures of Dispersion are some types of descriptive statistics.
Sir Ronald Aylmer Fisher, a British Genius, is widely considered as the father of modern statistics.
Similar reads.
Introduction Statistics is a branch of mathematics. The subject focuses on collection, management, examination, interpretation and demonstration of the data. Statistical analysis consists of two types, descriptive and inferential statistics. The two concepts play a vital role during any statistical analysis. Briefly speaking, through descriptive statistics, huge volumes of data can be analyzed by using charts and tables. The entire data is analyzed in this process to draw conclusions rather than just using samples. On the other hand, during inferential statistics, data is taken from available samples to generate a hypothesis or test the already existing hypothesis. A researcher studies the samples in order to reach at a conclusion about a specific population.
The blog on descriptive and inferential statistics would help you to understand both the concept in detail and would include the following points:
So, let’s sit down and study the above points in details in this blog on descriptive and inferential statistics without wasting any time.
Definition of descriptive statistics Descriptive statistical analysis makes reports and graphs with the help of data visualization software are for companies to understand a particular event or point that occurred in the past. The name itself signifies that the analysis would be in a description form. It does not lead to a concrete conclusion; rather, it helps in describing the data. Descriptive statistics consist of minute constants which help in outlining and guiding the data set. The data set to be analyzed may be complete or based upon a given sample of the population. Measures of central tendency, along with measures of variability or spread, are the two divisions in the descriptive statistics. The below figure demonstrates the raw data details and the descriptive statistics generated in the form charts and tables.
A measure of central tendency means median and mode. When a raw data is provided with the measure of central tendency, it helps set the position of a frequency distribution. As per the experts of descriptive and inferential statistics, central tendency measures are not specific to a particular condition; it is preferred for types of situations and conditions.
Mean: It is used when there is a continuous flow of data, and the other name of the mean is arithmetic average.
Median: It is used in splitting a given data into two halves. One of the parts is smaller in nature than the other part. It can be used when there is a continuous or ordinal flow of data.
Mode: It consists of a large number of data and is basically used for categorical data.
Now, let’s move on to the discussion of groups that are used in the descriptive statistics.
Dispersion: By measuring the dispersion, the data can be extended in the descriptive statistics. An individual can measure the dispersion with the help of deviation and variance.
Central tendency: The mean, mode and median can determine the centre of the data. By using this measure on the data set, a descriptive summary with a single value can be gained.
Skewedness: When there is a distortion in the bell curve, or there is uneven distribution, it is referred to as skewedness. As per the experts of descriptive and inferential statistics, by looking at the distribution of the values, it can be said whether it is symmetrical or skewed.
Definition of inferential statistics Inferential statistics help in drawing conclusions about the larger population basis a sample. It helps in testing different hypothesis related to the given data. To define the term inferential statistics, we first need to understand how the term population is used in statistics. Population in statistics does not necessarily imply that the human population; rather, the term is used for the complete raw data to be analyzed by conducting the descriptive and inferential statistics. Under certain situations, a person may be asked to analyze incomplete data, and in such instances, the person can use the sample data for his analysis work. If in case you want to conduct a study on cancer survivors below the age of 16 years, residing around the globe, you won’t be able to get accurate data. So in such cases, where the total population is not specific or complete, you need to consider a sample data.
When an analysis is to be performed basis a sample population, there are some techniques in inferential statistics applying which inductive reasoning can be generated about the sample population data considered for the analysis. The analyst can reach at a generalized conclusion through the process of inductive reasoning. The analyst can also try to represent a near to accurate result in his analysis by using the sampling process on the given population. Inferential statistics can be majorly used in works related to data science.
But the fact that this sampling process cannot generate an accurate result is true as the sample data used may have some errors or discrepancies which may lead to inaccurate and inconsistent interpretations. It is preferable to use the probability theory while applying inferential statistics.
Methods usually applied in inferential statistics are as follows:
Parameter estimation: The entire raw population has some descriptive estimates known as parameters. When the analysis is to be done on a random population sample, then the process is termed as sample statistics. Through this method, the analyst discovers the estimate of a complete population with a sample’s support, but the estimation generated through this process may not be precise.
Statistical hypothesis testing: The aim of this method is to differentiate basis population or verify the connection between variables with the help of samples. With this method’s help, conclusions can be drawn for the entire population basis a sample population provided.
Regression analysis: It explains the connection between a given set of independent and dependent variables. With the help of hypothesis testing, the analysis determines the existence of a relationship as presented in the sample data.
The distinction between descriptive and inferential statistics Although both descriptive and inferential statistics are used to perform analysis on a given set of data but there lays a huge difference between them in terms process and interpretation of data. The key distinction between descriptive and inferential statistics can be generated on the following points:
When using a descriptive statistics, the analyst has access to the entire raw population data whereas inferential statistics are used by analyst considering some part of the data when the population is too large and cannot be collected or compiled in a single attempt.
Descriptive statistics are used when there is no sampling process requirement, whereas inferential statistics are completely based upon the sampling process and sample parameters.
The descriptive statistics have some properties parameter for the raw population, namely mean, median and mode, whereas in inferential statistics properties parameter for the raw population is termed as statistics.
There are some limitations in descriptive statistics, and it can be applied when the data is actually measured in reality whereas in inferential statistics, sample data from the large population is applied, so there are no such limitations in representing the population data.
It is claimed that descriptive statistics can generate 100 per cent accuracy as it is based upon the complete raw data of the population without any assumptions whereas analysis based on inferential statistics uses sample data and the results can be speculative. There is no guarantee of 100 per cent accuracy in this method as conclusions are drawn based upon some sample population data.
Descriptive statistics help present meaningful data, whereas inferential statistics help compare the data, make hypothesis and predictions.
Descriptive statistics are used in describing a situation, whereas inferential statistics are used to describe the occurrence of any future event.
The descriptive statistics can be explained with the help of graphs, charts and tables, whereas inferential statistics can be explained through probability.
The need for statistical software in analyzing the data Conducting research requires analysis of huge data sets, and this can be easily done with the application of different statistical software’s tools. Application of such software’s smooth’s the process of descriptive and inferential statistics analysis. It brings accuracy and precision in the results of the analysis and saves time. Using the statistical software during the analysis of descriptive and inferential statistics has great advantages as compared to the analysis conducted manually. The below points will help out in understanding the need for statistical software:
Reduction in error during the sampling process: The success of research depends upon the type of analysis performed on a given set of data. If there are any errors in collecting data or during its processing, then the entire analysis can be considered useless and vague. The sampling process may have some errors when there is a deviation between the actual data and the sample data. The larger database can be accessed with the help of statistical software to generate error-free and customized analysis. It reduces manual intervention, thereby reducing the workload on an individual or a group while performing descriptive and inferential statistics. Majority of the software has an automatic feature that nullifies the requirement of revising the data again and again on its usage.
Accurate result and easy solution: With the help of statistical software, complex questions or problems can be solved easily. In case of analyzing a limited data, it would be the best option to look out for. But in the case of large data, the simplification of the solution could be problematic and may lead to inaccurate analysis. Easy solution and accurate results can be achieved with the application of statistical software’s. There are some features in the software that make handling the analysis an easy way out using multivariate analysis, statistical process control and regression analysis. The three features mentioned are few; there are various other features as well as making the task of handling descriptive and inferential statistics an easy one. The features of the statistical software safeguard the data and the results can be easily understood.
Helping business houses: The statistical software’s can be used by businesses to evaluate different areas to their advantage. The advantages may include judging its employees’ performance, finding a changing behavioural pattern of its consumers, driving audits, and understanding sales performance in different locations. With the help of statistical software’s the business can make future predictions and maximize its profits.
Topmost preferable statistical software’s If you search for statistical software’s to conduct a descriptive and inferential statistics analysis, you are sure to come across a variety of them. You need to choose the best among all, and it purely depends upon the type of population data and the results that you are willing to see. The software’s are designed with some special programs that allow its users to feed any amount of data and easily come up with a conclusion that was not manually possible. The software’s are basically used by mathematicians, data scientists and industries. Each tool has a unique feature which sets it apart from the rest and depending upon the feature, and you may choose the one that suits you best. Some of the preferable statistical software’s for the analysis of descriptive and inferential statistics have been discussed below:
IBM SPSS Statistics: Industries use the software to solve business-specific issues and reach a correct decision. It has some customized features which can be seen while generating graphs and reports. The software can help generate probabilities, make predictions for future events, plan activities for the benefit of the community, and fulfil objectives and goals.
R Studio: The tool has been created for statistical and data science computing. Individuals and teams can use it at the same time. The resources can be shared in order to generate results to be used by the decisions makers in the organization. With the help of this software assignment related to R, language can be graphically represented. The data can be automatically imported in this software, and it makes navigation within the source file an easy process without inserting any line of code. The plots and the commands generated are efficiently managed through this software.
Stata: It is a tool for the management of all types of data, used in descriptive and inferential statistics analysis, maintaining high-resolution graphics, etc. The software has a simple interface, the help section of the manual explains in details about the commands with support from a wide community. The navigation process is much easier, graphs generated can be used while giving presentations, and the analysis generated is user friendly. To use the software appropriately, one needs to know graphical interpretations and usage of regression and standard errors.
JMP: The tool uses robust statistics and dynamic graphics to present an analysis of the data fed in the memory or in the computer. The software’s interactive and visual feature provides better insights about the data that could not be gained from static graphs or raw number tables. Any amount of statistical problem can be solved with the usage of this software. During the process of descriptive and inferential statistics analysis, the user does not face any problem in handling the software due to its easy interface.
Minitab 18: It is software with different tools that the users can use to analyze data and find solutions to different business problems. Data interpretations are made without any interruptions, and presentations are effortless. One can discover different other advantages by using this software. It has an extraordinary user interface and can be easily located. The tools can be used upon their categorization. When you are stuck at any point in time, you can use the help feature of the software by a right-click, and you will get instant help in the form of step by step guidance.
KNIME analytics platform: It is an overall solution for all data-driven discoveries, helping the users in discovering the prospects of hidden data, generating new insights and making future predictions. It consists of many modules, examples and other tools that can help analyze descriptive and inferential statistics. It checks the workflow, provides mathematical and statistical solutions, generates and predicts algorithms for machine learning. The platform can evaluate a large amount of data involving algorithms and codes with the help of modules. It does not require any programming to perform graphical jobs.
Origin Pro: It is easy to learn and friendly software that helps in data analysis and is able to publish customized quality graphs as per the needs of engineers and scientists. People using this software can engineer operations related to importing, analysis and graphing from the graphical user interface. When there is a change in the data, the graphs, analysis and the reports automatically get updated. The best feature of this software is its customer service which provides quick solutions. The graphics generated through the software is very professional and visually appealing.
NumXL: The software is considered as different from others basis its time series and Excel add-ins. The two features change the Microsoft Excel into an econometrics tool and extraordinary time series software. It provides accuracy in descriptive and inferential statistics along with some shortcuts, which can take you with the entire process. All data can be easily adjusted with the help of add-ins. Customer support is one of its best parts in case you are stuck at any point in time.
SAS or STAT: The software adjusts itself with any type of data being fed in it. It has some techniques that can interpret smaller data sets; some tools can interpret larger data sets by applying statistical modelling tools. The software can also interpret data having incomplete values by the application of modern methods. There are regular updates with different statistical methods and statistical procedures that can be used at any time. It helps in managing the codes and the macros by itself when one does not have enough time to write down the codes in a detailed manner.
SAS base: It is programming language software providing an interface which is purely web-based. Certain programs can be used instantly for data manipulation, data storage and recovery, descriptive and inferential statistics, and reporting. The feature of cross-platform and multi-platform support can be found on this tool. The tool is streamlined, and there are no frills. You can insert your data or write your codes, and the tool will run the data and provide you with the result. You can either do the analysis yourself or pass the result to another program for further interpretations. The tool acts immediately once everything has been put in the right place.
Conducting a statistical analysis with the help Totalassignmenthelp.com If you are tasked with the analysis of descriptive and inferential statistics, statistic assignment help from total assignment help would be the best option to get help. Experts working on statistics assignment take help from different statistical software to generate quick and accurate results on any amount of data. The interpretations provided by the experts cannot be challenged at any platform. You can contact us for more information’s related to descriptive and inferential statistics. We will help you in solving your statistical problems and other assignment related queries. Along with assignment writing services, a student can avail the below benefits:
Quality work and on-time delivery
Extended revisions
Affordable prices
Non-stop offers on new assignments.
Original content
Quick refund
Immediate service
If you want to verify the quality of our descriptive and inferential statistics assignment or you want check the type of work done by us, please visit our official website where you can search for sample articles related to the field you are having a problem with.
Total Assignment Help Incase, you are looking for an opportunity to work from home and earn big money. TotalAssignmenthelp Affiliate program is the best choice for you.
Do visit : https://www.totalassignmenthelp.com/affiliate-program for more details
Total Assignment help is an online assignment help service available in 9 countries. Our local operations span across Australia, US, UK, South east Asia and the Middle East. With extensive experience in academic writing, Total assignment help has a strong track record delivering quality writing at a nominal price that meet the unique needs of students in our local markets.
We have specialized network of highly trained writers, who can provide best possible assignment help solution for all your needs. Next time you are looking for assignment help, make sure to give us a try.
Get the best Assignment Help from leading experts from the field of academics with assured onetime, 100% plagiarism free and top Quality delivery.
Related posts.
Comments are closed.
IMAGES
COMMENTS
A good exploratory tool for descriptive statistics is the five-number summary, which presents a set of distributional properties for your sample.. Related post: Analyzing Descriptive Statistics in Excel. Inferential Statistics. Inferential statistics takes data from a sample and makes inferences about the larger population from which the sample was drawn.
Descriptive statistics use summary statistics, graphs, and tables to describe a data set. This is useful for helping us gain a quick and easy understanding of a data set without pouring over all of the individual data values. Inferential statistics use samples to draw inferences about larger populations.
4. Types of Descriptive Statistics with Examples. Measures of Central Tendency:These provide insights into the central point of a dataset.. Mean (Average): The sum of all values divided by the number of values. Example: For a dataset of ages (23, 25, 26, 29, 30), the mean age is $ \frac{23+25+26+29+30}{5} = 26.6 $ years. Median: The middle value in an ordered dataset.
Inferential Statistics | An Easy Introduction & Examples. Published on September 4, 2020 by Pritha Bhandari.Revised on June 22, 2023. While descriptive statistics summarize the characteristics of a data set, inferential statistics help you come to conclusions and make predictions based on your data. When you have collected data from a sample, you can use inferential statistics to understand ...
For more descriptive statistics, consider Table 7.2.2.2 7.2.2. 2. It shows the number of unmarried men per 100 unmarried women in U.S. Metro Areas in 1990. From this table we see that men outnumber women most in Jacksonville, NC, and women outnumber men most in Sarasota, FL.
Here, we introduce descriptive statistics using examples and discuss the difference between descriptive and inferential statistics. We also talk about samples and populations, explain how you can identify biased samples, and define differential statistics. ... Random assignment is critical for the validity of an experiment. For example ...
NOTE: Descriptive statistics summarize data to make sense or meaning of a list of numeric values. Transition from descriptive to inferential statistics (Chapters 6-7) Inferential Statistics (Chapters 8-18) Statistics Descriptive Statistics (Chapters 2-5) FIGURE 1.1 A general overview of this book. This book begins with an introduction to ...
Organizing and summarizing data is called descriptive statistics. Two ways to summarize data are by graphing and by using numbers (for example, finding an average). After you have studied probability and probability distributions, you will use formal methods for drawing conclusions from "good" data.
Inferential statistics use data from a sample to answer questions about a population. Inferential statistics involves generalizing beyond the data at hand. Descriptive statistics are numbers that are used to summarize and describe data. Predicting next month's unemployment rate involves predicting future data, no describing the data at hand.
Each of these segments is important, offering different techniques that accomplish different objectives. Descriptive statistics describe what is going on in a population or data set. Inferential statistics, by contrast, allow scientists to take findings from a sample group and generalize them to a larger population.
Unit 7: Assignment #2 (due before 11:59 pm Central on MON JUL 8): To become familiar with some of the ways that descriptive and inferential statistics can be used to deceive people, read Chapters 2 through 6 of (a slender!) book titled How to Lie with Statistics by Darrell Huff. NOTE: This book was published in 1954; therefore, the examples are ...
Descriptive and inferential statistics, although distinct in their purposes and approaches, exhibit some similarities: 1. Data Utilization: Both descriptive and inferential statistics utilize the same dataset. Descriptive statistics summarize this data, whereas inferential statistics use it to draw broader conclusions about a larger population. 2.
Inferential statistics. We have seen that descriptive statistics are useful in providing an initial way to describe, summarize, and interpret a set of data. They are limited in usefulness because they tell us nothing about how meaningful the data are. The second step in analyzing data requires inferential statistics.
Example 3: Find the z score using descriptive and inferential statistics for the given data. Population mean 100, sample mean 120, population variance 49 and size 10. Solution: Inferential statistics is used to find the z score of the data. The formula is given as follows: z = x−μ σ x − μ σ. Standard deviation = √49 49 = 7.
Types of descriptive statistics. There are 3 main types of descriptive statistics: The distribution concerns the frequency of each value. The central tendency concerns the averages of the values. The variability or dispersion concerns how spread out the values are. You can apply these to assess only one variable at a time, in univariate ...
There are 5 modules in this course. This course covers commonly used statistical inference methods for numerical and categorical data. You will learn how to set up and perform hypothesis tests, interpret p-values, and report the results of your analysis in a way that is interpretable for clients or the public.
Unit 7: How to Evaluate Descriptive and Inferential Statistics. Unit 7: Assignment #1 (due before 11:59 pm Central on Wednesday September 29): To review what descriptive and inferential statistics are, why they are important to learn, and examples of how they are used: Watch Lynda.com's (2010) video, " Understanding Descriptive and ...
Descriptive statistics provide insights into the features of the observed data, while inferential statistics extend these findings to make predictions or draw conclusions about a broader population. Application: Descriptive and inferential statistics are widely applied across various fields, including science, business, economics, social ...
For example, if you have ten items in your data set, type them into cells A1 through A10. Step 2: Click the "Data" tab and then click "Data Analysis" in the Analysis group. Step 3: Highlight "Descriptive Statistics" in the pop-up Data Analysis window. Step 4: Type an input range into the "Input Range" text box.
It involves organizing, visualizing, and summarizing raw data to create a coherent picture. The primary goal of descriptive statistics is to provide a clear and concise overview of the data's main features. This helps us identify patterns, trends, and characteristics within the data set without making broader inferences.
This page titled 2.E: Descriptive Statistics (Exercises) is shared under a license and was authored, remixed, and/or curated by via that was edited to the style and standards of the LibreTexts platform. These are homework exercises to accompany the Textmap created for "Introductory Statistics" by Shafer and Zhang.
Descriptive Statistics Inferential Statistics; It gives information about raw data which describes the data in some manner.: It makes inferences about the population using data drawn from the population.: It helps in organizing, analyzing, and to present data in a meaningful manner.: It allows us to compare data, and make hypotheses and predictions.: It is used to describe a situation.
Statistics is a branch of mathematics. The subject focuses on collection, management, examination, interpretation and demonstration of the data. Statistical analysis consists of two types, descriptive and inferential statistics. The two concepts play a vital role during any statistical analysis. Briefly speaking, through descriptive statistics ...