Docsity
Docsity

Prepare for your exams
Prepare for your exams

Study with the several resources on Docsity


Earn points to download
Earn points to download

Earn points by helping other students or get them with a premium plan


Guidelines and tips
Guidelines and tips

Introduction to Statistics: Data Collection, Presentation, and Analysis, Study notes of Statistics

Our "Basic Statistics Notes" offer a clear and concise guide to essential statistical concepts, data collection methods, and effective data presentation. Starting with fundamental terms, these notes explain key concepts such as descriptive vs. inferential statistics, population vs. sample, and measures of central tendency (mean, median, mode) and dispersion (range, variance). They cover various data collection techniques, including sampling methods and survey design, ensuring accurate data gathering. Finally, the notes detail how to present data using charts and graphs like bar charts, histograms, and line graphs, equipping you with the skills to visualize and communicate statistical findings effectively. Ideal for students and professionals alike, these notes provide a solid foundation in understanding and applying basic statistical principles.

Typology: Study notes

2023/2024

Available from 08/14/2024

reene-cags
reene-cags 🇵🇭

2 documents

1 / 2

Toggle sidebar

This page cannot be seen from the preview

Don't miss anything!

bg1
@reenotes
INTRODUCTION TO STATISTICS
L1| Introduction to Statistics
statistics
- (plural form) data or numerical computations are derived
from a set of sample data
- (singular form, more general definition) branch of science
that deals with developing methods for a more effective
way of collecting, organizing, presenting, and analyzing
data
statistical methods existing procedures and techniques used
from the collection of data to the proper presentation and
analysis of results
statistical theory development of the formulas used in the
computation and development of scientific procedures that
constitute the basis of statistical methods
data
- basic element of statistical analysis
- expressed either in numerical value or description by its
quality or kind
- 2 types of data:
- quantitative data expressed in numbers
(measured or counted)
- age, height, weight, income, number of
students
- qualitative data expressed in categories or kind
- gender, educational attainment, civil status,
vehicle plate numbers
population data
- collection of all the units from where data is collected
- unit in the population is called element
sample data
- subset of the population
- listing of all the elements is called frame
census
- info is gathered for all units in the population
- complete enumeration of all the units in the population
- expensive and time-consuming
- (PH census that occurs every 10 years)
sampling only a part of the population is used to obtain data
parameter numerical measure computed from the population
statistic numerical measure computed from the sample
two major areas of statistics:
1. descriptive
- summary calculations, graphical and tabular
displays, and describing important features of a set
of data
2. inductive or inferential
- making generalizations for the population based on
the information drawn from the sample
variables characteristics or properties measured from
objects, persons, or things
- two types:
1. discrete counted, whole numbers
2. continuous measured, decimal numbers
four scales of measurement
1. nominal lowest form, identification only
2. ordinal has both identity and order
3. interval identity, order, and equality of scale
4. ratio identity, order, equality of scale, and absolute zero
absolute zero nothing of the characteristic that is being
measured
symbols used
1. summation compact way to write the sum of the set of
variables
2. factorial compact way of writing the product of a
sequence of positive integers
L2| Data Collection
data collection
- collected directly through surveys or experiments
- collected indirectly from existing records and documents
which can be written, printed, or digital
methods
- popular method is through surveys
- can be done through interviews, telephone,
questionnaires, or observations
1. Interviews
- in-person interviews allow researchers to collect
more information
- telephone interviews, while less costly, restrict the
sample to those who only have phones and with
free schedules
2. Questionnaires
- can only be used when respondents are available
and willing to participate as subject
- can be mailed or handed personally
- reliability and validity of data collected depends on
the respondent’s memories and forthrightness
- low and differential response rate that leads to
loss of information
3. Observation
- researcher directly observes rather than relying on
respondent’s memory or truthfulness
- may use videotapes, audio tapes, or other data
collection equipments in combination to collect
observational data
- time-consuming, samples are small and
unrepresentative of the population, and there may
be observation errors
4. Records
- utilization of existing records
- economical and requires less cooperation from
those who dislike interviews and questionnaires
- some information needed may not be found, can
be unpublished or published
sampling methods
1. probability
a. simple random sampling
- used when homogeneous population is not
large and frame is available
- selection of samples is done where sample
size and has an equal probability of being
selected or chance of being in the sample
- steps: make a list, assign a sequential number,
choose sample size, use random number
generator
b. systematic sampling
- equal probability without being dependent on
the frame
- elements are assigned a number from 1 to N
and interval is determined by taking the ratio of
N to sample size n. random number is
selected from a list or sequential files 1 to k
(random start), unit assigned is then included
in the sample
c. stratified sampling
- extension of simple random sampling; divides
the entire population into strata then probability
samples are selected in each stratum
- simple random sampling is used in selecting
samples in each stratum
- used when the population is extremely
heterogeneous
- improves the quality of inferences made esp.
when strata formed has units that are
homogeneous as possible
pf2

Partial preview of the text

Download Introduction to Statistics: Data Collection, Presentation, and Analysis and more Study notes Statistics in PDF only on Docsity!

@reenotes

INTRODUCTION TO STATISTICS

L1| Introduction to Statistics

statistics

  • (plural form) data or numerical computations are derived from a set of sample data
  • (singular form, more general definition) branch of science that deals with developing methods for a more effective way of collecting, organizing, presenting, and analyzing data statistical methods – existing procedures and techniques used from the collection of data to the proper presentation and analysis of results statistical theory – development of the formulas used in the computation and development of scientific procedures that constitute the basis of statistical methods data
  • basic element of statistical analysis
  • expressed either in numerical value or description by its quality or kind
  • 2 types of data:
  • quantitative data – expressed in numbers (measured or counted)
  • age, height, weight, income, number of students
  • qualitative data – expressed in categories or kind
  • gender, educational attainment, civil status, vehicle plate numbers population data
  • collection of all the units from where data is collected
  • unit in the population is called element sample data
  • subset of the population
  • listing of all the elements is called frame census
  • info is gathered for all units in the population
  • complete enumeration of all the units in the population
  • expensive and time-consuming
  • (PH census that occurs every 10 years) sampling – only a part of the population is used to obtain data parameter – numerical measure computed from the population statistic – numerical measure computed from the sample two major areas of statistics:
  1. descriptive
  • summary calculations, graphical and tabular displays, and describing important features of a set of data
  1. inductive or inferential
  • making generalizations for the population based on the information drawn from the sample variables – characteristics or properties measured from objects, persons, or things
  • two types:
  1. discrete – counted, whole numbers
  2. continuous – measured, decimal numbers four scales of measurement
  3. nominal – lowest form, identification only
  4. ordinal – has both identity and order
  5. interval – identity, order, and equality of scale
  6. ratio – identity, order, equality of scale, and absolute zero absolute zero – nothing of the characteristic that is being measured symbols used
  7. summation – compact way to write the sum of the set of variables
  8. factorial – compact way of writing the product of a sequence of positive integers

L2| Data Collection

data collection

  • collected directly through surveys or experiments
  • collected indirectly from existing records and documents which can be written, printed, or digital methods
  • popular method is through surveys
  • can be done through interviews, telephone, questionnaires, or observations
  1. Interviews
  • in-person interviews allow researchers to collect more information
  • telephone interviews, while less costly, restrict the sample to those who only have phones and with free schedules
  1. Questionnaires
  • can only be used when respondents are available and willing to participate as subject
  • can be mailed or handed personally
  • reliability and validity of data collected depends on the respondent’s memories and forthrightness
  • low and differential response rate that leads to loss of information
  1. Observation
  • researcher directly observes rather than relying on respondent’s memory or truthfulness
  • may use videotapes, audio tapes, or other data collection equipments in combination to collect observational data
  • time-consuming, samples are small and unrepresentative of the population, and there may be observation errors
  1. Records
  • utilization of existing records
  • economical and requires less cooperation from those who dislike interviews and questionnaires
  • some information needed may not be found, can be unpublished or published sampling methods
  1. probability a. simple random sampling
  • used when homogeneous population is not large and frame is available
  • selection of samples is done where sample size and has an equal probability of being selected or chance of being in the sample
  • steps: make a list, assign a sequential number, choose sample size, use random number generator b. systematic sampling
  • equal probability without being dependent on the frame
  • elements are assigned a number from 1 to N and interval is determined by taking the ratio of N to sample size n. random number is selected from a list or sequential files 1 to k (random start), unit assigned is then included in the sample c. stratified sampling
  • extension of simple random sampling; divides the entire population into strata then probability samples are selected in each stratum
  • simple random sampling is used in selecting samples in each stratum
  • used when the population is extremely heterogeneous
  • improves the quality of inferences made esp. when strata formed has units that are homogeneous as possible

@reenotes

INTRODUCTION TO STATISTICS

d. cluster sampling

  • used when units of the population are naturally grouped “clusters”
  • selection of random sample of clusters then clusters are subjected to complete enumeration
  1. non-probability a. haphazard or accidental sampling
  • nonstatistical, unsystematic selection of sample units
  • approximates random sampling by selecting sample items without conscious bias and specific reason for including or excluding items b. convenience sampling
  • respondents are sampled just because they are convenient sources c. volunteer sampling
  • participants self-select to be part of a study because they volunteer when asked d. purposive sampling
  • judgmental, selective, subjective sampling
  • researchers rely on their own judgment when choosing sample e. quota sampling
  • sample units are picked for convenience but the number of people to interview (quota) is given to interviewers
  • used in market research

L3 | Presentation of Data

two methods:

  1. tabular a. percentage or frequency table - data is categorical in nature b. cross tabulation or contingency table - data is categorical in nature - lists the frequencies for different value combinations of two categorical variables c. frequency distribution table - data is numerical in nature - groups all observations into intervals/classes with a count of the number of observations that fall in that interval/class d. stem and leaf plot - groups numerical data into intervals without any loss of information
  2. graphical a. bar chart - graph where the different classes of the frequency table are represented by rectangles or bars - obtained by plotting the class intervals (x) and class frequency (y) b. frequency histogram - graph that looks like the bar chart - obtained by plotting class boundaries (x) and class frequencies (y) c. line chart or frequency polygon
  • constructed by plotting class marks/midpoints (x) and class frequencies (y) d. frequency ogive
  • plotting upper class boundaries (x) and less than cumulative frequency (y) e. box and whisker plot
  • pictorial representation of data distribution
  • upper and lower boundaries of the box mark represent the upper and lower quartiles
  • line identifies the median f. pie chart
  • categorical variables
  • circle divided into pie-shaped sectors or pizza slices g. pictograph – more dramatic and lively appearance