Using real-world evidence to investigate clinical research questionsKarin Verspoor
Adoption of electronic health records to document extensive clinical information brings with it the opportunity to utilise that information to support clinical research, and ultimately to support clinical decision making. In this talk, I discuss both these opportunities and the challenges that we face when working with real-world clinical data, and introduce some of the strategies that we are adopting to make this data more usable, and to extract more value from it. I specifically discuss the use of natural language processing to transform clinical documentation into structured data for this purpose.
Define the terms: Statistics and Biostatistics
Discuss the importance of Biostatistics
Differentiate between Population & Sample, Parameter & Statistics
Identify the various sources of data collection
Explain the types of variables
Explore the different types of Measurement scales
Methods of Presenting the data
Tabular Presentation
Textual Presentation
Graphical Presentation
Statistics
Collection, Classification, Organization, Summarization, analysis, Presentation, and Interpretation of the data / information.
Biostatistics
Collection, Classification, Organization, Summarization, Presentation, and Interpretation of the data / information.
If related to Biological or Health sciences called “Biostatistics”
Why do we need to study Biostatistics course?
To learn how to deal with numbers.
To assess evidence from different studies.
To understand published scientific papers.
To do research and write papers in journals.
Population
The set of all the measurements of interest to the investigator.
Monthly income of households in Pakistan
Number of TB Patients in Pakistan
Sample
It is a group of subjects selected from a population
A random sample is a good representative of population
Example
A survey of 1,000 households taken from all parts of Pakistan to assess their monthly income
Parameter
– The characteristics of interest to the researcher in the population is called a parameter.
E.g. average household size and percent of households with modern sanitation as reported in the 1998 census of Karachi
Statistic
– The characteristics of interest to the researcher in the sub-set of population is called a statistic.
E.g. average household size and percent of households as reported from a sample survey of 6,000 households in Karachi, 2010
Descriptive Statistic :
Consists of the collection, organization, summarization and presentation of data.
Inferential Statistic :
Consists of generalizing from samples to populations, performing estimations and hypothesis tests, determining relationships among variables
A Variable is simply what is being observed or measured
The dependent variable is the outcome of interest
The independent variable is the intervention or what is being manipulated
Data
The set of values collected for the variable of each of the elements belonging to the sample
Qualitative Variable:
Variables that can be placed into distinct categories, according to some characteristic or attribute.
Quantitative variables
That have are measured on a numeric
or quantitative scale. Interval and ratio scales are quantitative
Nominal Scale
- It is the first level of measurement
- Named variables
Ordinal Scale
-Data measured at this level can be placed into categories, and these categories can be ordered, or ranked.
Interval scale:
Differences between values have meaning.
Ordered with proportionate difference between variables
Arbitrary Zero (0 will have a meaning)
Ratio scale:
Differences between values have meaning. Absolute Zero (absence)
Paul Aylin, Co-Director of the Dr Foster Unit at Imperial College London, gives concrete examples of using a specific statistical model for monitoring care quality, cumulative sum (CUSUM).
Using real-world evidence to investigate clinical research questionsKarin Verspoor
Adoption of electronic health records to document extensive clinical information brings with it the opportunity to utilise that information to support clinical research, and ultimately to support clinical decision making. In this talk, I discuss both these opportunities and the challenges that we face when working with real-world clinical data, and introduce some of the strategies that we are adopting to make this data more usable, and to extract more value from it. I specifically discuss the use of natural language processing to transform clinical documentation into structured data for this purpose.
Define the terms: Statistics and Biostatistics
Discuss the importance of Biostatistics
Differentiate between Population & Sample, Parameter & Statistics
Identify the various sources of data collection
Explain the types of variables
Explore the different types of Measurement scales
Methods of Presenting the data
Tabular Presentation
Textual Presentation
Graphical Presentation
Statistics
Collection, Classification, Organization, Summarization, analysis, Presentation, and Interpretation of the data / information.
Biostatistics
Collection, Classification, Organization, Summarization, Presentation, and Interpretation of the data / information.
If related to Biological or Health sciences called “Biostatistics”
Why do we need to study Biostatistics course?
To learn how to deal with numbers.
To assess evidence from different studies.
To understand published scientific papers.
To do research and write papers in journals.
Population
The set of all the measurements of interest to the investigator.
Monthly income of households in Pakistan
Number of TB Patients in Pakistan
Sample
It is a group of subjects selected from a population
A random sample is a good representative of population
Example
A survey of 1,000 households taken from all parts of Pakistan to assess their monthly income
Parameter
– The characteristics of interest to the researcher in the population is called a parameter.
E.g. average household size and percent of households with modern sanitation as reported in the 1998 census of Karachi
Statistic
– The characteristics of interest to the researcher in the sub-set of population is called a statistic.
E.g. average household size and percent of households as reported from a sample survey of 6,000 households in Karachi, 2010
Descriptive Statistic :
Consists of the collection, organization, summarization and presentation of data.
Inferential Statistic :
Consists of generalizing from samples to populations, performing estimations and hypothesis tests, determining relationships among variables
A Variable is simply what is being observed or measured
The dependent variable is the outcome of interest
The independent variable is the intervention or what is being manipulated
Data
The set of values collected for the variable of each of the elements belonging to the sample
Qualitative Variable:
Variables that can be placed into distinct categories, according to some characteristic or attribute.
Quantitative variables
That have are measured on a numeric
or quantitative scale. Interval and ratio scales are quantitative
Nominal Scale
- It is the first level of measurement
- Named variables
Ordinal Scale
-Data measured at this level can be placed into categories, and these categories can be ordered, or ranked.
Interval scale:
Differences between values have meaning.
Ordered with proportionate difference between variables
Arbitrary Zero (0 will have a meaning)
Ratio scale:
Differences between values have meaning. Absolute Zero (absence)
Paul Aylin, Co-Director of the Dr Foster Unit at Imperial College London, gives concrete examples of using a specific statistical model for monitoring care quality, cumulative sum (CUSUM).
A chi-squared test (χ2) is basically a data analysis on the basis of observations of a random set of variables. Usually, it is a comparison of two statistical data sets. This test was introduced by Karl Pearson in 1900 for categorical data analysis and distribution. So, it was mentioned as Pearson’s chi-squared test.
The Roman Empire A Historical Colossus.pdfkaushalkr1407
The Roman Empire, a vast and enduring power, stands as one of history's most remarkable civilizations, leaving an indelible imprint on the world. It emerged from the Roman Republic, transitioning into an imperial powerhouse under the leadership of Augustus Caesar in 27 BCE. This transformation marked the beginning of an era defined by unprecedented territorial expansion, architectural marvels, and profound cultural influence.
The empire's roots lie in the city of Rome, founded, according to legend, by Romulus in 753 BCE. Over centuries, Rome evolved from a small settlement to a formidable republic, characterized by a complex political system with elected officials and checks on power. However, internal strife, class conflicts, and military ambitions paved the way for the end of the Republic. Julius Caesar’s dictatorship and subsequent assassination in 44 BCE created a power vacuum, leading to a civil war. Octavian, later Augustus, emerged victorious, heralding the Roman Empire’s birth.
Under Augustus, the empire experienced the Pax Romana, a 200-year period of relative peace and stability. Augustus reformed the military, established efficient administrative systems, and initiated grand construction projects. The empire's borders expanded, encompassing territories from Britain to Egypt and from Spain to the Euphrates. Roman legions, renowned for their discipline and engineering prowess, secured and maintained these vast territories, building roads, fortifications, and cities that facilitated control and integration.
The Roman Empire’s society was hierarchical, with a rigid class system. At the top were the patricians, wealthy elites who held significant political power. Below them were the plebeians, free citizens with limited political influence, and the vast numbers of slaves who formed the backbone of the economy. The family unit was central, governed by the paterfamilias, the male head who held absolute authority.
Culturally, the Romans were eclectic, absorbing and adapting elements from the civilizations they encountered, particularly the Greeks. Roman art, literature, and philosophy reflected this synthesis, creating a rich cultural tapestry. Latin, the Roman language, became the lingua franca of the Western world, influencing numerous modern languages.
Roman architecture and engineering achievements were monumental. They perfected the arch, vault, and dome, constructing enduring structures like the Colosseum, Pantheon, and aqueducts. These engineering marvels not only showcased Roman ingenuity but also served practical purposes, from public entertainment to water supply.
This is a presentation by Dada Robert in a Your Skill Boost masterclass organised by the Excellence Foundation for South Sudan (EFSS) on Saturday, the 25th and Sunday, the 26th of May 2024.
He discussed the concept of quality improvement, emphasizing its applicability to various aspects of life, including personal, project, and program improvements. He defined quality as doing the right thing at the right time in the right way to achieve the best possible results and discussed the concept of the "gap" between what we know and what we do, and how this gap represents the areas we need to improve. He explained the scientific approach to quality improvement, which involves systematic performance analysis, testing and learning, and implementing change ideas. He also highlighted the importance of client focus and a team approach to quality improvement.
Synthetic Fiber Construction in lab .pptxPavel ( NSTU)
Synthetic fiber production is a fascinating and complex field that blends chemistry, engineering, and environmental science. By understanding these aspects, students can gain a comprehensive view of synthetic fiber production, its impact on society and the environment, and the potential for future innovations. Synthetic fibers play a crucial role in modern society, impacting various aspects of daily life, industry, and the environment. ynthetic fibers are integral to modern life, offering a range of benefits from cost-effectiveness and versatility to innovative applications and performance characteristics. While they pose environmental challenges, ongoing research and development aim to create more sustainable and eco-friendly alternatives. Understanding the importance of synthetic fibers helps in appreciating their role in the economy, industry, and daily life, while also emphasizing the need for sustainable practices and innovation.
How to Create Map Views in the Odoo 17 ERPCeline George
The map views are useful for providing a geographical representation of data. They allow users to visualize and analyze the data in a more intuitive manner.
Read| The latest issue of The Challenger is here! We are thrilled to announce that our school paper has qualified for the NATIONAL SCHOOLS PRESS CONFERENCE (NSPC) 2024. Thank you for your unwavering support and trust. Dive into the stories that made us stand out!
How to Split Bills in the Odoo 17 POS ModuleCeline George
Bills have a main role in point of sale procedure. It will help to track sales, handling payments and giving receipts to customers. Bill splitting also has an important role in POS. For example, If some friends come together for dinner and if they want to divide the bill then it is possible by POS bill splitting. This slide will show how to split bills in odoo 17 POS.
Instructions for Submissions thorugh G- Classroom.pptxJheel Barad
This presentation provides a briefing on how to upload submissions and documents in Google Classroom. It was prepared as part of an orientation for new Sainik School in-service teacher trainees. As a training officer, my goal is to ensure that you are comfortable and proficient with this essential tool for managing assignments and fostering student engagement.
The Indian economy is classified into different sectors to simplify the analysis and understanding of economic activities. For Class 10, it's essential to grasp the sectors of the Indian economy, understand their characteristics, and recognize their importance. This guide will provide detailed notes on the Sectors of the Indian Economy Class 10, using specific long-tail keywords to enhance comprehension.
For more information, visit-www.vavaclasses.com
Home assignment II on Spectroscopy 2024 Answers.pdf
statistics.ppt
1. Faculty of Medicine
Introduction to Community Medicine Course
(31505201)
Introduction to
Statistics and Demography
By
Hatim Jaber
MD MPH JBCM PhD
27+29 - 11- 2016
1
2. World AIDS Day 2016:
end AIDS by 2030
• People living with HIV 36.7
million
• People on antiretroviral
therapy 18.2 million
• Mother-to-child
transmission 7 out of 10
2
5. Presentation outline
Time
Introduction and Definitions of Statistics and
biostatistics
12:00 to 12:10
Role of Statistics in Clinical Medicine 12:10 to 12:20
Basic concepts 12:20 to 12:30
Methods of presentation of data 12:30 to 12:40
12:40 to 12:50
5
7. Definition of Statistics
• Different authors have defined statistics differently. The best
definition of statistics is given by Croxton and Cowden according to
whom statistics may be defined as the science, which
deals with collection, presentation, analysis
and interpretation of numerical data.
• The science and art of dealing with variation in data through collection,
classification, and analysis in such a way as to obtain reliable
results. —(John M. Last, A Dictionary of Epidemiology )
• Branch of mathematics that deals with the collection, organization,
and analysis of numerical data and with such problems as
experiment design and decision making. —(Microsoft
Encarta Premium 2009)
7
8. Definition of Biostatistics= Medical
statistics
• Biostatistics may be defined as application of
statistical methods to medical, biological
and public health related problems.
• It is the scientific treatment given to the medical
data derived from group of individuals or patients
Collection of data.
Presentation of the collected data.
Analysis and interpretation of the results.
Making decisions on the basis of such analysis 8
9. Role of Statistics in Clinical Medicine
The main theory of statistics lies in the term variability.
There is No two individuals are same. For example, blood
pressure of person may vary from time to time as well as from
person to person.
We can also have instrumental variability as well as
observers variability.
Methods of statistical inference provide largely objective means
for drawing conclusions from the data about the issue under
study. Medical science is full of uncertainties and statistics deals
with uncertainties. Statistical methods try to quantify the
uncertainties present in medical science.
It helps the researcher to arrive at a scientific judgment about
a hypothesis. It has been argued that decision making is an
integral part of a physician’s work.
Frequently, decision making is probability based. 9
10. Role of Statistics in
Public Health and Community Medicine
Statistics finds an extensive use in Public Health and Community Medicine.
Statistical methods are foundations for public health administrators to
understand what is happening to the population under their care at community
level as well as
individual level. If reliable information regarding the disease is available, the
public health administrator is in a position to:
●● Assess community needs
●● Understand socio-economic determinants of health
●● Plan experiment in health research
●● Analyze their results
●● Study diagnosis and prognosis of the disease for taking
effective action
●● Scientifically test the efficacy of new medicines and
methods of treatment.
10
11. Why we need to study Medical Statistics?
Three reasons:
(1) Basic requirement of medical research.
(2) Update your medical knowledge.
(3) Data management and treatment.
11
12. Role of statisticians
To guide the design of an experiment or survey prior to
data collection
To analyze data using proper statistical procedures and
techniques
To present and interpret the results to researchers and
other decision makers
12
13. I. Basic concepts
• Homogeneity: All individuals have similar values or
belong to same category.
Example: all individuals are Chinese, women, middle age (30~40
years old), work in a computer factory ---- homogeneity in nationality,
gender, age and occupation.
• Variation: the differences in feature, voice…
• Throw a coin: The mark face may be up or down ---- variation!
• Treat the patients suffering from pneumonia with same antibiotics:
A part of them recovered and others didn’t ---- variation!
• If there is no variation, there is no need for statistics.
• Many examples of variation in medical field: height, weight, pulse,
blood pressure, … …
13
14. 2. Population and Sample
• Population: The whole collection of individuals that
one intends to study.
• Sample: A representative part of the population.
• Randomization: An important way to make the
sample representative.
14
15. limited population and limitless population
• All the cases with hepatitis B collected in a hospital
in Amman . (limited)
• All the deaths found from the permanent residents
in a city. (limited)
• All the rats for testing the toxicity of a medicine.
(limitless)
• All the patients for testing the effect of a medicine.
(limitless) hypertensive, diabetic, …
15
16. Random
By chance!
• Random event: the event may occur or may not
occur in one experiment.
Before one experiment, nobody is sure whether
the event occurs or not.
Example: weather, traffic accident, …
There must be some regulation in a large number
of experiments.
16
17. 3. Probability
• Measure the possibility of occurrence of a random
event.
• A : random event
• P(A) : Probability of the random event A
P(A)=1, if an event always occurs.
P(A)=0, if an event never occurs.
17
18. Estimation of Probability----Frequency
• Number of observations: n (large enough)
Number of occurrences of random event A: m
f(A) m/n
(Frequency or Relative frequency)
Example: Throw a coin event:
n=100, m (Times of the mark face occurred)=46
m/n=46%, this is the frequency; P(A)=1/2=50%,
this is the Probability.
18
19. 4. Parameter and Statistic
• Parameter : A measure of population or
A measure of the distribution of population.
Parameter is usually presented by Greek letter.
such as μ,π,σ.
-- Parameters are unknown usually
To know the parameter of a population, we need a sample
• Statistic: A measure of sample or A measure of the distribution of sample.
Statistic is usually presented by Latin letter
such as s , p, t.
19
20. 5. Sampling Error
error :The difference between observed value and
true value.
Three kinds of error:
(1) Systematic error (fixed)
(2) Measurement error (random) (Observational error)
(3) Sampling error (random)
20
21. Sampling error
• The statistics of different samples from same
population: different each other!
• The statistics: different from the parameter!
The sampling error exists in any sampling research.
It can not be avoided but may be estimated.
21
22. II. Types of data
1. Numerical Data ( Quantitative Data )
• The variable describe the characteristic of individuals
quantitatively
-- Numerical Data
• The data of numerical variable
-- Quantitative Data
22
23. 2. Categorical Data ( Enumeration Data )
• The variable describe the category of individuals according to a
characteristic of individuals
-- Categorical Data
• The number of individuals in each category
-- Enumeration Data
23
24. Special case of categorical data :
Ordinal Data ( rank data )
• There exists order among all possible categories. ( level of
measurement)
-- Ordinal Data
• The data of ordinal variable, which represent the order of
individuals only
-- Rank data
24
25. Examples
Which type of data they belong to?
• RBC (4.58 106/mcL)
• Diastolic/systolic blood pressure
(8/12 kPa) or ( 80/100 mmHg)
• Percentage of individuals with blood type A (20%)
(A, B, AB, O)
• Protein in urine (++) (-, ±, +, ++, +++)
• Incidence rate of breast cancer ( 35/100,000)
25
26. III. The Basic Steps of Statistical Work
1. Design of study
• Professional design:
Research aim
Subjects,
Measures, etc.
26
28. 2. Collection of data
• Source of data
Government report system such as: cholera,
plague (black death) …
Registration system such as: birth/death
certificate …
Routine records such as: patient case report …
Ad hoc survey such as: influenza A (H1N1) …
28
29. • Data collection – Accuracy, complete,
in time
Protocol: Place, subjects, timing; training; pilot;
questionnaire; instruments; sampling method and
sample size; budget…
Procedure: observation, interview, filling
form, letter, telephone, web.
29
30. 3. Data Sorting
• Checking
Hand, computer software
• Amend
• Missing data?
• Grouping
According to categorical variables (sex, occupation, disease…)
According to numerical variables (age, income, blood pressure …)
30
31. 31
4. Data Analysis
• Descriptive statistics (show the sample)
mean, incidence rate …
-- Table and plot
• Inferential statistics (towards the population)
-- Estimation
-- Hypothesis testing (comparison)
32. About Teaching and Learning
• Aim:
Training statistical thinking
Skill of dealing with medical data.
• Emphasize:
Essential concepts and statistical thinking
-- lectures and practice session
Skill of computer and statistical software
-- practice session ( Excel and SPSS )
32
37. 1- Numerical presentation
Tabular presentation (simple – complex)
Name of variable
(Units of variable)
Frequency %
-
- Categories
-
Total
Simple frequency distribution Table (S.F.D.T.)
Title
37
38. Table (I): Distribution of 50 patients at the surgical
department of AAAAA hospital in May 2008 according
to their ABO blood groups
Blood group Frequency %
A
B
AB
O
12
18
5
15
24
36
10
30
Total 50 100
38
39. Table (II): Distribution of 50 patients at the surgical
department of AAAAA hospital in May 2008 according to
their age
Age
(years)
Frequency %
20-<30
30-
40-
50+
12
18
5
15
24
36
10
30
Total 50 100
39
40. Complex frequency distribution Table
Table (III): Distribution of 20 lung cancer patients at the chest department
of AAAAA hospital and 40 controls in May 2008 according to smoking
Smoking
Lung cancer
Total
Cases Control
No. % No. % No. %
Smoker 15 75% 8 20% 23 38.33
Non
smoker 5 25% 32 80% 37 61.67
Total 20 100 40 100 60 100
40
41. Complex frequency distribution Table
Table (IV): Distribution of 60 patients at the chest department of
AAAAA hospital in May 2008 according to smoking & lung cancer
Smoking
Lung cancer
Total
positive negative
No. % No. % No. %
Smoker 15 65.2 8 34.8 23 100
Non
smoker 5 13.5 32 86.5 37 100
Total 20 33.3 40 66.7 60 100
41
45. Frequency polygon
Age
Sex
M-P
M F
20- (12%) (10%) 25
30- (36%) (30%) 35
40- (8%) (25%) 45
50- (16%) (15%) 55
60-70 (8%) (20%) 65
0
5
10
15
20
25
30
35
40
25 35 45 55 65
Age
%
Males Females
Figure (2): Distribution of 45 patients at (place) , in (time)
by age and sex 45
47. Histogram
Distribution of a group of cholera patients by age
Age (years) Frequency %
25-
30-
40-
45-
60-65
3
5
7
4
2
14.3
23.8
33.3
19.0
9.5
Total 21 100
0
5
10
15
20
25
30
35
0
2
5
3
0
4
0
4
5
6
0
6
5
Age (years)
%
Figure (2): Distribution of 100 cholera patients at (place) , in
(time) by age 47
53. 1- Measures of central tendency (averages)
Midrange
Smallest observation + Largest observation
2
Mode
the value which occurs with the greatest
frequency i.e. the most common value
Summery statistics
53
54. 1- Measures of central tendency (cont.)
Median
the observation which lies in the middle of the
ordered observation.
Arithmetic mean (mean)
Sum of all observations
Number of observations
Summery statistics
54