This document proposes a framework to quantify researcher diversity and its impact on productivity and awards. It suggests measuring diversity using Shannon entropy based on researchers' publication topics classified with MeSH terms. Higher diversity is correlated with lower productivity but higher awards. Future work could involve larger datasets and distinguishing specialists in the "long tail" from generalists at the "head of the tail".
Bibliometrics literally means "book measurement" but the term is used about all kinds of documents (with journal articles as the dominant kind of document).
What is measured are not the physical properties of documents but statistical patterns in variables such as authorship, sources, subjects, geographical origins, and citations.
Metrics envelop number of subject domains, e.g., general relativity under physics, networking, mathematics, software analysis, etc. --- STATISTICS
Enumerated in the slides are the different metric fields in information science.
Presentation covering introduction to bibliometrics. Suggested audience: PGRs, early career researchers, academic staff wanting refresher, research support staff
Scientometrics and semantic maps for development (Author: Iina Hellsten)Sarah Cummings
This presentation was a preliminary overview of the research being undertaken by Iina Hellsten and Sarah Cummings. It provides a first outline of what we are planning to do.
Bibliometric Research Synthesis
bibliometrix: An R-tool for comprehensive science mapping analysis
In the seminar we propose and use a unique tool, developed in the R language, which follows a classic logical bibliometric workflow that we reconstruct. We have designed and produced an R-tool for comprehensive bibliometric analyses. R is a language and environment for statistical computing and graphics. It provides a wide variety of statistical and graphical techniques and is highly extensible. In addition to enabling statistical operations, it is an object-oriented and functional programming language; hence, you can automate your analyses and create new functions. It has an open-software nature, which means it is well supported by the user community and new functions are regularly contributed by users, many of whom are prominent statisticians. As it is programmed in R, the proposed tool is flexible, can be rapidly upgraded, and can be integrated with other statistical R-packages. It is therefore useful in a constantly changing field such as bibliometrics.
Dr. Frances Harris from Centre for Earth and Environmental Sciences Research, School of Geography, Geology and the Environment at Kingston University - with areview of approaches to knowledge co-production focused on food, water, energy and environment.
Bibliometrics literally means "book measurement" but the term is used about all kinds of documents (with journal articles as the dominant kind of document).
What is measured are not the physical properties of documents but statistical patterns in variables such as authorship, sources, subjects, geographical origins, and citations.
Metrics envelop number of subject domains, e.g., general relativity under physics, networking, mathematics, software analysis, etc. --- STATISTICS
Enumerated in the slides are the different metric fields in information science.
Presentation covering introduction to bibliometrics. Suggested audience: PGRs, early career researchers, academic staff wanting refresher, research support staff
Scientometrics and semantic maps for development (Author: Iina Hellsten)Sarah Cummings
This presentation was a preliminary overview of the research being undertaken by Iina Hellsten and Sarah Cummings. It provides a first outline of what we are planning to do.
Bibliometric Research Synthesis
bibliometrix: An R-tool for comprehensive science mapping analysis
In the seminar we propose and use a unique tool, developed in the R language, which follows a classic logical bibliometric workflow that we reconstruct. We have designed and produced an R-tool for comprehensive bibliometric analyses. R is a language and environment for statistical computing and graphics. It provides a wide variety of statistical and graphical techniques and is highly extensible. In addition to enabling statistical operations, it is an object-oriented and functional programming language; hence, you can automate your analyses and create new functions. It has an open-software nature, which means it is well supported by the user community and new functions are regularly contributed by users, many of whom are prominent statisticians. As it is programmed in R, the proposed tool is flexible, can be rapidly upgraded, and can be integrated with other statistical R-packages. It is therefore useful in a constantly changing field such as bibliometrics.
Dr. Frances Harris from Centre for Earth and Environmental Sciences Research, School of Geography, Geology and the Environment at Kingston University - with areview of approaches to knowledge co-production focused on food, water, energy and environment.
2021 - Communicating Astronomy with the Public TalkJohn C. Besley
An updated version of the 'strategic science communication' talk for astronomy communicators. Focuses more deeply on the goals that might make the most sense for basic science researchers.
Researching multilingually and interculturallyRMBorders
Holmes, P. (Durham University), Fay, R. (University of Manchester), Attia, M. (Durham University) and Andrews, J. (University of the West of England), Researching multilingually and interculturally. Paper presented at the 19th CultNet, hosted by Durham University, April 21st-23rd, 2016.
Discussion Question 1Select one species of organism that is intere.docxfelipaser7p
Discussion Question 1
Select one species of organism that is interesting to you (try to pick something different from your classmates’ choices---you only have about two million to choose from). Show the hierarchy of classification for the organism you choose from Domain through to binomial name (genus and species). Be sure to write the names in correct format and to spell correctly.
What are the nearest relatives of your chosen species? How do the levels of classification for your species trace the major evolutionary steps the ancestors of your species took over time? Be prepared to discuss similarities and differences in the evolution of diversity in the examples chosen by your classmates.
I want to do this on a Horse. Basic information. Does not need to be an essay just a paragraph or two, with reference in APA format.
Discussion Question 2
Over the long period of time that life has existed on Earth, there have been a number of important or significant innovations including (but not limited to) endosymbiosis to create mitochondria and chloroplasts; multicellularity; adaptation to land by plants and animals; development of exoskeletons in arthropods, shells in molluscs, and notochords followed by vertebral columns in chordates and vertebrates; and bipedalism in the ancestry of humans. All of these had to come about by natural selection in response to changing environmental forces.
Pick one of these significant innovations and describe:
How the innovation appears to have happened
What environmental challenges were met and overcome by this innovation, and
What opportunities were opened for the organism that made this innovation.
Be prepared to discuss the general concept of innovation and opportunity in response to selective challenges in life and how this can greatly increase biodiversity over time.
This does not need to be an essay. Just a paragraph or two with reference.
.
SOLE: Linking Research Papers with Science Objects
Scientometrics
1. Measuring Researcher Diversity
and its Impact on Awards
Tanu Malik Computation Institute
Andrey Rzhetsky Department of Human Genetics
Ian Foster Computation Institute
University of Chicago
Argonne National Laboratory
2. History Leonardo da Vinci
Bohr
Darwin
Renaissance polymath, painter, sculptor, architect,
musician, mathematician, engineer, inventor, anatomist,
Einstein
geologist, cartographer, botanist, and writer
3. Biological species
Short Term: Competition Long Term: Changing Environments
Competition Niche Differences
Adapted from: Levine, J. M. & HilleRisLambers, J. (2012) The Maintenance of Species Diversity. Nature Education Knowledge 3(10):59
4. Biological species:
Specialist/Generalist
Short Term: Competition Long Term: Changing Environments
Competition Niche Differences
Adapted from: Levine, J. M. & HilleRisLambers, J. (2012) The Maintenance of Species Diversity. Nature Education Knowledge 3(10):59
5. Science Research
Short Term: Competition on Topics Long Term: Changing Funding Situation
Competition Niche Differences
6. Why is this important?
• Research articles whose coauthors are in different departments at
the same university receive more citations than those authored in a
single department (Katz et.al, 1997).
• Multi–university collaborations that include a top tier–university
were found to produce the highest–impact research articles (Jones,
et al., 2008).
• It has also been demonstrated that scholarly work covering a range
of fields — and patents generated by larger teams of co–authors —
tend to have greater impact over time (Wuchty, et al., 2007).
• In the area of nanotechnology authors who have a diverse set of
collaborators tend to write articles that have higher impact (Rafols
et. al., 2010).
• Finally, diverse groups can, depending on the type of task,
outperform individual experts or even groups of experts (Page,
2007).
7. Individual Focus
• Some mathematicians are birds, other are frogs.
Birds fly high in the air, frogs live in the mud
below.. (Freeman Dyson, AMS Einstein Lecture,
2008)
• “Foxes”, individuals who know many little things,
tend to make better predictions about future
outcomes than “hedgehogs” who focus on one
big thing (Tetlock, 2005)
• Individuals’ degree of focus is positively
correlated with the quality of their contributions
(Adamic, 2010)
8. Goals and Problems
• Goal: Quantify the ability of each class of
researchers (specialists/generalists) to
competition in near term and adapt to
changing funding requirements in the long
term.
A. How to determine specialist and generalist
researchers?
B. How to quantify the ability to compete/adapt?
9. A. Researcher Diversity
• Based on their publication history, determine
if their interests can be classified into highly
varied interests or focused interests
• Researcher profiles created from PubMed
10. Creating Researcher Profiles
• Author Disambiguation
– Data mining methods
– Microsoft Academic Search
• Automated profiles of users
• Web scraping
• Person’s organization and domain
of interest as disambiguating features
– Harvard Profiles
• Directly links to PubMed
• Also takes an input of publications
claimed by an author.
13. Researcher Diversity
• Shannon’s Entropy
pi: proportion of individual’s contributions in
category i
– Category = MeSH term
– Frequency over years
0.00 0.41 0.82 1.00 1.59 2.00
14. Shannon Vs Sterling
• Variety: how many different areas an
individual contributes
• Balance: how evenly their efforts are
distributed among these areas; and,
• Similarity, or how related those areas are
15. B. Quantifying the Ability to Compete
• Entropy has a negative correlation with measures of impact and productivity,
viz. the h-index and the g-index.
• Result (in a way) reconfirms Adamic’s result of positive correlation between
specialist and productivity
16. Geniuses, Birds, Beavers, Frogs
Geniuses: Dwell on many topics at all times (8-9)
Birds: Dwell on many topics over their research career, but a few topics at a given time
Beavers: Specialists whose focus is interdisciplinary
Frogs: High-focused
20. Future Work
• Larger datasets
• Researchers in the long tail are specialists;
generalists are in the head of the tail;
21. Summary
• A framework to understand researcher
diversity
• Quantification of researcher diversity with
productivity and awards
• Negative correlation of diversity with
productivity and positive with awards
• Use more accurate author disambiguation
methods
Editor's Notes
(i.e. single concept can exist in more than one place in different contexts).within the hierarchy each meaning, in MeSH terminology called concept, is represented by its Unique ID, its MeSH heading (default name) and its Tree Numbers (contexts). The tree numbers are in fact all distinct drill paths leading from the root of the hierarchy to the concept. Each tree number is a level, and encompasses all levels below it. Every journal article is indexed with about 10-15 descriptors, allowing us to compare researchers with similar number of publications. We, though however, eliminate polysemous descriptors (which occur in multiple tree paths) so as to not include unrelated research areas.