3. البحثية البيانات
Research data
الخام البيانات مقابل في
Raw data
3
Raw data Research data
Modelling Not modelled, or modelled in a way that serve certain
tool for specific platform
Modelled in a standardized way that can serve reuse of the
data
Description Might include descriptive metadata but not obligatory Must include metadata that can describe it clearly.
Exchange Normally built for backend usage and have a very
limited exchange possibility.
Ready to be exchanged/harvested standalone.
Curation Have no certain plans for curation as by its own. Should have a curation plan.
Publishing Just given access via a tool or application. Can be published stand alone.
Re-usability Needs effort to be harvested and get prepared for
further usage
Ready for reuse in less painful manner.
البحثية بالبيانات التعريف
12. التجميع
واإللتقاط
ال
Web Scraping for research legal
According to German Law of copyright paragraphs § 60c Scientific Research, and § 60d Text and data
mining, using web scraping, and text mining is totally allowed practice for research proposes.
اإلنترنت من البيانات بجمع المرتبطة واألخالقية القانونية االعتبارات
12
- Researchers may re-produce up to 75% of the work for the
research proposed.
- You can systematically/automatically capture and re-produce
data corpuses.
- Share the aggregated data among specific communities for
research-related reasons e.g. validation.
- Storing the scraped data as long as there is a research-
related reason.
- It is always an excellent practice to ask for permission if you
have doubts.
- Protected data and copyrighted materials that state exclusively that
scraping is not allowed may not be used for research with no
permission (e.g., Robots.txt practices)
- breaching technical protection aspects and causing bottlenecks for
technical infrastructure can trigger legal action against scrapers
(e.g., LinkedIn case)
- Researchers may not publish the scraped datasets without prior
permission,
- You cannot archive the scraped dataset forever, and data should be
destroyed when the reason for its gathering comes to an end.
What is allowed? What is Not allowed?
25. البحث ومنتج البحثية البيانات
Research data as a research product
البيانات نشر
25
البيانات بنشر نقوم يجعلنا الذي السبب ما
السمعة بناء
األكاديمية
academic
reputation: data by its
own is considered a stand-
alone citable research
output.
المستقبلي التحليلFurther
analysis: increase
opportunities for further
investigation via different
methods.
مصادر تنمية
المجتمع
Community
resource development:
publishing research data
can be valuable to known
user groups.
التحقق
Verification: enable
others to follow the
process of research that
led to the findings and
potentially re-produce or
verify these findings.
المستقبلية المنشورات
Further
publications: data article
publications contribute to
scientific communication
and debate about reuse.
والتدريس التعلم
Learning &
teaching: data embedding
in a learning/teaching
enhance learning process
interactivity, encourages
users to participate in the
research