An overview of how we're using semantic technologies at Springer Nature, and an introduction to our latest product: www.scigraph.com
(Keynote given at http://2016.semantics.cc/, Leipzig, Sept 2016)
"Hạnh phúc xanh" là chương trình phát triển cộng đồng, thúc đẩy người dân trồng cây nhằm: tăng mật độ cây xanh ở Việt Nam, tăng sự kết nối giữa con người và tự nhiên, sự kết nối giữa con người và con người, từ đó mang lại sự bảo vệ và hạnh phúc cho mọi người.
There is more value to links that simple SEO. This is why, between 2000 and 2002, there was a huge patent case in the US over who owned the hyperlink! Find out, 20 years later, what all the fuss was about.
9 mô hình tổ chức và quản lý doanh nghiệp phổ biến hiện nay.
===
Bước đầu tiên trong việc tổ chức bộ máy quản trị doanh nghiệp là việc xác định cơ cấu tổ chức phù hợp. Vì sao lại như vậy?
Một tổ chức được cấu trúc hiệu quả sẽ giúp doanh nghiệp quản trị công việc một cách hệ thống, trật tự, linh hoạt, tạo điều kiện thuận lợi cho việc thực hiện chiến lược tương lai. Trong khi đó, cơ cấu tổ chức không hợp lý sẽ cản trở hoặc hút cạn tiềm lực của chính doanh nghiệp đó.
Đặc biệt, trong bối cảnh kinh tế thay đổi chóng mặt như hiện nay, môi trường làm việc truyền thống nhường chỗ cho xu thế "Work From Home" thì cơ cấu tổ chức phải đáp ứng sự linh hoạt và biến đổi của thị trường rất quan trọng.
Để giúp anh/chị có cái nhìn trực quan và có thêm kiến thức xây dựng cơ cấu tổ chức hiệu quả, SlimCRM.vn xin gửi tới a/c tài liệu "9 mô hình tổ chức và quản trị doanh nghiệp thịnh hành nhất hiện nay".
Nội dung:
+ Các loại mô hình tổ chức doanh nghiệp, ưu và nhược điểm của từng mô hình.
+ Mô hình nào phù hợp để Work from home?
+ Ví dụ trực quan về cách tổ chức doanh nghiệp của các tập đoàn hàng đầu thế giới (Twitter, Spotify, Buffer..)
Link tải full tài liệu: https://go.slimcrm.vn/9-mo-hinh-to-chuc-dn
--
Nguồn: SlimCRM.vn
"Hạnh phúc xanh" là chương trình phát triển cộng đồng, thúc đẩy người dân trồng cây nhằm: tăng mật độ cây xanh ở Việt Nam, tăng sự kết nối giữa con người và tự nhiên, sự kết nối giữa con người và con người, từ đó mang lại sự bảo vệ và hạnh phúc cho mọi người.
There is more value to links that simple SEO. This is why, between 2000 and 2002, there was a huge patent case in the US over who owned the hyperlink! Find out, 20 years later, what all the fuss was about.
9 mô hình tổ chức và quản lý doanh nghiệp phổ biến hiện nay.
===
Bước đầu tiên trong việc tổ chức bộ máy quản trị doanh nghiệp là việc xác định cơ cấu tổ chức phù hợp. Vì sao lại như vậy?
Một tổ chức được cấu trúc hiệu quả sẽ giúp doanh nghiệp quản trị công việc một cách hệ thống, trật tự, linh hoạt, tạo điều kiện thuận lợi cho việc thực hiện chiến lược tương lai. Trong khi đó, cơ cấu tổ chức không hợp lý sẽ cản trở hoặc hút cạn tiềm lực của chính doanh nghiệp đó.
Đặc biệt, trong bối cảnh kinh tế thay đổi chóng mặt như hiện nay, môi trường làm việc truyền thống nhường chỗ cho xu thế "Work From Home" thì cơ cấu tổ chức phải đáp ứng sự linh hoạt và biến đổi của thị trường rất quan trọng.
Để giúp anh/chị có cái nhìn trực quan và có thêm kiến thức xây dựng cơ cấu tổ chức hiệu quả, SlimCRM.vn xin gửi tới a/c tài liệu "9 mô hình tổ chức và quản trị doanh nghiệp thịnh hành nhất hiện nay".
Nội dung:
+ Các loại mô hình tổ chức doanh nghiệp, ưu và nhược điểm của từng mô hình.
+ Mô hình nào phù hợp để Work from home?
+ Ví dụ trực quan về cách tổ chức doanh nghiệp của các tập đoàn hàng đầu thế giới (Twitter, Spotify, Buffer..)
Link tải full tài liệu: https://go.slimcrm.vn/9-mo-hinh-to-chuc-dn
--
Nguồn: SlimCRM.vn
Luận Văn Các Nhân Tố Ảnh Hưởng Đến Việc Áp Dụng Chuẩn Mực Báo Cáo Tài Chính Quốc Tế Cho Doanh Nghiệp Nhỏ Và Vừa Tại Việt Nam đã chia sẻ đến cho các bạn học viên nguồn tài liệu hoàn toàn hữu ích đáng để xem và tham khảo. Nếu các bạn muốn tải bài mẫu này hãy nhắn tin ngay qua zalo/telegram : 0973.287.149 để được hỗ trợ tải nhé.
Turning A Neglected YouTube Channel into a Traffic Generation MachinePhil Nottingham
In this talk, performed at BrightonSEO April 2023, Phil Nottingham breaks down the steps to turning a neglected YouTube Channel into a resource that generates loads of views and drives traffic to your website.
Hội chợ HawaExpo 2023 đã kết thúc và để lại nhiều ấn tượng đối với doanh nghiệp, khách hàng, đối tác trong nước và quốc tế. Để đạt được những kết quả ấn tượng này, bên cạnh sự chỉ đạo của các cơ quan ban ngành, sự hỗ trợ của các Hiệp hội, đối tác trong và ngoài nước, sự hỗ lực từ phía BTC còn có sự đóng góp vô cùng quan trọng từ phía quý DN trong việc hưởng ứng và làm việc nghiêm túc để mang đến một Hội chợ HawaExpo 2023 của ngành Gỗ & Nội thất Việt Nam Đổi mới – Chuyên Nghiệp – Hiệu quả.
Tiểu luận kinh doanh Quốc tế, Nghiên cứu công ty Nike và bài học kinh nghiệm. Để hiểu rõ môi trường kinh doanh, hoạt động kinh doanh quốc tế của Nike để từ đó đưa ra các bài học và giải pháp cho các doanh nghiệp giày dép của Việt Nam, đề tài này sẽ đi sâu phân tích các phần chính sau:
Using an employee knowledge graph for employee engagement and career mobilityNeo4j
Learn what a knowledge graph is and how it plays a salient role in enterprises and how to apply knowledge graphs for various business use cases across the data spectrum – from management to analytics and machine learning.
Nhận viết luận văn Đại học , thạc sĩ - Zalo: 0917.193.864
Tham khảo bảng giá dịch vụ viết bài tại: vietbaocaothuctap.net
Luận văn thạc sĩ quản lý giáo dục, dành cho các bạn là đề tài khóa luận tham khảo, đề tài: Quản lý các dự án hợp tác quốc tế về đào tạo của trường đại học Giao thông vận tải đến năm 2020
[BrightonSEO 2022] Unlocking the Hidden Potential of Product Listing PagesAreej AbuAli
E-commerce websites’ product listing pages contain untapped hidden potential. This talk is all about unlocking the magic of your listing pages by making the most out of filters and internal linking. Instead of being fixated on those landing page head terms, let’s turn our attention to the indexability of long-tail pages with high conversion. Whether you work in e-commerce or not, we’ll also cover how to embed yourself within Tech teams and analyse impactful changes.
Luận Văn Thạc Sĩ Các Yếu Tố Cơ Bản Ảnh Hưởng Đến Chi Tiêu Giáo Dục Của Hộ Gia Đình Trên Địa Bàn Thành Phố Hồ Chí Minh đã chia sẻ đến cho các bạn nguồn tài liệu hoàn toàn hữu ích. Nếu các bạn có nhu cầu cần tải bài mẫu này vui lòng nhắn tin ngay qua zalo/telegram : 0932.091.562 để được hỗ trợ tải nhé!
Luận Văn Các Nhân Tố Ảnh Hưởng Đến Việc Áp Dụng Chuẩn Mực Báo Cáo Tài Chính Quốc Tế Cho Doanh Nghiệp Nhỏ Và Vừa Tại Việt Nam đã chia sẻ đến cho các bạn học viên nguồn tài liệu hoàn toàn hữu ích đáng để xem và tham khảo. Nếu các bạn muốn tải bài mẫu này hãy nhắn tin ngay qua zalo/telegram : 0973.287.149 để được hỗ trợ tải nhé.
Turning A Neglected YouTube Channel into a Traffic Generation MachinePhil Nottingham
In this talk, performed at BrightonSEO April 2023, Phil Nottingham breaks down the steps to turning a neglected YouTube Channel into a resource that generates loads of views and drives traffic to your website.
Hội chợ HawaExpo 2023 đã kết thúc và để lại nhiều ấn tượng đối với doanh nghiệp, khách hàng, đối tác trong nước và quốc tế. Để đạt được những kết quả ấn tượng này, bên cạnh sự chỉ đạo của các cơ quan ban ngành, sự hỗ trợ của các Hiệp hội, đối tác trong và ngoài nước, sự hỗ lực từ phía BTC còn có sự đóng góp vô cùng quan trọng từ phía quý DN trong việc hưởng ứng và làm việc nghiêm túc để mang đến một Hội chợ HawaExpo 2023 của ngành Gỗ & Nội thất Việt Nam Đổi mới – Chuyên Nghiệp – Hiệu quả.
Tiểu luận kinh doanh Quốc tế, Nghiên cứu công ty Nike và bài học kinh nghiệm. Để hiểu rõ môi trường kinh doanh, hoạt động kinh doanh quốc tế của Nike để từ đó đưa ra các bài học và giải pháp cho các doanh nghiệp giày dép của Việt Nam, đề tài này sẽ đi sâu phân tích các phần chính sau:
Using an employee knowledge graph for employee engagement and career mobilityNeo4j
Learn what a knowledge graph is and how it plays a salient role in enterprises and how to apply knowledge graphs for various business use cases across the data spectrum – from management to analytics and machine learning.
Nhận viết luận văn Đại học , thạc sĩ - Zalo: 0917.193.864
Tham khảo bảng giá dịch vụ viết bài tại: vietbaocaothuctap.net
Luận văn thạc sĩ quản lý giáo dục, dành cho các bạn là đề tài khóa luận tham khảo, đề tài: Quản lý các dự án hợp tác quốc tế về đào tạo của trường đại học Giao thông vận tải đến năm 2020
[BrightonSEO 2022] Unlocking the Hidden Potential of Product Listing PagesAreej AbuAli
E-commerce websites’ product listing pages contain untapped hidden potential. This talk is all about unlocking the magic of your listing pages by making the most out of filters and internal linking. Instead of being fixated on those landing page head terms, let’s turn our attention to the indexability of long-tail pages with high conversion. Whether you work in e-commerce or not, we’ll also cover how to embed yourself within Tech teams and analyse impactful changes.
Luận Văn Thạc Sĩ Các Yếu Tố Cơ Bản Ảnh Hưởng Đến Chi Tiêu Giáo Dục Của Hộ Gia Đình Trên Địa Bàn Thành Phố Hồ Chí Minh đã chia sẻ đến cho các bạn nguồn tài liệu hoàn toàn hữu ích. Nếu các bạn có nhu cầu cần tải bài mẫu này vui lòng nhắn tin ngay qua zalo/telegram : 0932.091.562 để được hỗ trợ tải nhé!
"Identifying Springer's Author (with ORCID iD) on SpringerLink and the benefits" presented by Hazman Aziz, Account Development Manager for Southeast Asia at Springer Nature, at ORCID's Malaysia workshop on 28 February 2017.
SpringerNature and its sharing strategy on ReadCubeMartijn Roelandse
Springer Nature wants researchers to share content easily and legally. Our Springer Nature SharedIt content-sharing initiative means that links to view-only, full-text subscription research articles can be posted anywhere - including on social media platforms, author websites and in institutional repositories - so researcher can share research with colleagues and general audiences.
POSHAN District Nutrition Profile_Aurangabad_BiharPOSHAN
POSHAN District Nutrition Profiles (DNPs) draw on diverse sources of data to compile a set of indicators on the state of nutrition and its cross-sectoral determinants. The profiles are intended to be conversation-starters at the district level and to enable discussions about why undernutrition levels are high, and which factors, at multiple levels, might need to be addressed to improve nutrition.
PLEASE NOTE that POSHAN is regularly tracking data sources as they are released and updating the profiles accordingly.
You may be surprised to learn which five dilemmas of English Education in America have been around for the past two hundred years. The revolution is long overdue!
Informe semestral ene octubre 2015 proyectos promovidos.Hermel Cabrera
Proyectos emprendedores e innovadores promovidos y gestionados por la Dirección de Innovación Productiva; con el objetivo de disminuir importaciones y aumentar la producción nacional con mayor Valor Agregado Nacional (VAN) Ecuatoriano.
The nature.com ontologies portal: nature.com/ontologiesTony Hammond
Presentation by Tony Hammond and Michele Pasin to Linked Science workshop, co-located with International Semantic Web Conference (ISWC) 2015, on October 12, 2015
MuseoTorino, first italian project using a GraphDB, RDFa, Linked Open Data21Style
MuseoTorino, is the first italian project using Web 3.0 tecnologies. NOSQL-GraphDB (Neo4J), RDFa, Linked Open Data.
MuseoTorino is a 21style (www.21-style.com) project for the municipality of Torino, Italy.
These slides come from CodeMotion, the best Italian conference for developers and IT entusiast !
"Semantic Integration Is What You Do Before The Deep Learning". dev.bg Machine Learning seminar, 13 May 2019.
It's well known that 80\% of the effort of a data scientist is spent on data preparation. Semantic integration is arguably the best way to spend this effort more efficiently and to reuse it between tasks, projects and organizations. Knowledge Graphs (KG) and Linked Open Data (LOD) have become very popular recently. They are used by Google, Amazon, Bing, Samsung, Springer Nature, Microsoft Academic, AirBnb… and any large enterprise that would like to have a holistic (360 degree) view of its business. The Semantic Web (web 3.0) is a way to build a Giant Global Graph, just like the normal web is a Global Web of Documents. IEEE already talks about Big Data Semantics. We review the topic of KGs and their applicability to Machine Learning.
Multi-Model Data Query Languages and Processing ParadigmsJiaheng Lu
Specifying users' interests with a formal query language is a typically challenging task, which becomes even harder in the context of multi-model data management because we have to deal with data variety. It usually lacks a unified schema to help the users issuing their queries, or has an incomplete schema as data come from disparate sources. Multi-Model DataBases (MMDBs) have emerged as a promising approach for dealing with this task as they are capable of accommodating and querying the multi-model data in a single system. This tutorial aims to offer a comprehensive presentation of a wide range of query languages for MMDBs and to make comparisons of their properties from multiple perspectives. We will discuss the essence of cross-model query processing and provide insights on the research challenges and directions for future work. The tutorial will also offer the participants hands-on experience in applying MMDBs to issue multi-model data queries.
The Power of Semantic Technologies to Explore Linked Open DataOntotext
Atanas Kiryakov's, Ontotext’s CEO, presentation at the first edition of Graphorum (http://graphorum2017.dataversity.net/) – a new forum that taps into the growing interest in Graph Databases and Technologies. Graphorum is co-located with the Smart Data Conference, organized by the digital publishing platform Dataversity.
The presentation demonstrates the capabilities of Ontotext’s own approach to contributing to the discipline of more intelligent information gathering and analysis by:
- graphically explorinh the connectivity patterns in big datasets;
- building new links between identical entities residing in different data silos;
- getting insights of what type of queries can be run against various linked data sets;
- reliably filtering information based on relationships, e.g., between people and organizations, in the news;
- demonstrating the conversion of tabular data into RDF.
Learn more at http://ontotext.com/.
Apache® Spark™ MLlib: From Quick Start to Scikit-LearnDatabricks
These are the slides to support the Apache® Spark™ MLlib: From Quick Start to Scikit-Learn webinar.
In this webcast, Joseph Bradley from Databricks will be speaking about Apache Spark’s distributed Machine Learning Library - MLlib.
We will start off with a quick primer on machine learning, Spark MLlib, and a quick overview of some Spark machine learning use cases. We will continue with multiple Spark MLlib quick start demos. Afterwards, the talk will transition toward the integration of common data science tools like Python pandas, scikit-learn, and R with MLlib
This slide deck has been prepared for a workshop on Linked Data Publishing and Semantic Processing using the Redlink platform (http://redlink.co). The workshop delivered at the Department of Information Engineering, Computer Science and Mathematics at Università degli Studi dell'Aquila aimed at providing a general understanding of Semantic Web Technologies and how these can be used in real world use cases such as Salzburgerland Tourismus.
A brief introduction has been also included on MICO (Media in Context) a European Union part-funded research project to provide cross-media analysis solutions for online multimedia producers.
Knowledge Discovery tools using Linked Data techniques - {resentation for the Linked Data 4 Knowledge Discovery Workshop at ECML/PKDD2015 conference - http://events.kmi.open.ac.uk/ld4kd2015/ -
Boost your data analytics with open data and public news contentOntotext
Get guidance through the gigantic sea of freely available Open Data and learn how it can empower you analysis of any kind of sources.
This webinar is a live demo of news and data analytics, based on rich links within big knowledge graphs. It will show you how to:
Build ranking reports (e.g for people and organisations)
View topics linked implicitly (e.g. daughter companies, key personnel, products …)
Draw trend lines
Extend your analytics with additional data sources
Building search and discovery services for Schibsted (LSRS '17)Sandra Garcia
Presentation given at the Large Scale Recommender Systems workshop (LSRS) in Recsys 2017.
This presentation describes the search and discovery products we are working on in Schibsted for the domains of news and marketplaces as well as the challenges within each of these domains. It also covers how we bring these services into production including the system architecture and deployment process.
Building a Dataset Search Engine with Spark and Elasticsearch: Spark Summit E...Spark Summit
Elasticsearch provides native integration with Apache Spark through ES-Hadoop. However, especially during development, it is at best cumbersome to have Elasticsearch running in a separate machine/instance. Leveraging Spark Cluster with Elasticsearch Inside it is possible to run an embedded instance of Elasticsearch in the driver node of a Spark Cluster. This opens up new opportunities to develop cutting-edge applications. One such application is Dataset Search.
Oscar will give a demo of a Dataset Search Engine built on Spark Cluster with Elasticsearch Inside. Motivation is that once Elasticsearch is running on Spark it becomes possible and interesting to have the Elasticsearch in-memory instance join an (existing) Elasticsearch cluster. And this in turn enables indexing of Datasets that are processed as part of Data Pipelines running on Spark. Dataset Search and Data Management are R&D topics that should be of interest to Spark Summit East attendees who are looking for a way to organize their Data Lake and make it searchable.
Establishing the Connection: Creating a Linked Data Version of the BNBnw13
Presentation for Talis Linked Data in Libraries event July 14 2011
Describes some of the choices made and lessons learned in migrating from traditional bibliographic metadata to linked open data.
Designing great dashboards: a slidedeck for dashboard developersMichele Pasin
After reading many useful papers and online resources on the topic of dashboards design, I realised I didn’t have a single document collecting and organising all of the useful ideas I encountered. So the purpose of this slidedeck is to serve as a (work-in-progress) handbook a dashboards developer can get back to, in order to find inspiration, advice, and maybe, even endorsement. Use at your own risk!
STI 2022 - Generating large-scale network analyses of scientific landscapes i...Michele Pasin
The growth of large, programatically accessible bibliometrics databases presents new opportunities for complex analyses of publication metadata. In addition to providing a wealth of information about authors and institutions, databases such as those provided by Dimensions also provide conceptual information and links to entities such as grants, funders and patents. However, data is not the only challenge in evaluating patterns in scholarly work: These large datasets can be challenging to integrate, particularly for those unfamiliar with the complex schemas necessary for accommodating such heterogeneous information, and those most comfortable with data mining may not be as experienced in data visualisation. Here, we present an open-source Python library that streamlines the process accessing and diagramming subsets of the Dimensions on Google BigQuery database and demonstrate its use on the freely available Dimensions COVID-19 dataset. We are optimistic that this tool will expand access to this valuable information by streamlining what would otherwise be multiple complex technical tasks, enabling more researchers to examine patterns in research focus and collaboration over time.
Adjusting primitives for graph : SHORT REPORT / NOTESSubhajit Sahu
Graph algorithms, like PageRank Compressed Sparse Row (CSR) is an adjacency-list based graph representation that is
Multiply with different modes (map)
1. Performance of sequential execution based vs OpenMP based vector multiply.
2. Comparing various launch configs for CUDA based vector multiply.
Sum with different storage types (reduce)
1. Performance of vector element sum using float vs bfloat16 as the storage type.
Sum with different modes (reduce)
1. Performance of sequential execution based vs OpenMP based vector element sum.
2. Performance of memcpy vs in-place based CUDA based vector element sum.
3. Comparing various launch configs for CUDA based vector element sum (memcpy).
4. Comparing various launch configs for CUDA based vector element sum (in-place).
Sum with in-place strategies of CUDA mode (reduce)
1. Comparing various launch configs for CUDA based vector element sum (in-place).
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...pchutichetpong
M Capital Group (“MCG”) expects to see demand and the changing evolution of supply, facilitated through institutional investment rotation out of offices and into work from home (“WFH”), while the ever-expanding need for data storage as global internet usage expands, with experts predicting 5.3 billion users by 2023. These market factors will be underpinned by technological changes, such as progressing cloud services and edge sites, allowing the industry to see strong expected annual growth of 13% over the next 4 years.
Whilst competitive headwinds remain, represented through the recent second bankruptcy filing of Sungard, which blames “COVID-19 and other macroeconomic trends including delayed customer spending decisions, insourcing and reductions in IT spending, energy inflation and reduction in demand for certain services”, the industry has seen key adjustments, where MCG believes that engineering cost management and technological innovation will be paramount to success.
MCG reports that the more favorable market conditions expected over the next few years, helped by the winding down of pandemic restrictions and a hybrid working environment will be driving market momentum forward. The continuous injection of capital by alternative investment firms, as well as the growing infrastructural investment from cloud service providers and social media companies, whose revenues are expected to grow over 3.6x larger by value in 2026, will likely help propel center provision and innovation. These factors paint a promising picture for the industry players that offset rising input costs and adapt to new technologies.
According to M Capital Group: “Specifically, the long-term cost-saving opportunities available from the rise of remote managing will likely aid value growth for the industry. Through margin optimization and further availability of capital for reinvestment, strong players will maintain their competitive foothold, while weaker players exit the market to balance supply and demand.”
The Building Blocks of QuestDB, a Time Series Databasejavier ramirez
Talk Delivered at Valencia Codes Meetup 2024-06.
Traditionally, databases have treated timestamps just as another data type. However, when performing real-time analytics, timestamps should be first class citizens and we need rich time semantics to get the most out of our data. We also need to deal with ever growing datasets while keeping performant, which is as fun as it sounds.
It is no wonder time-series databases are now more popular than ever before. Join me in this session to learn about the internal architecture and building blocks of QuestDB, an open source time-series database designed for speed. We will also review a history of some of the changes we have gone over the past two years to deal with late and unordered data, non-blocking writes, read-replicas, or faster batch ingestion.
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...John Andrews
SlideShare Description for "Chatty Kathy - UNC Bootcamp Final Project Presentation"
Title: Chatty Kathy: Enhancing Physical Activity Among Older Adults
Description:
Discover how Chatty Kathy, an innovative project developed at the UNC Bootcamp, aims to tackle the challenge of low physical activity among older adults. Our AI-driven solution uses peer interaction to boost and sustain exercise levels, significantly improving health outcomes. This presentation covers our problem statement, the rationale behind Chatty Kathy, synthetic data and persona creation, model performance metrics, a visual demonstration of the project, and potential future developments. Join us for an insightful Q&A session to explore the potential of this groundbreaking project.
Project Team: Jay Requarth, Jana Avery, John Andrews, Dr. Dick Davis II, Nee Buntoum, Nam Yeongjin & Mat Nicholas
Adjusting OpenMP PageRank : SHORT REPORT / NOTESSubhajit Sahu
For massive graphs that fit in RAM, but not in GPU memory, it is possible to take
advantage of a shared memory system with multiple CPUs, each with multiple cores, to
accelerate pagerank computation. If the NUMA architecture of the system is properly taken
into account with good vertex partitioning, the speedup can be significant. To take steps in
this direction, experiments are conducted to implement pagerank in OpenMP using two
different approaches, uniform and hybrid. The uniform approach runs all primitives required
for pagerank in OpenMP mode (with multiple threads). On the other hand, the hybrid
approach runs certain primitives in sequential mode (i.e., sumAt, multiply).
11. > Collaborative effort between Springer Nature and
Digital Science
> Supporting internal use cases,but also contributing
to an emerging web of linked science data
> Not just publications data but a wealth of other
related information
18. The Knowledge Graph is
about collecting
information about objects
in the real world
…so that we can do a better job of
providing users with what they're
looking for
19. reads / writes
is about
interested in
Three areas of knowledge we care about
20. Reads / Writes
Works for
Funds
Lead researcher in
Produces
Studies Located at
In
proceedings
C
ontains
Cites
Has learning
resource
Attends
Has topicProduces
23. Our Work So Far
2014
2013
2012
2015
2016
NPG Linked Data Platform
Nature Ontologies Portal
Springer Materials
Springer Conferences
Scigraph
Content Hub
Scigraph
prototype
Nero
Project
Linnaeus
Project
Springer
Protocols
CURI Semantic
Annotation Project
24. Deliverables (2012–2014)
● Prototype for external use
● SPARQL query service
● Two RDF dataset releases in 2012
– April 2012 (22m triples)
– July 2012 (270m triples)
● Live updates to query endpoint
Led to (2014–)
● Focus on internal use-cases
● Publish ontology pages
● Periodic data snapshots
NPG Linked Data Platform (2012)
25. Features
● Hybrid RDF + XML architecture
– MarkLogic for XML, RDF/XML
– Triplestore (TDB) for RDF validation
● Repo’s for binary assets
Layout
! Semantic RDF/XML includes in XML
● RDF objects serialized in list order
● Application XML for subject hierarchy
Indexes
● Indexes over all elements
● Range indexes for datatypes (e.g. dates)
NPG Content Hub (2014): Hybrid Architecture
33. a DB/OO
scheme
Arbitrary relations plus
axioms, constraints
and rules expressed
in a logical languagea glossary
an axiomatized
theory
a thesaurus
a taxonomy
Taxonomy plus
related terms;
captures synonymy,
homonymy etc.
Complexity (ontological depth)
A controlled
vocabulary with NL
definitions (e.g.
lexicon)
- Publishers
- Relations
- Publish-states
A c.v. that captures
broaderThan /
narrowerThan
relationships
- Subjects,
- Article Types
Relational model:
unconstrained use
of arbitrary relations
Scigraph
Core ontology
Ontologies and Taxonomies: overview
37. 37
SKOS taxonomies: Subjects
- Structure: SKOS, ~2500 concepts, multi hierarchical tree, 6 branches, 7 levels of depth
- Mappings: 100% of terms, using skos:broadMatch or skos:closeMatch, (Dbpedia and
MESH)
- Document tagging: mostly manual, different workflows, often costly and inconsistent
40. 40
Naming Architecture: federated model
> Dereference and 303 redirects:
- http://name.scigraph.com/{things}/
- http://data.scigraph.com/{things}/
> Two patterns: schemas and instances
- http://name.scigraph.com/ontologies/{domain}/
- http://name.scigraph.com/{domain}/{things}/
> Prefixes for schemas and instances
- @prefix sg: <http://name.scigraph.com/ontologies/core/> .
> Entity names follow a robust convention
- camel-case for naming terms, with an initial uppercase for
classes and an initial lowercase for properties.
> Named graphs used to track provenance
41. 41
Scigraph - Data Flow
Peer
Review
DDS
Core
Media
UNSILO TARGET
Uber
Research
DBPedia etc..
KNOWLEDGE GRAPH
JSON-LD API DDS Adapter TTL Loader RDF Loader ..
data
sources
integration
layer
real time
services
Peer Review
Service
Search Service
(Content Hub)
applications Peer Review Oscar Search
data is delivered to
applications via fast APIs
data is extracted and
denormalised so to support
applications
data is normalised and
mapped to SN ontologies
42. 42
ETL Architecture: main features [in evolution]
Tech stack
> Airflow framework (Airbnb)
> Amazon S3 to make backups
> GraphDB triplestore (staging and presentation)
> Elastic search and APIs
Components & Principles
> Graph must be ‘ephemeral’
> Data sources versioning algorithm
> Identity Persistence service
> Validation via SHACL (TopBraid API)
45. 45
Data Validation: from SPIN to SHACL
> SPIN SPARQL syntax
(2011, TopQuadrant)
> Example: “if a Journal
instance has no short
title, raise an Exception”
> Main drawback: hard to
maintain and to read by
non specialists
46. 46
Data Validation: from SPIN to SHACL
> SHACL - Shapes
Constraint Language
(2016, TopQuadrant)
> Example: “all article
instances should have a
valid DOI”
> Example: “all grants
instances should have
max 1 start year and end
year”
> Approach: polish data
before entering the
triplestore, use triplestore
inference primarily for
integration
48. 48
Looking Ahead
Summary
● Scigraph is our latest LD platform - public version live in late 2016
● SW tech allows for scalable enterprise-level metadata management
● It is crucial to distinguish between data Integration VS (real time) data delivery
● Still a work in progress… suggestions or feedback very welcome!
Ongoing Work
● Ontology: federated model, more advanced inferencing capabilities
● Build internal/external APIs (JSON-LD) by integrating also NoSQL
● Tools for analytics, reporting, visualisation, interactive exploration of the graph
● Entities extraction: scientific entities, places, people, events etc..
● We’re looking to collaborate… Crossref, W3C, building a Linked Science Web
50. 50
The Knowledge Graph team
CORE TEAM
*Markus Kaindl: Product Owner
*Ben Kirkley: Project Manager
* Michele Pasin: Lead Data Architect
*Tony Hammond: Data Architect
* Matias Piipari: Lead Engineer
* Hilverd Reker: Software Engineer
*Artur Konczak: Software Engineer
*<blankNode>: Data Scientist
*<blankNode>: Data Engineer
DIGITAL SCIENCE
* Martin Szomszor: Data Scientist
*Richard Koks: Data Scientist
* Mario Diwersy: CTO, Uber Research
PROGRAM SPONSOR
* Henning Schoenenberger: Director Data &
Metadata