The document describes the Humanities Networked Infrastructure (HuNI) project. HuNI aims to create a virtual laboratory that integrates 28 Australian cultural datasets and enables new forms of humanities research. It will harvest data from partner organizations, transform it into a searchable format and linked open data, and develop tools for researchers to discover, analyze, annotate, and share collections across the integrated datasets. The project is led by Deakin University with funding from NeCTAR and contributions from partner organizations.
The document discusses the Humanities Networked Infrastructure (HuNI) project. HuNI aims to (1) integrate cultural data from 28 sources at a national level, (2) make this aggregated data accessible through a new national data service, and (3) connect the data to the Linked Data Cloud to allow for complex queries. The HuNI lab app will allow users to discover, explore, connect, curate and share data as well as save and import their own data. HuNI intends to change the nature of humanities research by enabling work with larger datasets and breaking down disciplinary data boundaries to promote sharing and collaboration.
Research My World: Pilot Project Evaluation (Deb Verhoeven)
This document provides an evaluation of a pilot crowdfunding project called "Research My World" conducted by Deakin University in Australia. The key findings were:
- 6 out of 8 research projects were successfully funded, raising over $50,000 with additional funds raised after.
- The projects generated over 200 media stories reaching over 1.4 million people and over 3,600 tweets.
- The researchers improved their digital and social media skills, and saw increased profiles and networks.
- Factors like the reach of Twitter networks and driving traffic to project websites correlated with funding success.
Big Cinema Data: Analysing global cinema showtimes (Deb Verhoeven)
Looking at cinema exhibition and distribution at an international scale requires data beyond broad aggregates: it requires data specific to individual films and cinema venues in order to appreciate the intricate temporal and geographic patterns of flow. The Kinomatics Project has tracked the global flow of individual film screenings (down to date and time) for over 54,000 films across 30,000 venues in 48 countries.
This presentation will highlight the importance of global-scale analysis and data through three case studies. The first tracks the spatial and temporal release patterns of The Hobbit: An Unexpected Journey, highlighting the complexities of international cinema enterprises and the subtleties of contemporary releasing strategies. The second explores the relationship between remittance flows and the movement of film around the globe, with a focus on Bollywood films. The third tests dyadic relationships between countries. The presentation will also introduce some of the methods used to analyse and visualise the data in the three case studies.
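As a rough illustration of the dyadic approach, the sketch below counts how many films each pair of countries shares in a showtimes table. The column names and toy data are assumptions for illustration, not the Kinomatics schema.

```python
from itertools import combinations

import pandas as pd

# Hypothetical showtime records: film_id and country are assumed column names.
showtimes = pd.DataFrame({
    "film_id": ["f1", "f1", "f2", "f2", "f3"],
    "country": ["AU", "IN", "AU", "NZ", "IN"],
})

# Set of countries each film screened in.
films_by_country = showtimes.groupby("film_id")["country"].apply(set)

# Count films shared by each unordered country pair: a crude dyadic link strength.
pair_counts: dict[tuple[str, str], int] = {}
for countries in films_by_country:
    for pair in combinations(sorted(countries), 2):
        pair_counts[pair] = pair_counts.get(pair, 0) + 1

print(pair_counts)  # e.g. {('AU', 'IN'): 1, ('AU', 'NZ'): 1}
```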
Songification: Enhancing live music data (Deb Verhoeven)
This document summarizes a presentation about using music gig data to create musical compositions. It discusses analyzing the location, date, and frequency of gigs played by several Australian bands from the 1960s-1970s. This data is converted into musical notes based on the frequency and distance of gigs from a central point. The resulting notes are played together to create songified versions of the bands' gig histories and movements around venues in that era. Examples of songs created from the gig data of bands like Max Merritt and the Meteors, Billy Thorpe and the Aztecs, and Doug Parkinson are presented.
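The mapping itself is simple enough to sketch. The snippet below is a hypothetical illustration (the reference point, field layout, and pitch/duration rules are all invented), not the sonification method actually used in the presentation.

```python
import math

# Hypothetical gig records: (venue_lat, venue_lon, gigs_played).
MELBOURNE = (-37.81, 144.96)  # assumed central reference point
gigs = [(-37.80, 144.97, 12), (-37.90, 145.10, 3), (-38.15, 144.36, 7)]

def distance_km(lat, lon, ref=MELBOURNE):
    # Equirectangular approximation, fine at city scale.
    dlat = math.radians(lat - ref[0])
    dlon = math.radians(lon - ref[1]) * math.cos(math.radians(ref[0]))
    return 6371 * math.hypot(dlat, dlon)

for lat, lon, count in gigs:
    d = distance_km(lat, lon)
    pitch = 60 + int(d) % 24         # farther venues -> higher pitch, folded into two octaves
    duration = max(0.25, 4 / count)  # busier venues -> shorter, denser notes
    print(f"venue at {d:5.1f} km -> MIDI note {pitch}, {duration:.2f} beats x {count}")
```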
The document summarizes the history and dispersal of the Phillipps manuscript collection. It was assembled by Sir Thomas Phillipps in the 19th century and grew to over 60,000 volumes, making it one of the largest medieval manuscript collections ever amassed by a private collector. After Phillipps' death, the collection was reorganized and portions were sold off over the subsequent decades to libraries and at auction. The document outlines the project to digitally represent the provenance of manuscripts in the collection using linked data approaches and visualize their transmission over time and between collectors.
Presentation by Toby Burrows and Deb Verhoeven to the Fifth National Forum of AeRO (the Australian eResearch Organization), held in Perth on 26 July 2013. The presentation gives an overview of the HuNI Project as at July 2013. Topics covered include: data ingest and alignment from 28 Australian humanities datasets; building HuNI’s discovery functionality; and designing Virtual Laboratory tools for researchers.
Presentation at the Australasian Consortium of Humanities Research Centres (ACHRC), July 2013. Panel description:
The Digital Humanities offers not only new tools to support what we do in the Humanities, but also new ways of thinking about what it is that we do. This panel will build upon Alan Liu’s keynote discussion of ideas for digital tools for humanities advocacy and speak to the way non-digital centres can benefit from digital humanities initiatives.
CENDARI Summer School July 2015 (Toby Burrows)
This document discusses how medieval studies can benefit from applying digital humanities approaches and linked open data principles. It notes that while there are many digitized resources for medieval texts and manuscripts, they generally exist independently without connections between them. The document advocates for assigning unique identifiers to medieval people, places, manuscripts and their components to facilitate linking related data across different projects and datasets. This would allow for more integrated searching, browsing and analysis of medieval materials by establishing semantic relationships between entities. Achieving widespread interoperability through linked open data approaches could help address the current problems of information overload and duplication of effort in medieval digital scholarship.
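As a hedged sketch of what such linking could look like in practice, the snippet below uses rdflib and CIDOC-CRM to connect a manuscript identifier to a scribe via a production event. The example.org namespace and all identifiers are invented; only the CIDOC-CRM class and property names are standard.

```python
from rdflib import Graph, Literal, Namespace, RDF

CRM = Namespace("http://www.cidoc-crm.org/cidoc-crm/")
EX = Namespace("http://example.org/medieval/")  # hypothetical namespace

g = Graph()
manuscript = EX["manuscript/ms-001"]   # invented identifiers
scribe = EX["person/some-scribe"]
production = EX["event/production-1"]

g.add((manuscript, RDF.type, CRM["E22_Man-Made_Object"]))
g.add((manuscript, CRM["P102_has_title"], Literal("Book of Hours")))
g.add((scribe, RDF.type, CRM["E21_Person"]))

# CIDOC-CRM routes creation through an event, linking object and person.
g.add((production, RDF.type, CRM["E12_Production"]))
g.add((production, CRM["P108_has_produced"], manuscript))
g.add((production, CRM["P14_carried_out_by"], scribe))

print(g.serialize(format="turtle"))
```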
This document discusses crowdfunding for university research projects. It provides information on different types of crowdfunding platforms, including domain-specific, university-specific, and education platforms. Data is presented on the success of the Pozible crowdfunding platform in launching and fully funding research projects. Charts show funding amounts over time for individual projects and overall. Factors that contribute to project success are discussed, including the Twitter network of the project principal and social media engagement with the project website.
An information system is designed to capture, store, process, and provide access to information to support organizational processes and decision making. The document discusses the design of a resource registry information system to support a hybrid cloud-based infrastructure. The resource registry collects and manages metadata about software systems, resources, and their status to enable service discovery, monitoring, and elastic resource allocation. It implements an open model to flexibly support evolving resource types and management needs over the long lifespan of the infrastructure.
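A minimal sketch of such an "open model" registry is shown below; the class names and fields are assumptions for illustration, not the system's actual design. The point is that resource types are not fixed in the schema, so new kinds of resources can be registered without migration.

```python
from dataclasses import dataclass, field
from datetime import datetime, timezone

@dataclass
class ResourceRecord:
    resource_id: str
    resource_type: str                 # e.g. "vm", "service", "storage": open-ended
    status: str = "unknown"
    properties: dict = field(default_factory=dict)  # type-specific metadata
    updated_at: datetime = field(default_factory=lambda: datetime.now(timezone.utc))

class ResourceRegistry:
    def __init__(self):
        self._records: dict[str, ResourceRecord] = {}

    def register(self, record: ResourceRecord) -> None:
        self._records[record.resource_id] = record

    def discover(self, resource_type: str, **required_props):
        # Service discovery: match on type plus arbitrary property filters.
        return [
            r for r in self._records.values()
            if r.resource_type == resource_type
            and all(r.properties.get(k) == v for k, v in required_props.items())
        ]

registry = ResourceRegistry()
registry.register(ResourceRecord("vm-1", "vm", "running", {"zone": "eu-1", "cores": 8}))
print(registry.discover("vm", zone="eu-1"))
```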
Ontologies and the humanities: some issues affecting the design of digital in... (Toby Burrows)
This document discusses issues related to designing digital infrastructure for the humanities using ontologies. It notes that there are many ongoing efforts to develop ontologies for different domains in digital humanities. However, it also acknowledges linguistic, semantic, and conceptual difficulties in representing humanities knowledge through ontologies. As an alternative, it discusses strategies like topic modeling, linked data, and conceptual spaces that may better capture humanistic perspectives on relationships, cognition, and meaning. It argues that future humanities research should look beyond ontologies alone and examine computational modeling from cognitive science and philosophy.
1) The document discusses the opportunity for technology to improve organizational efficiency and transition economies into a "smart and clean world."
2) It argues that aggregate efficiency has stalled at around 22% for 30 years due to limitations of the Second Industrial Revolution, but that digitizing transport, energy, and communication through technologies like blockchain can help manage resources and increase efficiency.
3) Technologies like precision agriculture, cloud computing, robotics, and autonomous vehicles may allow for "dematerialization" and do more with fewer physical resources through effects like reduced waste and need for transportation/logistics infrastructure.
NeCTAR is an Australian government initiative to enhance research collaboration through the development of eResearch infrastructure. The University of Melbourne has been appointed lead agent. Objectives include developing a national cloud computing infrastructure called the NeCTAR Research Cloud using OpenStack. The cloud aims to provide researchers from any discipline with access to computing resources and collaboration tools from multiple sites around Australia.
An overview of EMBL-ABR, including bioinformatics infrastructure initiatives at national and pan-national levels across the globe, and the activities EMBL-ABR is currently undertaking.
Open science refers to making scientific research and data accessible to all. It includes open access to publications, open data, open source software, open notebooks, and citizen science. The European Union supports open science to increase transparency, collaboration and innovation in research. A workshop was held in South Africa to help develop an open science policy, which is expected to be finalized in September 2018 after additional workshops involving the European Union. Open science aims to make the entire research process publicly available and reusable to maximize scientific progress.
The African Open Science Platform (AOSP) aims to promote open data and open science across Africa. It is funded by the National Research Foundation of South Africa and managed by the Academy of Science of South Africa. AOSP focuses on developing policy frameworks, infrastructure, capacity building initiatives, and incentives to encourage data sharing across four key areas. It has already held several workshops and events in its first two years and outlines further actions and deliverables to advance open science in Africa.
RDAP 15 EarthCollab: Connecting Scientific Information Sources using the Sema... (ASIS&T)
Research Data Access and Preservation Summit, 2015
Minneapolis, MN
April 22-23, 2015
Erica M. Johns, Jon Corson-Rikert, Huda J. Khan, Dean B. Krafft and Matthew S. Mayernik
Presentation given at the first Pubmet conference, Zadar, Croatia, 2014.
For the live presentation with all the rich media, please visit: http://kosson.ro/webpedia/presentationsnicolaiec/Croatia2014/#/
The document summarizes the African Open Science Platform (AOSP). It discusses that AOSP aims to coordinate open science activities across Africa to increase collaboration, data sharing and reuse. It is focused on four key areas: establishing an open data forum, funding research infrastructure initiatives, funding transdisciplinary research projects, and developing open data policies. The ultimate goals are to accelerate discovery through open data, attract more funding, and contribute to global knowledge. AOSP is funded by the National Research Foundation of South Africa and managed by the Academy of Science of South Africa.
The document discusses the African Open Science Platform (AOSP) project. It notes that AOSP aims to support the development of open science in Africa by providing a coordinating platform. Specifically, it will work to establish an African open data platform, fund research data infrastructure initiatives, co-design open data policies, and develop incentives and training for research data science. The ultimate goal is to help African research institutions better manage, share and reuse research data according to FAIR open data principles.
This document provides an overview of the collaboration between the Australian National Data Service (ANDS) and the Terrestrial Ecosystem Research Network (TERN) to share Australia's nationally significant terrestrial ecosystem data. It describes ANDS and TERN, the establishment of national data collections, and a case study of the TERN national collection. The presentation demonstrates how the TERN collection is represented in Research Data Australia and outlines future work, such as bringing in more related ecosystem data assets and services.
The document discusses the African Open Science Platform (AOSP) project, which aims to support the development of open science in Africa. Key points:
- AOSP is a 3-year pilot project starting in 2016 that is funded by the South African Department of Science and Technology to establish an open data platform and coordinate open science initiatives across Africa.
- It is being implemented by the Academy of Science of South Africa (ASSAf) in partnership with organizations like the Association of African Universities (AAU) and UbuntuNet Alliance.
- The project involves several work packages, including establishing open data policies, research data infrastructure, training programs, and a roadmap for African research data.
Presentation of current challenges in upgrading the infrastructure for access and preservation of social science research data, and the workflow in the Slovene social science data archive.
The document discusses funding opportunities from the National Natural Science Foundation of China (NSFC) for Chinese and European researchers. It provides an overview of NSFC, including its mission to support basic research, foster talented researchers, and strengthen international cooperation. It then describes various NSFC funding programs for individual researchers and international joint projects. Specific funding opportunities for Chinese-European collaboration are highlighted, such as joint research projects supported through agreements between NSFC and research organizations in Europe.
This document discusses open science and research. It defines open science as making research transparent and accessible at all stages of the research process through open access, open data, open source code and open notebooks. It outlines the key elements of open science like open access publishing, open data repositories, open source software, citizen science and more. It also discusses open science initiatives in Europe, Africa and South Africa and the need for urgent policy actions to promote open science.
The document summarizes the African Open Science Platform (AOSP), which aims to coordinate open science activities across Africa. It discusses open science principles and the rationale for an African platform to increase collaboration, data reuse, and accelerate discovery. The AOSP is funded by the NRF of South Africa and managed by ASSAf. It focuses on establishing an open data forum, funding research infrastructure and projects, developing open data policies and training, and creating incentives for data sharing. Initial actions include workshops on policy, capacity building, and surveys to inform the platform's development.
The Humanities Networked Infrastructure (HuNI) combines data from 30 Australian cultural websites into the largest humanities and creative arts database in Australia. It covers all disciplines and brings together information on people, works, events, organizations and places that make up Australia's rich cultural landscape. Researchers can search HuNI, save search results as virtual collections, refine collections by adding links between records, and share collections with other researchers.
The document contains checklists for researchers and universities to prepare for crowdfunding campaigns. The institutional checklist covers having a point person, payment methods, receipting procedures, PR and marketing support, and being prepared to support projects beyond the campaign. The researcher checklist covers having an existing project, promotion and networking strategies, production of a promotional video, dedicating time to promotion, and support from their school and faculty. The overall document provides guidance to help ensure researchers and their universities are ready to successfully conduct a crowdfunding campaign.
Visualizing Cinema Data: Presentation at HOMER, Prague 2013 (Deb Verhoeven)
Cinema data is characteristically complex, heterogeneous and interlinked. Rather than relying on simple information retrieval techniques, researchers are increasingly turning to the creative exploration and reapplication of data in order to more fully explore the meaning of newly available and diverse data sets. In this context, the cinema historian becomes the creator of visual texts which can be assessed for both their interpretive insight and their aesthetic qualities. This paper presents four research projects that develop different spatio-temporal visualisation techniques to understand the industrial dynamics of post-war film exhibition and distribution in Australia. The research integrates groundbreaking work by a group of inter-disciplinary investigators into the effectiveness of techniques such as dendritic mapping, Circos, time-series graphs, animation, cartogram mapping, and multivariate visualisation for the study of cinema circuits and operations at a number of scales.
Kinomatics: Presentation at HOMER, Prague 2013 (Deb Verhoeven)
The document discusses the Kinomatics project which uses large datasets of cinema data to analyze trends in the global film industry. The project is led by researchers from Deakin and RMIT Universities and collects data on movie showtimes, venues and box office earnings from 48 countries. It aims to use this "big data" to better understand factors influencing the film industry and enable predictive analysis. The volume of data collected each week is demonstrated and challenges around data veracity are also discussed.
Mapping the Australian Screen Content Producer (Deb Verhoeven)
The document discusses a survey of Australian screen content producers. It aimed to map the culture, motivations, and aspirations of over 4,000 producers in defined populations. The survey included open and closed questions about classifications, projects, education, employment, industry sentiment, attitudes, and perceptions.
The document also discusses the role and responsibilities of a producer. Producers are involved in all aspects of a production from development through post-production and marketing. They oversee the creative process and make important decisions. While no single producer handles every task, they must perform a majority of producing functions.
Producers tend to work in industries like health, education, media, and finance. They are motivated more by intrinsic rewards than…
Removing Uninteresting Bytes in Software Fuzzing (Aftab Hussain)
Imagine a world where software fuzzing, the process of mutating bytes in test seeds to uncover hidden and erroneous program behaviors, becomes faster and more effective. A lot depends on the initial seeds, which can significantly dictate the trajectory of a fuzzing campaign, particularly in terms of how long it takes to uncover interesting behaviour in your code. We introduce DIAR, a technique designed to speedup fuzzing campaigns by pinpointing and eliminating those uninteresting bytes in the seeds. Picture this: instead of wasting valuable resources on meaningless mutations in large, bloated seeds, DIAR removes the unnecessary bytes, streamlining the entire process.
In this work, we equipped AFL, a popular fuzzer, with DIAR and examined two critical Linux libraries -- Libxml's xmllint, a tool for parsing xml documents, and Binutil's readelf, an essential debugging and security analysis command-line tool used to display detailed information about ELF (Executable and Linkable Format). Our preliminary results show that AFL+DIAR does not only discover new paths more quickly but also achieves higher coverage overall. This work thus showcases how starting with lean and optimized seeds can lead to faster, more comprehensive fuzzing campaigns -- and DIAR helps you find such seeds.
- These are slides of the talk given at IEEE International Conference on Software Testing Verification and Validation Workshop, ICSTW 2022.
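DIAR's actual analysis is described in the paper; purely as an illustration of the general idea, the sketch below drops any byte whose removal leaves a (stubbed) behaviour signature unchanged. The coverage function is a placeholder: a real harness would run the target (e.g. readelf) and collect edge coverage.

```python
def get_coverage(data: bytes) -> frozenset:
    # Placeholder "behaviour signature"; stands in for real coverage feedback.
    return frozenset(data[:4])

def trim_seed(seed: bytes) -> bytes:
    baseline = get_coverage(seed)
    trimmed = bytearray(seed)
    i = 0
    while i < len(trimmed):
        candidate = trimmed[:i] + trimmed[i + 1:]
        if get_coverage(bytes(candidate)) == baseline:
            trimmed = candidate        # byte i was uninteresting; drop it
        else:
            i += 1                     # byte i matters; keep it and move on
    return bytes(trimmed)

print(trim_seed(b"ELF\x01 padding padding"))  # -> b'ELF\x01' under the stub
```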
In his public lecture, Christian Timmerer provides insights into the fascinating history of video streaming, starting from its humble beginnings before YouTube to the groundbreaking technologies that now dominate platforms like Netflix and ORF ON. Timmerer also presents provocative contributions of his own that have significantly influenced the industry. He concludes by looking at future challenges and invites the audience to join in a discussion.
Unlocking Productivity: Leveraging the Potential of Copilot in Microsoft 365, a presentation by Christoforos Vlachos, Senior Solutions Manager – Modern Workplace, Uni Systems
Things to Consider When Choosing a Website Developer for your Website | FODUU
Choosing the right website developer is crucial for your business. This article covers essential factors to consider, including experience, portfolio, technical skills, communication, pricing, reputation & reviews, cost and budget considerations and post-launch support. Make an informed decision to ensure your website meets your business goals.
Ocean lotus Threat actors project by John Sitima 2024 (1).pptx (SitimaJohn)
Ocean Lotus cyber threat actors represent a sophisticated, persistent, and politically motivated group that poses a significant risk to organizations and individuals in the Southeast Asian region. Their continuous evolution and adaptability underscore the need for robust cybersecurity measures and international cooperation to identify and mitigate the threats posed by such advanced persistent threat groups.
Have you ever been confused by the myriad of choices offered by AWS for hosting a website or an API?
Lambda, Elastic Beanstalk, Lightsail, Amplify, S3 (and more!) can each host websites + APIs. But which one should we choose?
Which one is cheapest? Which one is fastest? Which one will scale to meet our needs?
Join me in this session as we dive into each AWS hosting service to determine which one is best for your scenario and explain why!
Best 20 SEO Techniques To Improve Website Visibility In SERP (Pixlogix Infotech)
Boost your website's visibility with proven SEO techniques! Our latest blog dives into essential strategies to enhance your online presence, increase traffic, and rank higher on search engines. From keyword optimization to quality content creation, learn how to make your site stand out in the crowded digital landscape. Discover actionable tips and expert insights to elevate your SEO game.
Fueling AI with Great Data with Airbyte Webinar (Zilliz)
This talk will focus on how to collect data from a variety of sources, leveraging this data for RAG and other GenAI use cases, and finally charting your course to productionalization.
UiPath Test Automation using UiPath Test Suite series, part 6 (DianaGray10)
Welcome to UiPath Test Automation using UiPath Test Suite series, part 6. In this session, we will cover test automation with generative AI and OpenAI.
The UiPath Test Automation with generative AI and OpenAI webinar offers an in-depth exploration of leveraging cutting-edge technologies for test automation within the UiPath platform. Attendees will delve into the integration of generative AI, as a test automation solution, with OpenAI's advanced natural language processing capabilities.
Throughout the session, participants will discover how this synergy empowers testers to automate repetitive tasks, enhance testing accuracy, and expedite the software testing life cycle. Topics covered include the seamless integration process, practical use cases, and the benefits of harnessing AI-driven automation for UiPath testing initiatives. By attending this webinar, testers, and automation professionals can gain valuable insights into harnessing the power of AI to optimize their test automation workflows within the UiPath ecosystem, ultimately driving efficiency and quality in software development processes.
What will you get from this session?
1. Insights into integrating generative AI.
2. Understanding how this integration enhances test automation within the UiPath platform
3. Practical demonstrations
4. Exploration of real-world use cases illustrating the benefits of AI-driven test automation for UiPath
Topics covered:
What is generative AI
Test Automation with generative AI and OpenAI
UiPath integration with generative AI
Speaker:
Deepak Rai, Automation Practice Lead, Boundaryless Group and UiPath MVP
AI-Powered Food Delivery Transforming App Development in Saudi Arabia.pdf (Techgropse Pvt.Ltd.)
In this blog post, we'll delve into the intersection of AI and app development in Saudi Arabia, focusing on the food delivery sector. We'll explore how AI is revolutionizing the way Saudi consumers order food, how restaurants manage their operations, and how delivery partners navigate the bustling streets of cities like Riyadh, Jeddah, and Dammam. Through real-world case studies, we'll showcase how leading Saudi food delivery apps are leveraging AI to redefine convenience, personalization, and efficiency.
Full-RAG: A modern architecture for hyper-personalization (Zilliz)
Mike Del Balso, CEO & Co-Founder at Tecton, presents "Full RAG," a novel approach to AI recommendation systems, aiming to push beyond the limitations of traditional models through a deep integration of contextual insights and real-time data, leveraging the Retrieval-Augmented Generation architecture. This talk will outline Full RAG's potential to significantly enhance personalization, address engineering challenges such as data management and model training, and introduce data enrichment with reranking as a key solution. Attendees will gain crucial insights into the importance of hyperpersonalization in AI, the capabilities of Full RAG for advanced personalization, and strategies for managing complex data integrations for deploying cutting-edge AI solutions.
What do a Lego brick and the XZ backdoor have in common? (Speck&Tech)
ABSTRACT: At first glance, a Lego brick and the XZ backdoor might seem to have in common only that both are building blocks, or dependencies, of creative and software projects. In reality, a Lego brick and the XZ backdoor case share much more than that.
Join the presentation to dive into a story of interoperability, standards and open formats, and then discuss the important role that contributors play in a sustainable open source community.
BIO: Advocate for free software and for standard, open formats. She has been an active member of the Fedora and openSUSE projects and co-founded the LibreItalia Association, where she has been involved in several LibreOffice-related events, migrations and training activities. She previously worked on LibreOffice migrations and training courses for several public administrations and private companies. Since January 2020 she has worked at SUSE as a Software Release Engineer for Uyuni and SUSE Manager, and when she is not following her passion for computers and for Geeko, she cultivates her curiosity about astronomy (hence her nickname, deneb_alpha).
HCL Notes and Domino License Cost Reduction in the World of DLAU (panagenda)
Webinar Recording: https://www.panagenda.com/webinars/hcl-notes-and-domino-license-cost-reduction-in-the-world-of-dlau/
The introduction of DLAU and the CCB & CCX licensing model caused quite a stir in the HCL community. As a Notes and Domino customer, you may have faced challenges with unexpected user counts and license costs. You probably have questions on how this new licensing approach works and how to benefit from it. Most importantly, you likely have budget constraints and want to save money where possible. Don’t worry, we can help with all of this!
We’ll show you how to fix common misconfigurations that cause higher-than-expected user counts, and how to identify accounts which you can deactivate to save money. There are also frequent patterns that can cause unnecessary cost, like using a person document instead of a mail-in for shared mailboxes. We’ll provide examples and solutions for those as well. And naturally we’ll explain the new licensing model.
Join HCL Ambassador Marc Thomas in this webinar with a special guest appearance from Franz Walder. It will give you the tools and know-how to stay on top of what is going on with Domino licensing. You will be able lower your cost through an optimized configuration and keep it low going forward.
These topics will be covered
- Reducing license cost by finding and fixing misconfigurations and superfluous accounts
- How do CCB and CCX licenses really work?
- Understanding the DLAU tool and how to best utilize it
- Tips for common problem areas, like team mailboxes, functional/test users, etc
- Practical examples and best practices to implement right away
Monitoring and Managing Anomaly Detection on OpenShift.pdf (Tosin Akinosho)
Monitoring and Managing Anomaly Detection on OpenShift
Overview
Dive into the world of anomaly detection on edge devices with our comprehensive hands-on tutorial. This SlideShare presentation will guide you through the entire process, from data collection and model training to edge deployment and real-time monitoring. Perfect for those looking to implement robust anomaly detection systems on resource-constrained IoT/edge devices.
Key Topics Covered
1. Introduction to Anomaly Detection
- Understand the fundamentals of anomaly detection and its importance in identifying unusual behavior or failures in systems.
2. Understanding Edge (IoT)
- Learn about edge computing and IoT, and how they enable real-time data processing and decision-making at the source.
3. What is ArgoCD?
- Discover ArgoCD, a declarative, GitOps continuous delivery tool for Kubernetes, and its role in deploying applications on edge devices.
4. Deployment Using ArgoCD for Edge Devices
- Step-by-step guide on deploying anomaly detection models on edge devices using ArgoCD.
5. Introduction to Apache Kafka and S3
- Explore Apache Kafka for real-time data streaming and Amazon S3 for scalable storage solutions.
6. Viewing Kafka Messages in the Data Lake
- Learn how to view and analyze Kafka messages stored in a data lake for better insights.
7. What is Prometheus?
- Get to know Prometheus, an open-source monitoring and alerting toolkit, and its application in monitoring edge devices.
8. Monitoring Application Metrics with Prometheus
- Detailed instructions on setting up Prometheus to monitor the performance and health of your anomaly detection system.
9. What is Camel K?
- Introduction to Camel K, a lightweight integration framework built on Apache Camel, designed for Kubernetes.
10. Configuring Camel K Integrations for Data Pipelines
- Learn how to configure Camel K for seamless data pipeline integrations in your anomaly detection workflow.
11. What is a Jupyter Notebook?
- Overview of Jupyter Notebooks, an open-source web application for creating and sharing documents with live code, equations, visualizations, and narrative text.
12. Jupyter Notebooks with Code Examples
- Hands-on examples and code snippets in Jupyter Notebooks to help you implement and test anomaly detection models.
Driving Business Innovation: Latest Generative AI Advancements & Success Story (Safe Software)
Are you ready to revolutionize how you handle data? Join us for a webinar where we’ll bring you up to speed with the latest advancements in Generative AI technology and discover how leveraging FME with tools from giants like Google Gemini, Amazon, and Microsoft OpenAI can supercharge your workflow efficiency.
During the hour, we’ll take you through:
Guest Speaker Segment with Hannah Barrington: Dive into the world of dynamic real estate marketing with Hannah, the Marketing Manager at Workspace Group. Hear firsthand how their team generates engaging descriptions for thousands of office units by integrating diverse data sources—from PDF floorplans to web pages—using FME transformers, like OpenAIVisionConnector and AnthropicVisionConnector. This use case will show you how GenAI can streamline content creation for marketing across the board.
Ollama Use Case: Learn how Scenario Specialist Dmitri Bagh has utilized Ollama within FME to input data, create custom models, and enhance security protocols. This segment will include demos to illustrate the full capabilities of FME in AI-driven processes.
Custom AI Models: Discover how to leverage FME to build personalized AI models using your data. Whether it’s populating a model with local data for added security or integrating public AI tools, find out how FME facilitates a versatile and secure approach to AI.
We’ll wrap up with a live Q&A session where you can engage with our experts on your specific use cases, and learn more about optimizing your data workflows with AI.
This webinar is ideal for professionals seeking to harness the power of AI within their data management systems while ensuring high levels of customization and security. Whether you're a novice or an expert, gain actionable insights and strategies to elevate your data processes. Join us to see how FME and AI can revolutionize how you work with data!
2. CRICOS Provider Code: 00113B
NATIONAL E-RESEARCH COLLABORATION TOOLS AND RESOURCES (NeCTAR)
NeCTAR is a $47 million Australian Government project, conducted as part of the Super Science initiative and financed by the Education Investment Fund. The University of Melbourne is the lead agent, chosen by the Commonwealth Government.
4. HuNI BROAD BENEFITS
• Ensure that Australian cultural datasets and the research associated with them become part of the emerging international Linked Open Data environment.
• Enable research enquiries to move easily from "what is?" to "where is?"
• Support the role of annotation and metadata in the discovery of new knowledge, or the means to elucidate new knowledge.
• Position the idea of data as both a subject and an object of analysis in the humanities.
• Contribute to debates around standards for development and implementation.
5. HUNI: SPECIFIC BENEFITS
• Enable humanities researchers to work with cultural datasets more efficiently and effectively, and on a larger scale;
• Encourage the systematic sharing of research data between humanities researchers (including the cultural dataset curators themselves), the community and cultural institutions;
• Encourage a greater level of cross-disciplinary and interdisciplinary research, both within the humanities/creative arts and between the humanities/creative arts and other disciplines, and the wider public;
• Support innovative methodologies such as network analysis, game theory and 'virtual history' that rely on large-scale datasets.
6. INTEROPERABILITY
1. Organisational level: the goals and processes of the institutions involved.
2. Semantic level: the meaning of the exchanged digital resources.
3. Technical level: implementing data interoperability requires both data integration and data exchange processes, as well as enabling effective use of the data that becomes available. (Pasquale Pagano, 'Data Interoperability', GRDI2020)
4. Project level: the advent of more complex 'big humanities' projects requires multiple, multi-disciplinary personnel, which in turn entails organising different workflows and expectations: e.g. the challenge of developing a comprehensive or consortial approach, a common definition of project method, etc.
7. 1. A PARTNERSHIP
… a Deakin-led consortium
• Cultural data providers (10): project co-operators
• Humanities software developer (1): project co-developer
• eResearch organisations (2): lead development agencies
8. HUNI PARTNER DATASETS
• Media (film, cinema, theatre, newspapers, magazines, advertising, music, live performances): AMHD, MAP, CAARP, Bonza, AFIRC, Circus Oz, AusStage
• Biographical (artists, designers, writers, significant people, scientists, Sydney demographics): DAAO, AustLit, AWR, ADB, DoS, EOAS
• Indigenous languages: AUSTLANG, Mura
17. Welcome to the Cinema and Audiences Research Project (CAARP) database: an online encyclopaedia of cinema-going in Australia.
Data: the site contains information on film screenings and venues in Australia, from 1846 to now:
• 430,137 screenings
• 10,256 films
• 1,978 cinemas
• 1,649 companies
18. A FISCAL COLLABORATION
• NeCTAR investment of $1.33M
• Partner contributions of $480,000
• Partner in-kind contributions amounting to >$1M
19. COMMUNITY BUILDING
• Collated user stories (20)
• Online showcase events: the next one is 4 September 2013
• Live link to the latest alpha prototype on huni.net.au, with feedback buttons
• Wider beta launch at eResearch Australasia in October 2013
• Stay up to date through our monthly newsletter and blog feed
• Follow us on Twitter: @HuNIVL
20. 2. INTEGRATING MEANING
The information design challenge is to build an ontology and use linked data and controlled vocabularies so that data can be aligned and related.
• Reading the data: the characteristics of the data determine the ontological components selected and the major "entities" (aka "access points").
• Identified early as: people, organisations, events, relationships, places, dates, resources, and subjects.
• Components from ontologies already available are being reused or kept in our sights: CIDOC-CRM, FOAF, FRBR, FRBR-OO, BibFrame and PROV-O.
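As an illustrative sketch only, attaching a controlled vocabulary term to a person record might look like the following in rdflib. The HuNI namespace URI and record identifiers are invented; the huni:hasOccupation property and huni:SKOS.Occupation class names follow the ontology diagram on the next slide.

```python
from rdflib import Graph, Literal, Namespace, RDF
from rdflib.namespace import SKOS

HUNI = Namespace("http://example.org/huni/")  # hypothetical namespace
CRM = Namespace("http://www.cidoc-crm.org/cidoc-crm/")

g = Graph()
person = HUNI["person/adb-smith-john"]          # invented identifiers
occupation = HUNI["vocab/occupation/shearer"]

g.add((person, RDF.type, CRM["E21_Person"]))
g.add((occupation, RDF.type, SKOS.Concept))
g.add((occupation, SKOS.prefLabel, Literal("shearer", lang="en")))
g.add((person, HUNI["hasOccupation"], occupation))  # links record to vocabulary term

print(g.serialize(format="turtle"))
```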
23. HUNI ONTOLOGY (all classes and object properties)
[Diagram: the full HuNI ontology graph, combining CIDOC-CRM classes (e.g. E21 Person, E5 Event, E53 Place, E12 Production), FRBR-OO classes (e.g. F1 Work, F2 Expression, F31 Performance), FOAF classes and HuNI/SKOS extensions (e.g. huni:SKOS.Occupation, huni:SKOS.Role), connected by subclass links and object properties such as cidoc:P1isIdentifiedBy, cidoc:P2hasType, cidoc:P14iperformed and frbr:R24created.]
25. 3. HuNI DATA ARCHITECTURE
[Diagram: on the partner side, the source datasets (ADB, DAAO, CAARP, AFIRC, AusStage) update and publish their data; on the HuNI side, the Corbicula components harvest, transform and ingest it, via data analysis and mapping, into a Solr Search Server [HuNI Data] and an RDF Triple Store [HuNI Linked Data]. The HuNI Virtual Laboratory sits on top, supporting scholarly and public/citizen researcher workflow tasks for data discovery, analysis and sharing (simple, advanced and deep SPARQL-based search; saving search results as private collections; refining/expanding, analysing, annotating, exporting and sharing collections) alongside admin tasks (registration and login, profile management, history recording, project management).]
26. DATA INTEGRATION
A total of 28 Australian datasets are being harvested for integration into HuNI. The harvesting process requires:
• Live data feeds deployed at the partner sites to publish updated partner data as XML
• Data gateway components, called HuNI Corbicula, deployed on the NeCTAR Cloud to harvest the XML feed data and transform it into forms suitable for ingestion into two HuNI data aggregates: a Solr search server [HuNI Data] and a Jena RDF Triple Store [HuNI Linked Data]
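A hedged sketch of one harvest-transform-ingest cycle is below. The feed URL, XML element names and Solr core name are all assumptions for illustration, not Corbicula's actual implementation.

```python
import xml.etree.ElementTree as ET

import requests

FEED_URL = "https://partner.example.org/huni-feed.xml"              # hypothetical feed
SOLR_UPDATE = "http://localhost:8983/solr/huni/update?commit=true"  # assumed core name

def harvest() -> list[dict]:
    xml = requests.get(FEED_URL, timeout=30).text
    docs = []
    for rec in ET.fromstring(xml).iter("record"):  # assumed element name
        docs.append({
            "id": rec.get("id"),
            "name": rec.findtext("name"),
            "entity_type": rec.findtext("type"),   # person, place, event, ...
        })
    return docs

def ingest(docs: list[dict]) -> None:
    # Solr accepts a JSON array of documents on its update handler.
    requests.post(SOLR_UPDATE, json=docs, timeout=30).raise_for_status()

ingest(harvest())
```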
[Diagram: the data integration portion of the HuNI architecture (as on slide 25), from the partner data feeds through Corbicula to the Solr Search Server and RDF Triple Store.]
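To make the Solr side of this pipeline concrete, here is a minimal sketch in Python of a Corbicula-style harvest, transform and ingest step. It is an illustration only: the feed URL, record fields and Solr core name are hypothetical, and the real Corbicula components are not reproduced here.

```python
# Minimal sketch of a Corbicula-style harvest-and-ingest step.
# The feed URL, record fields and Solr core name are hypothetical.
import requests
import pysolr
from lxml import etree

FEED_URL = "https://partner.example.org/huni/feed/person.xml"  # hypothetical feed
SOLR_URL = "http://localhost:8983/solr/huni"                   # hypothetical core

def harvest(feed_url):
    """Fetch the partner's flat XML feed and parse it."""
    response = requests.get(feed_url, timeout=30)
    response.raise_for_status()
    return etree.fromstring(response.content)

def transform(root):
    """Flatten each <person> record into a dict suitable for Solr."""
    for person in root.iter("person"):
        yield {
            "id": person.get("id"),
            "type": "person",
            "name": person.findtext("name"),
            "birth_date": person.findtext("birthDate"),
            "occupation": person.findtext("occupation"),
        }

def ingest(docs):
    """Submit the transformed records to the Solr search server."""
    solr = pysolr.Solr(SOLR_URL, timeout=10)
    solr.add(list(docs))

if __name__ == "__main__":
    ingest(transform(harvest(FEED_URL)))
```

The same harvested records are separately transformed into RDF for the triple store, as described in the editor's notes below.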
28. TECHNOLOGY STACK
• front-end framework - a single-page web app built with AngularJS and Twitter Bootstrap
• tools hosting framework - OpenSocial via Apache Shindig
• back-end framework - Spring MVC via Spring Roo
• layer integration - RESTful web services
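As a hedged illustration of the RESTful layer integration (the endpoint path, parameters and response shape below are assumptions, not HuNI's published API), a client could talk to the service layer like this:

```python
# Hypothetical example of calling a RESTful HuNI service layer.
# The endpoint path and JSON shape are assumptions for illustration only.
import requests

response = requests.get(
    "https://huni.net.au/api/search",      # hypothetical endpoint
    params={"q": "Melbourne", "rows": 10},
    timeout=10,
)
response.raise_for_status()
for record in response.json().get("results", []):
    print(record.get("id"), record.get("name"))
```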
29. RESEARCH ACTIVITIES
A researcher with a HuNI account will be able to:
• Search the HuNI Data
• Save their search results as a private collection
• Refine their collection through additional searches
• Analyse and annotate their collection with their own assertions and commentary
• Export their collection for further analysis
• Publish and share their collection and research
[Diagram: the HuNI Virtual Laboratory workflow tasks from slide 25, with the discovery and analysis tasks (simple and advanced search, save results as a private collection, refine/expand, analyse and annotate, export and share) backed by the Solr Search Server [HuNI Data].]
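A minimal sketch of the first two activities, searching the HuNI Data and saving the results as a private collection, assuming a hypothetical Solr core and field names:

```python
# Sketch of the "search, then save as a private collection" workflow
# against the Solr aggregate. Core name and fields are hypothetical.
import json
import pysolr

solr = pysolr.Solr("http://localhost:8983/solr/huni", timeout=10)

# Simple free-text search across the aggregate.
results = solr.search("Melbourne theatre", rows=20)

# Save the result set locally as a private "collection" for later refinement.
collection = [{"id": doc["id"], "name": doc.get("name")} for doc in results]
with open("my_collection.json", "w") as f:
    json.dump(collection, f, indent=2)

print(f"Saved {len(collection)} of {results.hits} matching records")
```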
30. RESEARCH ACTIVITIES 2
Scholarly researchers will also be able to perform a "deep search" of the graphs in the RDF Triple Store. The large-scale aggregation of Linked Data makes explicit the relationships and connections between related records across all the partner datasets, enabling the researcher to construct more complex semantic queries.
[Diagram: the HuNI Virtual Laboratory workflow tasks, with the deep (SPARQL-based) search backed by the RDF Triple Store [HuNI Linked Data].]
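A sketch of what such a deep search could look like, assuming a hypothetical SPARQL endpoint and standard CIDOC-CRM property URIs (the exact vocabulary of the HuNI graphs is not reproduced here):

```python
# Sketch of a "deep search" against the HuNI Linked Data.
# The SPARQL endpoint URL is hypothetical; the properties are standard
# CIDOC-CRM, but the exact URIs used in the HuNI graphs are assumptions.
from SPARQLWrapper import SPARQLWrapper, JSON

sparql = SPARQLWrapper("http://localhost:3030/huni/sparql")  # hypothetical endpoint
sparql.setQuery("""
    PREFIX cidoc: <http://www.cidoc-crm.org/cidoc-crm/>
    SELECT ?event ?actor WHERE {
        ?event a cidoc:E5_Event ;
               cidoc:P7_took_place_at ?place ;
               cidoc:P14_carried_out_by ?actor .
    }
    LIMIT 25
""")
sparql.setReturnFormat(JSON)
for row in sparql.query().convert()["results"]["bindings"]:
    print(row)
```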
39. 4. THE PROJECT
• project director/community liaison (20%)
• project manager (100%)
• technical coordinator (100%)
• information services coordinator (90%)
• community engagement (30%)
• communication coordinator (20%)
• administrative support (20%)
• software developer(s)
[Diagram: HuNI governance structure - NeCTAR Directorate, HuNI Steering Committee, Team HuNI, Technical Working Group, Expert Advisory Group and Expert Data Group.]
42. HuNI: a virtual laboratory for the humanities
http://huni.net.au/
@HuNIVL
Editor's Notes
Components of the CIDOC-CRM, FOAF and FRBR-OO ontologies have been reused for the integration of the initial datasets. This is a means to encode people, their existence (birth and death events), their occupations and their associations with organisations. More components have been added to record two further events, i.e. creation and production events, and to record works and expressions. Work is underway to plug in SKOS and structured vocabularies, using the data supplied (in EAC-type schemas) to manage the range of terminology, e.g. recreational, vocational, professional and occupational. This draft is based on a portion of the data analysed and a "mud map" (based on an assessment of data available through web interfaces). See the draft as a line diagram. A view of the ontology generated in the tool Protégé reveals FRBR-OO as an extension of CIDOC-CRM. In draft v0.3, using the initial datasets, limitations in using FOAF to handle personal names (which are culturally situated) have been found, so the CIDOC component E41_Appellation and its subclasses will now be used; collections are being dealt with; and further events are being added, e.g. E87_Curation_Activity to reflect acts of selection and collection development. Under discussion are: the inclusion of E90_Symbolic_Object to deal with citations (which are not feasible to strip apart and process but provide useful contextual information for an entity); the creation of "Floruit" as a time-related entity for E21_Person and E74_Group; categorising the datasets and collections as E89_Propositional_Object; and F3_Manifestation_Product_Type to deal with the disambiguation of portable and web formats of works.
This section of the HuNI ontology reveals the "joins" and class relationships where the CIDOC-CRM and FRBR-OO ontologies align. The yellow-green bubbles record the CIDOC entities and the red bubbles record the FRBR entities. Bidirectional arrows indicate a "sameAs" relationship; unidirectional arrows indicate a subclass relationship.
The integration of partner data into HuNI requires two technical components:
1. Live data feeds (at partner sites). Three technology options are available for the partners to publish their data as XML: jOAI, OAIcat and, for those who are not exposing their data via the OAI-PMH harvesting protocol, a custom-built solution that requires very little work to integrate at a provider's site. We are not harvesting all the data – we are only harvesting the primary entity classes (and as much of the uniquely identifying information as possible for each class) that are common "touch points" across many of the partner data sites – people, places, events and objects. Therefore, the lowest common denominator for making the partner data harvestable is a flat XML file per class entity, together with the uniquely identifying information. For example, for the person class entity, uniquely identifying information will include first name, last name, date of birth/death, bio and occupation.
2. A data gateway component called Corbicula. Technology is being deployed to harvest updated content from the partner XML data feeds and transform the data into forms suitable for ingestion into: a Solr search server (this aggregation of harvested XML records is referred to as "HuNI Data") and a Jena RDF Triple Store (this aggregation of stored RDF graphs is referred to as "HuNI Linked Data").
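Since jOAI and OAIcat both expose data over the standard OAI-PMH protocol, a minimal harvest of such a feed might look like the following sketch (the repository URL is hypothetical; ListRecords and oai_dc are a standard OAI-PMH verb and metadata prefix):

```python
# Minimal OAI-PMH harvest sketch. The repository URL is hypothetical;
# ListRecords is a standard OAI-PMH verb.
import requests
from lxml import etree

OAI = "{http://www.openarchives.org/OAI/2.0/}"

response = requests.get(
    "https://partner.example.org/oai",        # hypothetical repository
    params={"verb": "ListRecords", "metadataPrefix": "oai_dc"},
    timeout=30,
)
response.raise_for_status()

root = etree.fromstring(response.content)
for record in root.iter(f"{OAI}record"):
    identifier = record.findtext(f"{OAI}header/{OAI}identifier")
    print(identifier)
```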
Based on the data architecture set out in the original RFP, there is a requirement to harvest, transform and ingest data from each of the partner datasets into some sort of Linked Data store. Very early in the technical decision-making process, it was agreed that RDF (Resource Description Framework) – a metadata modelling specification – would be the lingua franca, and that all the technical components would be developed to work with this Linked Data specification.
So we began by: making some of the partner datasets harvestable by HuNI, by developing a harvest feed for those data providers who were technically able to publish their data in a standard export format/schema (EAC-CPF); constructing the HuNI ontology and mapping partner data to this common data model (a number of standard cultural heritage ontologies – CIDOC-CRM, FOAF, FRBR-OO, PROV-O – were selected for examination because of their perceived close semantic fit to the nature and types of data in each of the 28 data sources); and deploying a data gateway component – called Corbicula – on the NeCTAR Cloud, which harvests the updated XML data from the partner feeds, transforms it, and ingests it into the RDF Triple Store. Once the mappings for a given data source are known, XSLT scripts are written to interpret the XML records and re-express (transform) them as RDF graphs, which capture the relationships/links between records from all the integrated datasets.
But the integration into RDF has proven semantically and technically complex, because: the publishing format necessary to allow us to do the mappings is too high a technical barrier for most data custodians; the data analysis and mapping to a common data model is proving time-consuming and complex; the gateway component that harvests and transforms the data into RDF using XSLT has performance and memory issues; and the SPARQL-based search interface – where people can search and query the graphs – was proving too slow. As a result, after 10 months of development, only 6 partner data sources have completed their integration journey into the RDF Triple Store, and the search UI isn't very performant. So back in May it was flagged that there is a real project risk that we will not be able to fully transform all the partner data into Linked Data, and that only a small subset of partner datasets will be discoverable through the lab – a real problem, given that the main objective of HuNI is to provide cross-dataset discovery.
You're probably wondering why we have two data aggregates – why we mixed the data architectures. It was purely a project risk management decision: harvesting, mapping, transforming and ingesting into Linked Data is complex and time-consuming, and there was a real danger that we wouldn't have a sufficient Linked Data layer on which to build the lab. So, in order to deliver some cross-dataset search capability within the project timeframe, we introduced a new development strand which sees the accelerated harvesting and integration of data into the Solr aggregate. The decision has been made to continue populating the RDF store with partner data for the remainder of 2013, and to work on the UI in 2014. Populating the Solr search server is easy: HuNI periodically harvests the updated XML records from the partner feeds, processes the XML content via a suitable transform, and submits the transformed XML data into the Solr search server.
The transformation of partner XML records into HuNI Linked Data is complex and time-consuming, and we've faced a number of technical issues – not surprising, since we're using a combination of largely unproven technologies at the scale required for HuNI deployment. First, the harvested data had to be cleaned and mapped to a core HuNI ontology. A range of cultural heritage ontologies was examined as the starting point for building this core ontology framework. This has been an iterative process, determined by the nature of each data source and by the main types of data found in each source. The following standard ontologies are being aligned to create the HuNI Ontology: people and organisations (using the CIDOC-CRM and FOAF ontologies); items, collections and resources (using the PROV-O, CIDOC-CRM, FOAF and FRBR-OO ontologies); events and relations (using the PROV-O, CIDOC-CRM, FOAF and FRBR-OO ontologies); and place and subject (using the PROV-O, CIDOC-CRM, FOAF and FRBR-OO ontologies). Once the mappings to a common data model are known, the data needs to be technically transformed and ingested. This is made possible through the HuNI gateway component called Corbicula, which: (1) periodically harvests updated XML records from the source provider feeds; (2) uses XSLT to interpret the XML records and re-express (transform) them as RDF graphs; and (3) stores the RDF graphs. The search feature needs to be based on the Linked Data, to take advantage of the semantic integration provided by the RDF aggregation.
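A minimal sketch of steps (2) and (3) – re-expressing a harvested XML record as RDF via XSLT and storing the resulting graph – using lxml and rdflib. The stylesheet and file names are hypothetical, and a production deployment would write to the Jena triple store rather than a local file:

```python
# Sketch of the Corbicula transform step: XML records -> RDF graph.
# The stylesheet name and input file are hypothetical.
from lxml import etree
from rdflib import Graph

# XSLT stylesheet that re-expresses a partner XML record as RDF/XML.
transform = etree.XSLT(etree.parse("person_to_rdf.xsl"))

# Harvested partner record (flat XML, one file per entity class).
source = etree.parse("person_record.xml")

# Apply the transform and load the result into an RDF graph.
rdf_xml = transform(source)
graph = Graph()
graph.parse(data=str(rdf_xml), format="xml")

# Persist; a real deployment would push to the Jena triple store instead.
graph.serialize(destination="person_record.ttl", format="turtle")
print(f"Stored {len(graph)} triples")
```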
But of course this is a Virtual Laboratory (VL) project, not a data integration project.
Support the non-linear research methods practised by humanities researchers. HuNI is about inclusivity, not exclusivity – it uses third-party authentication for login – and for a community to form around HuNI, its user base needs to extend beyond scholarly researchers. It is also worth noting that any member of the general public interested in Australian culture can run a search across the related databases (the HuNI Data) and share their search results online – not just scholarly researchers. There are discovery limitations: whilst context is given for each record found, what isn't available are the known relationships between related records across the disparate data sources – so we're currently working on a "Social Linked Data" feature.
Equipped with a full set of known facets and related data fields for each record type, researchers should be able to interact with, and construct complex queries of, the large-scale aggregation of Linked Data.
A link will be made available on huni.net.au soon.
The lab is being designed to support the non-linear research methods practised in the humanities and creative arts, and will support a workflow centred around discovery, analysis and sharing. As part of the discovery interface, a researcher will be able to: run a free-text search across the aggregate and display the results; perform an advanced faceted browse of the aggregate by filtering results by dataset and by the entity classes defined in the ontology (people, works, events, organisations, occupations/roles, time, place, collections, languages, objects); narrow their search parameters at the start of a search by browsing within pre-defined access points – likely to be people, works and events, since these entity classes are represented across all 28 data sources – and then, following the initial browse, filter the results by dataset and the remaining entity classes; and run a SPARQL query to interrogate the underlying Linked Data (a faceted-search sketch follows below). The discovery interface is also going to enable serendipitous discovery (i.e. the ability to present information to users before they know what they want to search for): "You might also be interested in..." suggestions, based on the semantic relationships captured in the ontology. The notion of a generous interface is being included (based on some pre-defined daily query feeds), to give the researcher a sense of what is discoverable: "On this day...", most popular searches, and most popular records. The result sets will be displayed in a number of forms, with list as the default and map and timeline as options. All search results will be displayed with hyperlinks that allow navigation to the source entity, and will show the connections between records as per the ontology mappings.
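The faceted browse described above maps naturally onto Solr's facet parameters. A sketch, with hypothetical core URL and facet field names:

```python
# Sketch of a faceted browse over the Solr aggregate. The core URL and
# the 'dataset' and 'type' facet fields are hypothetical.
import pysolr

solr = pysolr.Solr("http://localhost:8983/solr/huni", timeout=10)

results = solr.search(
    "theatre",
    **{
        "facet": "true",
        "facet.field": ["dataset", "type"],  # e.g. AusStage; person/work/event
        "rows": 10,
    },
)

# Facet counts let the researcher filter by dataset and entity class.
print(results.facets["facet_fields"])
```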
The LORE tool (developed at UQ) will be made available in the lab, where researchers will be able to: display existing connections between relevant records held within their virtual collection; and add further links between particular records, with commentary describing the relationship between them.
Researchers will have the option to export their virtual collection as a .csv file so they can undertake further computational analysis outside of the HuNI lab, within their preferred tool environment. Whilst the lab will include a Tool Integration Framework specifying how third-party tools can integrate with the lab and work with HuNI data, we recognise that tools come and go, and that researchers create their own relationships with their tools of choice – so offering an export function is crucial. A minimal export sketch follows.
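A minimal sketch of the .csv export, assuming a collection saved locally as JSON (as in the earlier search sketch; the field names are hypothetical):

```python
# Sketch of exporting a saved virtual collection to CSV for analysis
# elsewhere. The collection file and field names are hypothetical.
import csv
import json

with open("my_collection.json") as f:
    collection = json.load(f)

with open("my_collection.csv", "w", newline="") as f:
    writer = csv.DictWriter(f, fieldnames=["id", "name"])
    writer.writeheader()
    writer.writerows(collection)
```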
Researchers will have the option to share their virtual collection, and their analysis findings, with other researchers via Facebook, Twitter and email.
The development of HuNI is being managed as a project. A collaborative governance structure is in place, so that all key project decisions are made as part of a consultative process, and the PRINCE2 methodology is being used to help manage the project. There is a question of consortial project management, and a need to create best-practice exemplars at the project management level. Staff are spread across four states, communicating via Skype or Google Hangouts; there are issues around discomfort with these communication technologies, etc.