Mapping your data can help to provide new insights on your research findings. However, many scholars are put off by the steep learning curve demanded by Geographic Information Systems (GIS) such as ArcGIS from ESRI. New and simple tools have become available that offer sophisticated output without extensive training. In fact, tools such as Google Maps, Google Earth, Open Street Map among others can offer immediate returns in a matter of hours where tasks in the past required, weeks, months and even years of training.
This document discusses using smartphones to improve landscape archaeology studies. It presents an app and QR codes created for the Stonehenge landscape to test how smartphones can aid interpretation. The app provides location-based information on monuments through pins and clustering. QR codes link to web pages with monument descriptions. Field testing found the app provided an adequate platform for understanding the landscape, while QR codes allowed two-way discussion. Further work is needed to fully realize smartphones' potential to facilitate different theories of spatial thinking in archaeology.
PostGIS is a spatial extension for PostgreSQL that aims to make it compliant with the OpenGIS Simple Features for SQL standard. It adds spatial data types and functions to PostgreSQL, allowing spatial objects to be stored and manipulated like other database objects. PostGIS indexes spatial data using GiST to enable fast spatial queries in SQL. The open source PostGIS software helps enable more open and affordable spatial databases and GIS applications.
The document summarizes the Chalice project, which aims to create a historical gazetteer from volumes of the English Place-Name Society (EPNS). It involves extracting place names from EPNS texts, disambiguating them using a gazetteer, and linking the information as structured data. Challenges include dealing with optical character recognition errors and integrating the data with other historical sources and datasets.
This project shows methods of mapping surname data from 1940 in Nova Scotia, based on voter lists. The names were mapped in ArcMap to create a hard copy map and ArcGIS Online application, while a CSV database was loaded onto CartoDB to use with a Leaflet template, allowing users to query surnames and see their distribution. The idea was inspired by an interactive map of Irish surname data. The data for this project was transcribed from Ancestry.ca.
Chalice / Edinburgh Geoparser at GISRUK with extra slidesJo Walsh
The document summarizes the Chalice project, which aims to create a historical gazetteer from volumes of the English Place-Name Society (EPNS). It discusses the Edinburgh Geoparser system used to extract place names and disambiguate them. It also provides an overview of how the extracted place name data is represented in Turtle format as linked data, with URIs for each place and links to information about name attestations and sources. Issues around integrating this data with other historical sources and datasets are also mentioned.
Constructing Semantic Gazetteers: Managing GeoSpatial Vocabularies Using Open...Stephane Fellah
The document discusses constructing a semantic gazetteer to manage geospatial vocabularies using open semantic web standards. It describes encoding the Aquatic Sciences and Fisheries Abstracts (ASFA) controlled vocabulary terms semantically using SKOS, geocoding the terms, and enabling spatial search of the ASFA database integrated with topic and author search. The goal is to demonstrate the value of linked data and semantic approaches by enabling geospatial reasoning and reusability of the linked data.
Mapping your data can help to provide new insights on your research findings. However, many scholars are put off by the steep learning curve demanded by Geographic Information Systems (GIS) such as ArcGIS from ESRI. New and simple tools have become available that offer sophisticated output without extensive training. In fact, tools such as Google Maps, Google Earth, Open Street Map among others can offer immediate returns in a matter of hours where tasks in the past required, weeks, months and even years of training.
This document discusses using smartphones to improve landscape archaeology studies. It presents an app and QR codes created for the Stonehenge landscape to test how smartphones can aid interpretation. The app provides location-based information on monuments through pins and clustering. QR codes link to web pages with monument descriptions. Field testing found the app provided an adequate platform for understanding the landscape, while QR codes allowed two-way discussion. Further work is needed to fully realize smartphones' potential to facilitate different theories of spatial thinking in archaeology.
PostGIS is a spatial extension for PostgreSQL that aims to make it compliant with the OpenGIS Simple Features for SQL standard. It adds spatial data types and functions to PostgreSQL, allowing spatial objects to be stored and manipulated like other database objects. PostGIS indexes spatial data using GiST to enable fast spatial queries in SQL. The open source PostGIS software helps enable more open and affordable spatial databases and GIS applications.
The document summarizes the Chalice project, which aims to create a historical gazetteer from volumes of the English Place-Name Society (EPNS). It involves extracting place names from EPNS texts, disambiguating them using a gazetteer, and linking the information as structured data. Challenges include dealing with optical character recognition errors and integrating the data with other historical sources and datasets.
This project shows methods of mapping surname data from 1940 in Nova Scotia, based on voter lists. The names were mapped in ArcMap to create a hard copy map and ArcGIS Online application, while a CSV database was loaded onto CartoDB to use with a Leaflet template, allowing users to query surnames and see their distribution. The idea was inspired by an interactive map of Irish surname data. The data for this project was transcribed from Ancestry.ca.
Chalice / Edinburgh Geoparser at GISRUK with extra slidesJo Walsh
The document summarizes the Chalice project, which aims to create a historical gazetteer from volumes of the English Place-Name Society (EPNS). It discusses the Edinburgh Geoparser system used to extract place names and disambiguate them. It also provides an overview of how the extracted place name data is represented in Turtle format as linked data, with URIs for each place and links to information about name attestations and sources. Issues around integrating this data with other historical sources and datasets are also mentioned.
Constructing Semantic Gazetteers: Managing GeoSpatial Vocabularies Using Open...Stephane Fellah
The document discusses constructing a semantic gazetteer to manage geospatial vocabularies using open semantic web standards. It describes encoding the Aquatic Sciences and Fisheries Abstracts (ASFA) controlled vocabulary terms semantically using SKOS, geocoding the terms, and enabling spatial search of the ASFA database integrated with topic and author search. The goal is to demonstrate the value of linked data and semantic approaches by enabling geospatial reasoning and reusability of the linked data.
LoCloud Historic Place Names Service, Rimvydas Laužikas, Justinas Jaronis and...locloud
An introduction to the LoCloud Historic Place Names Service presented at the LoCloud training workshops. The presentation provides an introduction to historic place names, which were used in earlier times and are mentioned in historical documents and sources. It introduces the service, an application which enables local cultural institutions to collaborate in developing the Thesaurus of Historic Place Names and enriching metadata.
The document summarizes the Chalice project, which aims to create a historical gazetteer from volumes of the English Place-Name Society. It discusses extracting place names from text, linking them to geographic data, and integrating the information with other historical sources. Key aspects include parsing text to recognize place names, disambiguating names against a gazetteer, representing the information as linked data, and addressing challenges around optical character recognition and structural variation in source materials.
First, an interdisciplinary research approach of the Digital Humanities will be presented with a research project at the Bibliotheca Hertziana (MPI for Art History), Rome serving as an example. In the Institute's research, questions about the historical understanding of social space and its change during the so-called Long Middle Ages play a central role. The investigation of the relationship between historical maps and texts is intended to explore the historical understanding of space and the knowledge associated with it by taking up approaches from cognitive linguistics. Cognitive maps depict culturally specific spatial knowledge and practices. Annotation and analysis of historical texts and maps pose special challenges for knowledge representation and processing. The research data will be published as Linked Open Data in the Semantic Web. Subsequently, problems and challenges for machine learning in this area will be discussed.
The document discusses what geospatial data is, noting that practically all data has some spatial attribute or context since everything happens somewhere. It explains that geospatial data can come from existing datasets or be created by geocoding or geo-tagging data to assign it spatial coordinates or locations. The document also outlines different ways geospatial data can be mapped, analyzed, shared, and referenced for use in geographic information systems.
Real-time Visualisation of Cultural Heritage and Environmental Archaeology Da...Marcus Smith
This document discusses a hack day event called "ArkHack" where participants worked to link archaeological, cultural heritage, and environmental data. It describes efforts to map environmental data from the SEAD database to linked open data formats. The document then discusses potential uses of linked data from multiple sources to reconstruct past landscapes and environments over time through techniques like paleoenvironmental modeling. Challenges and prerequisites for more fully realizing this vision are also outlined.
This document provides an overview of mapping with Drupal 7. It discusses web mapping libraries like OpenLayers and commercial alternatives. It also outlines Drupal field types, modules and libraries that can be used for geo-enabled content as well as recommendations for storing, inputting, and displaying map data with modules like Geofield, Address Field, and OpenLayers. Examples of implemented maps using these techniques are also provided.
Archaeology, Informatics and Knowledge RepresentationDART Project
This document discusses using logic programming and ontologies to model stratigraphic relationships in archaeology. It presents an example stratigraphic sequence and shows how it can be represented and reasoned about using Prolog rules and predicates. Different states of the stratigraphic model are output as the data and rules are updated, demonstrating how logical reasoning can infer additional relationships and handle inconsistencies in the archaeological record. Ontologies like CIDOC-CRM are discussed as a way to formally represent archaeological concepts and relationships to support modeling landscape stratigraphy.
Efficient Query Processing in Geographic Web Search EnginesYen-Yu Chen
Geographic web search engines allow users to constrain and order search results in an intuitive manner by focusing a query on a particular geographic region. Geographic search technology, also called local search, has recently received significant interest from major search engine companies. Academic research in this area has focused primarily on techniques for extracting geographic knowledge from the web. In this paper, we study the problem of efficient query processing in scalable geographic search engines. Query processing is a major bottleneck in standard web search engines, and the main reason for the thousands of machines used by the major engines. Geographic search engine query processing is different in that it requires a combination of text and spatial data processing techniques. We propose several algorithms for efficient query processing in geographic search engines, integrate them into an existing web search query processor, and evaluate them on large sets of real data and query traces.
The Expert Library: Emergent needs in academic and special librariesLAICDG
Presentation by John B. Howard, head librarian at University College Dublin, for the Information Skills for the Future event, organised by the Career Development Group of the Library Association of Ireland on April 2nd 2015
This document summarizes a workshop on programming with data. It discusses preparing data by structuring it into categories and relationships. Examples are given of literary mapping projects that encode spatial and prosodic data from texts. Programming tasks for these projects are outlined, such as counting feet in poems and identifying deviation patterns. The document emphasizes thinking through one's data and goals before choosing tools or languages in order to focus learning.
CAA 2014 - To Boldly or Bravely Go? Experiences of using Semantic Technologie...Keith.May
This paper is based upon practical experiences of Conceptual modelling, using CIDOC CRM, of the single context recording system at English Heritage and mapping it to other 'single context' based systems. It also presents recent work on identifying conceptual commonalities that may exist in different archaeological recording methodologies, whether 'single context recording' or otherwise, along with practical challenges based on experiences of trying to integrate, or simply search across, data from different archaeological recording systems. In addition it introduces the work to date on developing http://www.heritagedata.org/ and suggests opportunities for sharing and aligning further archaeological vocabularies using SKOS and Linked Open Data technologies.
- The document describes a project to fill gaps in knowledge about diamond mining, trading, and polishing in Borneo by developing a workflow using various CLARIAH tools and resources.
- The workflow involved digitizing a diamond encyclopedia, extracting concepts and place names, linking the data to external sources to create linked open data, and querying newspaper archives to build a corpus of relevant articles.
- Promising results showed mining, trading, and polishing continued in Borneo for Southeast Asian customers, and described previously unknown diamond fields and polishing locations in Borneo. The project aims to apply the workflow to other commodities like sugar.
The document discusses establishing geospatial analysis capabilities within an exploration environment. It outlines collecting all relevant exploration data, representing it properly in a GIS database, and generating continuous geospatial grids using deterministic and geostatistical methods. These grids would support spatial analysis and decision making by allowing visualization and queries of parameters like probability of success. Maintaining data quality and understanding relationships between different data types is important for effective analysis.
This document discusses opportunities for using spatial data and analysis. Key points include:
- Increased user sophistication and availability of location-based data and tools enable new integrated systems and applications.
- Spatial data and analysis can be used for real estate development, environmental planning, scientific research, economic analysis and more.
- SQL Server provides robust spatial data types and methods to build high performance spatial solutions and integrate geospatial applications.
How to Download & Install Module From the Odoo App Store in Odoo 17Celine George
Custom modules offer the flexibility to extend Odoo's capabilities, address unique requirements, and optimize workflows to align seamlessly with your organization's processes. By leveraging custom modules, businesses can unlock greater efficiency, productivity, and innovation, empowering them to stay competitive in today's dynamic market landscape. In this tutorial, we'll guide you step by step on how to easily download and install modules from the Odoo App Store.
How to Setup Default Value for a Field in Odoo 17Celine George
In Odoo, we can set a default value for a field during the creation of a record for a model. We have many methods in odoo for setting a default value to the field.
Elevate Your Nonprofit's Online Presence_ A Guide to Effective SEO Strategies...TechSoup
Whether you're new to SEO or looking to refine your existing strategies, this webinar will provide you with actionable insights and practical tips to elevate your nonprofit's online presence.
LoCloud Historic Place Names Service, Rimvydas Laužikas, Justinas Jaronis and...locloud
An introduction to the LoCloud Historic Place Names Service presented at the LoCloud training workshops. The presentation provides an introduction to historic place names, which were used in earlier times and are mentioned in historical documents and sources. It introduces the service, an application which enables local cultural institutions to collaborate in developing the Thesaurus of Historic Place Names and enriching metadata.
The document summarizes the Chalice project, which aims to create a historical gazetteer from volumes of the English Place-Name Society. It discusses extracting place names from text, linking them to geographic data, and integrating the information with other historical sources. Key aspects include parsing text to recognize place names, disambiguating names against a gazetteer, representing the information as linked data, and addressing challenges around optical character recognition and structural variation in source materials.
First, an interdisciplinary research approach of the Digital Humanities will be presented with a research project at the Bibliotheca Hertziana (MPI for Art History), Rome serving as an example. In the Institute's research, questions about the historical understanding of social space and its change during the so-called Long Middle Ages play a central role. The investigation of the relationship between historical maps and texts is intended to explore the historical understanding of space and the knowledge associated with it by taking up approaches from cognitive linguistics. Cognitive maps depict culturally specific spatial knowledge and practices. Annotation and analysis of historical texts and maps pose special challenges for knowledge representation and processing. The research data will be published as Linked Open Data in the Semantic Web. Subsequently, problems and challenges for machine learning in this area will be discussed.
The document discusses what geospatial data is, noting that practically all data has some spatial attribute or context since everything happens somewhere. It explains that geospatial data can come from existing datasets or be created by geocoding or geo-tagging data to assign it spatial coordinates or locations. The document also outlines different ways geospatial data can be mapped, analyzed, shared, and referenced for use in geographic information systems.
Real-time Visualisation of Cultural Heritage and Environmental Archaeology Da...Marcus Smith
This document discusses a hack day event called "ArkHack" where participants worked to link archaeological, cultural heritage, and environmental data. It describes efforts to map environmental data from the SEAD database to linked open data formats. The document then discusses potential uses of linked data from multiple sources to reconstruct past landscapes and environments over time through techniques like paleoenvironmental modeling. Challenges and prerequisites for more fully realizing this vision are also outlined.
This document provides an overview of mapping with Drupal 7. It discusses web mapping libraries like OpenLayers and commercial alternatives. It also outlines Drupal field types, modules and libraries that can be used for geo-enabled content as well as recommendations for storing, inputting, and displaying map data with modules like Geofield, Address Field, and OpenLayers. Examples of implemented maps using these techniques are also provided.
Archaeology, Informatics and Knowledge RepresentationDART Project
This document discusses using logic programming and ontologies to model stratigraphic relationships in archaeology. It presents an example stratigraphic sequence and shows how it can be represented and reasoned about using Prolog rules and predicates. Different states of the stratigraphic model are output as the data and rules are updated, demonstrating how logical reasoning can infer additional relationships and handle inconsistencies in the archaeological record. Ontologies like CIDOC-CRM are discussed as a way to formally represent archaeological concepts and relationships to support modeling landscape stratigraphy.
Efficient Query Processing in Geographic Web Search EnginesYen-Yu Chen
Geographic web search engines allow users to constrain and order search results in an intuitive manner by focusing a query on a particular geographic region. Geographic search technology, also called local search, has recently received significant interest from major search engine companies. Academic research in this area has focused primarily on techniques for extracting geographic knowledge from the web. In this paper, we study the problem of efficient query processing in scalable geographic search engines. Query processing is a major bottleneck in standard web search engines, and the main reason for the thousands of machines used by the major engines. Geographic search engine query processing is different in that it requires a combination of text and spatial data processing techniques. We propose several algorithms for efficient query processing in geographic search engines, integrate them into an existing web search query processor, and evaluate them on large sets of real data and query traces.
The Expert Library: Emergent needs in academic and special librariesLAICDG
Presentation by John B. Howard, head librarian at University College Dublin, for the Information Skills for the Future event, organised by the Career Development Group of the Library Association of Ireland on April 2nd 2015
This document summarizes a workshop on programming with data. It discusses preparing data by structuring it into categories and relationships. Examples are given of literary mapping projects that encode spatial and prosodic data from texts. Programming tasks for these projects are outlined, such as counting feet in poems and identifying deviation patterns. The document emphasizes thinking through one's data and goals before choosing tools or languages in order to focus learning.
CAA 2014 - To Boldly or Bravely Go? Experiences of using Semantic Technologie...Keith.May
This paper is based upon practical experiences of Conceptual modelling, using CIDOC CRM, of the single context recording system at English Heritage and mapping it to other 'single context' based systems. It also presents recent work on identifying conceptual commonalities that may exist in different archaeological recording methodologies, whether 'single context recording' or otherwise, along with practical challenges based on experiences of trying to integrate, or simply search across, data from different archaeological recording systems. In addition it introduces the work to date on developing http://www.heritagedata.org/ and suggests opportunities for sharing and aligning further archaeological vocabularies using SKOS and Linked Open Data technologies.
- The document describes a project to fill gaps in knowledge about diamond mining, trading, and polishing in Borneo by developing a workflow using various CLARIAH tools and resources.
- The workflow involved digitizing a diamond encyclopedia, extracting concepts and place names, linking the data to external sources to create linked open data, and querying newspaper archives to build a corpus of relevant articles.
- Promising results showed mining, trading, and polishing continued in Borneo for Southeast Asian customers, and described previously unknown diamond fields and polishing locations in Borneo. The project aims to apply the workflow to other commodities like sugar.
The document discusses establishing geospatial analysis capabilities within an exploration environment. It outlines collecting all relevant exploration data, representing it properly in a GIS database, and generating continuous geospatial grids using deterministic and geostatistical methods. These grids would support spatial analysis and decision making by allowing visualization and queries of parameters like probability of success. Maintaining data quality and understanding relationships between different data types is important for effective analysis.
This document discusses opportunities for using spatial data and analysis. Key points include:
- Increased user sophistication and availability of location-based data and tools enable new integrated systems and applications.
- Spatial data and analysis can be used for real estate development, environmental planning, scientific research, economic analysis and more.
- SQL Server provides robust spatial data types and methods to build high performance spatial solutions and integrate geospatial applications.
How to Download & Install Module From the Odoo App Store in Odoo 17Celine George
Custom modules offer the flexibility to extend Odoo's capabilities, address unique requirements, and optimize workflows to align seamlessly with your organization's processes. By leveraging custom modules, businesses can unlock greater efficiency, productivity, and innovation, empowering them to stay competitive in today's dynamic market landscape. In this tutorial, we'll guide you step by step on how to easily download and install modules from the Odoo App Store.
How to Setup Default Value for a Field in Odoo 17Celine George
In Odoo, we can set a default value for a field during the creation of a record for a model. We have many methods in odoo for setting a default value to the field.
Elevate Your Nonprofit's Online Presence_ A Guide to Effective SEO Strategies...TechSoup
Whether you're new to SEO or looking to refine your existing strategies, this webinar will provide you with actionable insights and practical tips to elevate your nonprofit's online presence.
THE SACRIFICE HOW PRO-PALESTINE PROTESTS STUDENTS ARE SACRIFICING TO CHANGE T...indexPub
The recent surge in pro-Palestine student activism has prompted significant responses from universities, ranging from negotiations and divestment commitments to increased transparency about investments in companies supporting the war on Gaza. This activism has led to the cessation of student encampments but also highlighted the substantial sacrifices made by students, including academic disruptions and personal risks. The primary drivers of these protests are poor university administration, lack of transparency, and inadequate communication between officials and students. This study examines the profound emotional, psychological, and professional impacts on students engaged in pro-Palestine protests, focusing on Generation Z's (Gen-Z) activism dynamics. This paper explores the significant sacrifices made by these students and even the professors supporting the pro-Palestine movement, with a focus on recent global movements. Through an in-depth analysis of printed and electronic media, the study examines the impacts of these sacrifices on the academic and personal lives of those involved. The paper highlights examples from various universities, demonstrating student activism's long-term and short-term effects, including disciplinary actions, social backlash, and career implications. The researchers also explore the broader implications of student sacrifices. The findings reveal that these sacrifices are driven by a profound commitment to justice and human rights, and are influenced by the increasing availability of information, peer interactions, and personal convictions. The study also discusses the broader implications of this activism, comparing it to historical precedents and assessing its potential to influence policy and public opinion. The emotional and psychological toll on student activists is significant, but their sense of purpose and community support mitigates some of these challenges. However, the researchers call for acknowledging the broader Impact of these sacrifices on the future global movement of FreePalestine.
How to Manage Reception Report in Odoo 17Celine George
A business may deal with both sales and purchases occasionally. They buy things from vendors and then sell them to their customers. Such dealings can be confusing at times. Because multiple clients may inquire about the same product at the same time, after purchasing those products, customers must be assigned to them. Odoo has a tool called Reception Report that can be used to complete this assignment. By enabling this, a reception report comes automatically after confirming a receipt, from which we can assign products to orders.
Level 3 NCEA - NZ: A Nation In the Making 1872 - 1900 SML.pptHenry Hollis
The History of NZ 1870-1900.
Making of a Nation.
From the NZ Wars to Liberals,
Richard Seddon, George Grey,
Social Laboratory, New Zealand,
Confiscations, Kotahitanga, Kingitanga, Parliament, Suffrage, Repudiation, Economic Change, Agriculture, Gold Mining, Timber, Flax, Sheep, Dairying,
CHUYÊN ĐỀ ÔN TẬP VÀ PHÁT TRIỂN CÂU HỎI TRONG ĐỀ MINH HỌA THI TỐT NGHIỆP THPT ...
Edin pelagios
1. Institute for Language,
Cognition and Computation
The Edinburgh Geoparser
and Chalice
Claire Grover
Kate Byrne, Richard Tobin, Jo Walsh
www.inf.ed.ac.uk
2. Institute for Language,
Cognition and Computation
Overview of the Edinburgh Geoparser
• System to automatically recognise place names in text and
disambiguate them with respect to a gazetteer. (Athens, Springfield)
• Patchy development over past few years funded by a variety of
projects applied to a range of data sets:
– GeoCrossWalk
– BOPCRIS
– GeoDigRef (Histpop, BOPCRIS, BL)
– Embedding GeoCrossWalk (Stormont Papers)
– SYNC3 (online news)
– Chalice (EPNS)
– Unlock
• Main concern has been to keep it generally usable while applying it to
specific data sets.
3. Institute for Language,
Cognition and Computation
Overview of the Edinburgh Geoparser
Geotagging
.txt
.html
Format
Tokenisation
POS
Lemmatis-
Named
Entity
.geotagged.xml
.xml
conversion
tagging
ation
Recognition
.geotagged.xml
Gazetteer
lookup
Resolution
.gaz.xml
Georesolution
8. Institute for Language,
Cognition and Computation
Evaluation (2009)
SpatialML (gold geotagging) GeoNames Unlock
No. of place names 3628 3628
No. for which gaz entries found 3538 3049
Correct within 5km 2946 2143
As % of total 81.2% 59.0%
SpatialML (end-to-end) GeoNames
No. of place names 3628
No. for which gaz entries found 2923
Correct within 5km 2504
As % of total 69.0%
9. Institute for Language,
Cognition and Computation
Current Development Issues
• Open source release
• Increased configurability
– Input formats: plain text, HTML, simple XML, ...
– User’s own text analysis: paragraphs, sentences, word tokens,
place name mark-up
– Output formats: map visualisation, text mark-up, …
– User input: constrain by area, bounding box, …
• Choice of gazetteer: GeoNames, Unlock, geonames-local, Pleiades+,
Chalice historical gazetteer, ...
• Performance monitoring/evaluation against test sets
10. Institute for Language,
Cognition and Computation
GAP project: Pleiades+
• Based on Pleiades set of ancient place names but extended in two ways:
• by matching Pleiades place names against GeoNames place names in the
same location and adding the GeoNames alternative names to the Pleiades+
list:
– adds three alternative names for the single Pleiades entry for
Autricum (Chartrez, Chartres, Shartr), because Autricum” is present
in both Pleiades and GeoNames, with the same approximate location
• at run-time, looking up place names found in the text against GeoNames (as
well as against Pleiades+) and the using the alternative names from GeoNames
to match against the Pleiades+ list
– Pleiades has no entry for Egypt”. We look up the name in GeoNames and
use its alternative names (which include Aegyptus) to match back against
Pleiades (which does include Aegyptus). (We don't want to simply take
places directly from GeoNames because, when we tried it, we were
swamped with irrelevant modern places having names corresponding to
ancient toponyms.)
11. Institute for Language,
Cognition and Computation
Chalice
• Connecting Historical Authorities with Linked Data, Contexts, and Entities.
• Funded under the JISC jiscEXPO programme on exposing digital content
for education and research.
• The project is exploring the viability of creating a historical gazetteer from
digitized volumes from the English Place-Name Society (EPNS).
• Partners:
– CDDA, Queen’s University, Belfast
– School of Informatics, Edinburgh
– EDINA, Edinburgh
– CeRch, Kings College London
• Informatics role is to adapt our existing text mining/geoparsing technology
to convert the textual documents that are output from OCR into structured
data.
12. Institute for Language,
Cognition and Computation
Chalice data
• Cheshire
– Cheshire Part I. EPNS Volume 44, 1970
– Cheshire Part II. EPNS Volume 45, 1970
– Cheshire Part III. EPNS Volume 46, 1971
– Cheshire Part IV. EPNS Volume 47, 1972
– Cheshire Part V (1 :i). EPNS Volume 48, 1981
– Cheshire Part V (1 :ii). EPNS Volume 54, 1981
• Small samples from:
– Berkshire, Buckinghamshire (Vol. 2), Cambridgeshire (Vol 19),
Derbyshire (Vols 27-29), Hertfordshire (Vol. 15)
• Shropshire: Pimhill Hundred (born digital)
13. Institute for Language,
Cognition and Computation
EPNS
• Parishes are usually organised in terms of the hundreds in which they belong.
• Towns and villages are usually referred to as townships and are organised in
terms of the parish in which they belong.
• Township descriptions often contain relatively unstructured information about
smaller associated places such as buildings, bridges, lanes, woods and
farms.
• Township descriptions also frequently contain separately marked sections of
information about field names and street names.
• Information about river and major road names are described separately from
the inhabited place descriptions.
• Place names are the primary object of interest and descriptions of them
contain information about alternative names and spellings that have been
attested in historical sources and the etymology of names or name parts.
• In Chalice we focus on capturing parishes, townships, sub-townships,
attestation. We don’t deal with hundreds, field names, street names, rivers,
roads etc.
22. Institute for Language,
Cognition and Computation
Issues
• OCR quality needs to be high: not just recognising characters correctly but
getting font and layout information right. Failure to recognise bold and small
caps fonts or the difference between a line break and a paragraph break can
lead to major errors in the recognition process.
• EPNS volumes vary in the use of layout and font to indicate structure (e.g.
Cheshire parishes are signaled by centering combined with numbering with
roman numerals while Hertfordshire ones are unnumbered but centered and in
bold font.) In some volumes potentially useful information is contained in
footnotes.
• Different volumes reflect different decisions about where place name information
should be put. In most cases the information about the parish name occurs next
to the town in the parish that has the same name. In the Shropshire text some
place name information occurs in an earlier volume and is not subsequently
repeated, e.g. the description of the parish of Baschurch, containing a township
of the same name, has no attestation or etymological information provided
because the name was discussed in Part 1.