Laurel Stvan, Associate Professor of Linguistics, UT Arlington, presentation for “Using Digital Humanities Research Tools in the Classroom” at UT Dallas 2/27/13
LIWC-ing at Texts for Insights from Linguistic PatternsShalin Hai-Jew
Since the mid-1990s, researchers have been using the Linguistic Inquiry and Word Count (LIWC pronounced “luke”) software tool to explore various text corpora for hidden insights from linguistic patterns. The LIWC tool has evolved over the years. Simultaneously, research using computational text analysis has evolved and shed light on areas of deception, threat assessment, personality, predictive analytics, and other areas. This presentation will highlight some of the applications of LIWC in the research literature and showcase the tool on some original text sets.
Supporting User's Exploration of Digital Libraries, Suedl 2012 workshop proce...pathsproject
Workshop proceedings from the International workshop on Supporting Users Exploration of Digital Libraries, SUEDL 2012 which was held at TPDL 2012 (the international conference on Theory and Practice in Digital Libraries), Paphos, Cyprus, September 2012.
The aim of the workshop was to stimulate collaboration from experts and stakeholders in Digital Libraries, Cultural Heritage, Natural Language Processing and Information Retrieval in order to explore methods and strategies to support exploration of Digital Libraries, beyond the white box paradigm of search and click.
The proceedings includes:
"Browsing Europeana - Opportunities and Challenges', David Haskiya
"Query re-writing using shallow language processing effects', Anna Mastora and Sarantos Kapidakis
"Visualising Television Heritage" Johan Ooman et al,
"Providing suitable information access for new users of Digital Libraries", Rike Brecht et al
"Exploring Pelagios: a Visual Browser for Geo-tagged datasets" Rainer Simon et al
Presentation given to 3rd Year Undergraduate English Students on Narrative Analysis module at Brimingham City University. Followed by demonstration of eMargin text annotation software: http://emargin.bcu.ac.uk/
Capitalizing on Machine Reading to Engage Bigger DataShalin Hai-Jew
What are some ways to select, say, 200 research articles to “close read” from a set of 2,000 PDF articles gleaned from library databases and Google Scholar? How can a researcher make sense of a trending issue in the flood of Tweets and RT based on a particular hashtag (#) or keyword search or an especially lively Tweetstream based on a particular social media account? People are dealing with ever more prodigious amounts of information—from a number of sources. Those who are savvy to the uses of computers to aid their reading (through “distant reading” or “not-reading”) may find that they are able to cover much more ground. This presentation introduces the use of NVivo 11 Plus (matrix queries, word frequency counts, text searches and dendrograms, cluster analyses, topic modeling, and others) for multiple cases of distant reading to aid in academic and research work.
The document provides an overview of the research areas and projects within the Language and Intelligence School of Computer Applications. It summarizes work on natural language processing, machine translation, speech recognition, artificial intelligence, language evolution, and neural network modeling. It describes several specific projects including example-based machine translation, speaker characterization, computational semantics, and a decentralized artificial intelligence system called World Wide Mind.
The document discusses social media in Japan, including that blogs have 17 million users with 3 million active, Mixi is the largest social networking site with 15 million users, Wikipedia is one of the top five languages, and NicoNicoDouga has 10 million video sharing users. Trends in social media content include daily life, culture, communication, and politics. Participants are mostly anonymous with some celebrities and famous bloggers. Future directions may include more integration with mobile phones, lifelogging, agents, and collaboration to shape bottom-up culture. The document also shows a social network graph of creators for Hatsune Miku related videos.
Finding the annotation needs of the botanical community in a digital libraryWilliam Ulate
The Center for Biodiversity Informatics at the Missouri Botanical Garden and Saint Louis University are analyzing the web annotation needs of the botanical community to develop a prototype of how those needs may be met within a digital library platform. We want to assess the practicality of existing tools to satisfy the technical, economic, and operational needs of botanical users to annotate. This will inform on requisites, best practices, and further developments for a research project to integrate an annotation tool within a virtual library. We surveyed 14 members of 10 different institutions in the botanical and scientific communities. We included both, those who currently annotate online as well as those who have only annotated offline (e.g. print or analog), in order to better understand the functionality needed to encourage and support online annotation activities. The answers to this survey were analyzed in the context of an annotation tool in a digital library and a prioritized list of annotation needs for users of a botanical virtual library was produced, taking into account the minimal and recommended functionality required to comply with the users requirements. Preliminary results from the report of the in-depth user assessments of annotation needs in the specific domain of botanists are shared with the attendees. Advances in the definition of a prototype are also shown.
A non-technical introduction to text mining for information specialists Tom De Schryver
This document provides a non-technical introduction to text mining. It discusses how text mining can both complement and threaten information specialists by requiring new skills, but also providing opportunities for collaboration. The document then provides examples of descriptive and predictive text analysis, including topic extraction, network analysis of document structures, and using text mining to more quickly find relevant literature for reviews. It discusses parsing, filtering, and enriching text data, as well as challenges like the curse of dimensionality.
LIWC-ing at Texts for Insights from Linguistic PatternsShalin Hai-Jew
Since the mid-1990s, researchers have been using the Linguistic Inquiry and Word Count (LIWC pronounced “luke”) software tool to explore various text corpora for hidden insights from linguistic patterns. The LIWC tool has evolved over the years. Simultaneously, research using computational text analysis has evolved and shed light on areas of deception, threat assessment, personality, predictive analytics, and other areas. This presentation will highlight some of the applications of LIWC in the research literature and showcase the tool on some original text sets.
Supporting User's Exploration of Digital Libraries, Suedl 2012 workshop proce...pathsproject
Workshop proceedings from the International workshop on Supporting Users Exploration of Digital Libraries, SUEDL 2012 which was held at TPDL 2012 (the international conference on Theory and Practice in Digital Libraries), Paphos, Cyprus, September 2012.
The aim of the workshop was to stimulate collaboration from experts and stakeholders in Digital Libraries, Cultural Heritage, Natural Language Processing and Information Retrieval in order to explore methods and strategies to support exploration of Digital Libraries, beyond the white box paradigm of search and click.
The proceedings includes:
"Browsing Europeana - Opportunities and Challenges', David Haskiya
"Query re-writing using shallow language processing effects', Anna Mastora and Sarantos Kapidakis
"Visualising Television Heritage" Johan Ooman et al,
"Providing suitable information access for new users of Digital Libraries", Rike Brecht et al
"Exploring Pelagios: a Visual Browser for Geo-tagged datasets" Rainer Simon et al
Presentation given to 3rd Year Undergraduate English Students on Narrative Analysis module at Brimingham City University. Followed by demonstration of eMargin text annotation software: http://emargin.bcu.ac.uk/
Capitalizing on Machine Reading to Engage Bigger DataShalin Hai-Jew
What are some ways to select, say, 200 research articles to “close read” from a set of 2,000 PDF articles gleaned from library databases and Google Scholar? How can a researcher make sense of a trending issue in the flood of Tweets and RT based on a particular hashtag (#) or keyword search or an especially lively Tweetstream based on a particular social media account? People are dealing with ever more prodigious amounts of information—from a number of sources. Those who are savvy to the uses of computers to aid their reading (through “distant reading” or “not-reading”) may find that they are able to cover much more ground. This presentation introduces the use of NVivo 11 Plus (matrix queries, word frequency counts, text searches and dendrograms, cluster analyses, topic modeling, and others) for multiple cases of distant reading to aid in academic and research work.
The document provides an overview of the research areas and projects within the Language and Intelligence School of Computer Applications. It summarizes work on natural language processing, machine translation, speech recognition, artificial intelligence, language evolution, and neural network modeling. It describes several specific projects including example-based machine translation, speaker characterization, computational semantics, and a decentralized artificial intelligence system called World Wide Mind.
The document discusses social media in Japan, including that blogs have 17 million users with 3 million active, Mixi is the largest social networking site with 15 million users, Wikipedia is one of the top five languages, and NicoNicoDouga has 10 million video sharing users. Trends in social media content include daily life, culture, communication, and politics. Participants are mostly anonymous with some celebrities and famous bloggers. Future directions may include more integration with mobile phones, lifelogging, agents, and collaboration to shape bottom-up culture. The document also shows a social network graph of creators for Hatsune Miku related videos.
Finding the annotation needs of the botanical community in a digital libraryWilliam Ulate
The Center for Biodiversity Informatics at the Missouri Botanical Garden and Saint Louis University are analyzing the web annotation needs of the botanical community to develop a prototype of how those needs may be met within a digital library platform. We want to assess the practicality of existing tools to satisfy the technical, economic, and operational needs of botanical users to annotate. This will inform on requisites, best practices, and further developments for a research project to integrate an annotation tool within a virtual library. We surveyed 14 members of 10 different institutions in the botanical and scientific communities. We included both, those who currently annotate online as well as those who have only annotated offline (e.g. print or analog), in order to better understand the functionality needed to encourage and support online annotation activities. The answers to this survey were analyzed in the context of an annotation tool in a digital library and a prioritized list of annotation needs for users of a botanical virtual library was produced, taking into account the minimal and recommended functionality required to comply with the users requirements. Preliminary results from the report of the in-depth user assessments of annotation needs in the specific domain of botanists are shared with the attendees. Advances in the definition of a prototype are also shown.
A non-technical introduction to text mining for information specialists Tom De Schryver
This document provides a non-technical introduction to text mining. It discusses how text mining can both complement and threaten information specialists by requiring new skills, but also providing opportunities for collaboration. The document then provides examples of descriptive and predictive text analysis, including topic extraction, network analysis of document structures, and using text mining to more quickly find relevant literature for reviews. It discusses parsing, filtering, and enriching text data, as well as challenges like the curse of dimensionality.
eMargin Presentation given to Skills Funding AgencyRDUES
Presentation on the eMargin collaborative text annotation tool given to the Skills Funding Agency. Also contains description of AHRC Knowledge Transfer Fellowship project, working with A Level English Language students.
This document discusses using the MUSE software to process and deliver personal email archives in special collections. It identifies challenges with email archiving like copyright, privacy, and sensitive information. It provides examples of email collections. MUSE can be used to build search indexes, access controls, and merge emails with existing discovery portals. MUSE allows editing lexicons, grouping emails by topic or person, and generating relationship charts and summaries for web access while maintaining full individual emails for reading rooms. The document notes gaps in MUSE like more advanced search capabilities and original folder views, and proposes future work.
Learning and Text Analysis for Ontology Engineeringbutest
This document calls for papers and participation in a workshop on learning and text analysis for ontology engineering to be held in conjunction with the ECAI 2002 conference in Lyon, France. The workshop aims to bring together researchers from linguistics, natural language processing, knowledge representation, and machine learning to discuss issues around building, maintaining, and reusing ontologies and terminological resources. Topics of interest include using texts and linguistic/terminological resources as knowledge sources for building ontologies, applying machine learning and NLP tools to ontology engineering, and learning ontologies from sources like the web. The deadline for paper submissions is March 15th and for motivation abstracts is May 24th. The workshop will include paper presentations, discussions, and
The document summarizes efforts to support digital humanities research through collaboration at various institutions. It describes projects at Wheaton College involving students encoding a text using TEI XML under faculty supervision. It also discusses initiatives at the University of Vermont and Brown University to provide infrastructure and expertise for digital scholarship through partnerships between libraries, academic technology groups, and faculty researchers.
The document provides information about Samuel Kim's career portfolio, including his educational background receiving a Ph.D from University of Southern California and B.S./M.S from Yonsei University. His research has focused on audio information retrieval and social signal processing with the goal of developing machines that can understand scenes using unstructured audio signals and how humans perform and perceive behaviors in social interactions. The document also summarizes his current project on social signal processing and open challenges in his past work on acoustic topic models.
What is Digital Humanities?
What do we do under DH?
1. Digital Archives
Let us have introduction to a few projects
2. Computational Humanities
a. Using digital technology for analysis of literary text - research concerns
b. Using DT in teaching & learning - pedagogical concerns
c. Generative Literature
3. Multimodal Critique
The fundamentals of Humanities - Critical Inquiry
Specialising the EDM for Digitised Manuscript (SWIB13)Kai Eckert
The DM2E project developed a data model to standardize metadata for digitized manuscripts. It specialized the Europeana Data Model (EDM) by adding over 50 new properties and 23 classes to better represent physical and conceptual aspects of manuscripts. The DM2E model was documented in PDF and OWL formats and made available online for humans and machines. Future work includes addressing uncertain statements about timespans and creators.
From DARPA to Shakespeare: All the Data we Can Handle Kimberly Hoffman
This document discusses the opportunities and challenges of big data for libraries, researchers, and digital humanities. It notes that big data is growing exponentially from sensors, internet data, and scientific instruments. Libraries and librarians have new roles to play in data management, curation, and research data services. Researchers need help with data literacy, data management plans, and archiving research data. Digital humanities can use big data and visualization to gain new insights. Standards like TEI and services like data repositories are important to enable access and reuse of data.
This document contains John Hosie's resume. It lists his work experience including positions in used car sales, sales management, and hotel management from 1981 to 2015. It also lists his qualifications such as a diploma in cookery and certificates in car sales, crowd control, and assessment and training. References and contact information are provided at the bottom.
Hay diferentes tipos de animales según su alimentación, incluyendo carnívoros que comen carne como el león, y herbívoros que comen hierba como la cebra. Todos los animales son bonitos aunque se alimenten de forma distinta.
This document provides an overview of commonly used features in PowerPoint. It demonstrates how to add and format slides, apply themes, vary text formatting, insert images from clipart or files, add animated and timed text, apply slide transitions and sounds, use shapes and diagrams, embed videos, add action buttons for navigation, and link to web content and email addresses. The goal is to serve as both a user guide and example presentation to learn PowerPoint features.
El documento resume los principales tipos de energía utilizados en Europa entre 1970 y 2000, así como los cambios en el panorama energético durante ese período. Se describen cinco tipos principales de energía en Europa, con países dependientes del carbón, la energía nuclear, el agua u otros recursos. La crisis del petróleo de 1973 impulsó el desarrollo de la energía nuclear, pero también generó resistencia debido a preocupaciones ambientales y de seguridad tras varios accidentes notables. Para el siglo XXI, la dependencia del petróleo disminuy
Toro mérida, julián poder político y conflictos sociales en la españa de la ...Antonio Vives
Este documento es una introducción a una tesis doctoral sobre el poder político y los conflictos sociales en España durante la Primera República, con un enfoque en la dictadura del General Serrano en 1874. Presenta preguntas sobre el papel de Serrano como presidente de la República en 1874 y sobre si ese periodo debió considerarse de interinidad republicana o monárquica. También cuestiona si la forma unitaria hubiera consolidado más que la federal la viabilidad del sistema republicano durante ese tiempo.
Vijay S. Dhawale is seeking a challenging career in IVF or microbiology. He has a M.Sc. in microbiology from Govt. Institute of Science in Aurangabad. He has over 2 years of experience as a junior embryologist and IVF technician. He is currently working as a junior embryologist at Aditya Birla Memorial Hospital in Pune. He has skills in IVF procedures, embryo culture, ICSI, and microbiological analysis techniques.
This document discusses the importance of 543-39 in the field of mathematics. Specifically, it notes that 543-39 is a prime number that has many unique properties and applications. While complex, 543-39 has been shown to help advance our understanding of numbers, patterns, and mathematical problem-solving techniques.
El documento presenta cuatro preguntas de investigación relacionadas con estrategias para aumentar el hábito de la lectura, la participación de los padres, el uso de las TIC y cómo despertar el interés por la lectura en los estudiantes.
North Tartan - Amsterdam, Belgium & Paris - 2014Anthony Travel
This document describes a 9-day basketball tour organized by Anthony Travel that visits Amsterdam, Belgium, and Paris. The tour includes 3 days in Amsterdam playing 2 games, 1 day each in Antwerp and Bruges in Belgium also playing 1 game each day, and 4 days in Paris playing 2 games, with guided sightseeing in each city including museums, landmarks, and cultural activities. Transportation between cities is included along with hotels, breakfasts, and a tour director.
A empresa está enfrentando desafios financeiros devido à queda nas vendas e precisa cortar custos. O diretor financeiro recomenda demitir funcionários para economizar em folha de pagamento ou negociar reduções salariais para evitar demissões. Também sugere adiar investimentos não essenciais para preservar caixa.
The document discusses quality management for project management. It defines quality, describes quality management goals like customer satisfaction and continuous improvement, and explains key quality management processes like quality planning, quality assurance, and quality control. Quality planning involves activities to develop quality management documentation. Quality assurance ensures processes are adequate, while quality control evaluates work products through inspections and testing to find defects. The document emphasizes the importance of metrics for quality assurance to track testing progress, defects, and determine release readiness.
eMargin Presentation given to Skills Funding AgencyRDUES
Presentation on the eMargin collaborative text annotation tool given to the Skills Funding Agency. Also contains description of AHRC Knowledge Transfer Fellowship project, working with A Level English Language students.
This document discusses using the MUSE software to process and deliver personal email archives in special collections. It identifies challenges with email archiving like copyright, privacy, and sensitive information. It provides examples of email collections. MUSE can be used to build search indexes, access controls, and merge emails with existing discovery portals. MUSE allows editing lexicons, grouping emails by topic or person, and generating relationship charts and summaries for web access while maintaining full individual emails for reading rooms. The document notes gaps in MUSE like more advanced search capabilities and original folder views, and proposes future work.
Learning and Text Analysis for Ontology Engineeringbutest
This document calls for papers and participation in a workshop on learning and text analysis for ontology engineering to be held in conjunction with the ECAI 2002 conference in Lyon, France. The workshop aims to bring together researchers from linguistics, natural language processing, knowledge representation, and machine learning to discuss issues around building, maintaining, and reusing ontologies and terminological resources. Topics of interest include using texts and linguistic/terminological resources as knowledge sources for building ontologies, applying machine learning and NLP tools to ontology engineering, and learning ontologies from sources like the web. The deadline for paper submissions is March 15th and for motivation abstracts is May 24th. The workshop will include paper presentations, discussions, and
The document summarizes efforts to support digital humanities research through collaboration at various institutions. It describes projects at Wheaton College involving students encoding a text using TEI XML under faculty supervision. It also discusses initiatives at the University of Vermont and Brown University to provide infrastructure and expertise for digital scholarship through partnerships between libraries, academic technology groups, and faculty researchers.
The document provides information about Samuel Kim's career portfolio, including his educational background receiving a Ph.D from University of Southern California and B.S./M.S from Yonsei University. His research has focused on audio information retrieval and social signal processing with the goal of developing machines that can understand scenes using unstructured audio signals and how humans perform and perceive behaviors in social interactions. The document also summarizes his current project on social signal processing and open challenges in his past work on acoustic topic models.
What is Digital Humanities?
What do we do under DH?
1. Digital Archives
Let us have introduction to a few projects
2. Computational Humanities
a. Using digital technology for analysis of literary text - research concerns
b. Using DT in teaching & learning - pedagogical concerns
c. Generative Literature
3. Multimodal Critique
The fundamentals of Humanities - Critical Inquiry
Specialising the EDM for Digitised Manuscript (SWIB13)Kai Eckert
The DM2E project developed a data model to standardize metadata for digitized manuscripts. It specialized the Europeana Data Model (EDM) by adding over 50 new properties and 23 classes to better represent physical and conceptual aspects of manuscripts. The DM2E model was documented in PDF and OWL formats and made available online for humans and machines. Future work includes addressing uncertain statements about timespans and creators.
From DARPA to Shakespeare: All the Data we Can Handle Kimberly Hoffman
This document discusses the opportunities and challenges of big data for libraries, researchers, and digital humanities. It notes that big data is growing exponentially from sensors, internet data, and scientific instruments. Libraries and librarians have new roles to play in data management, curation, and research data services. Researchers need help with data literacy, data management plans, and archiving research data. Digital humanities can use big data and visualization to gain new insights. Standards like TEI and services like data repositories are important to enable access and reuse of data.
This document contains John Hosie's resume. It lists his work experience including positions in used car sales, sales management, and hotel management from 1981 to 2015. It also lists his qualifications such as a diploma in cookery and certificates in car sales, crowd control, and assessment and training. References and contact information are provided at the bottom.
Hay diferentes tipos de animales según su alimentación, incluyendo carnívoros que comen carne como el león, y herbívoros que comen hierba como la cebra. Todos los animales son bonitos aunque se alimenten de forma distinta.
This document provides an overview of commonly used features in PowerPoint. It demonstrates how to add and format slides, apply themes, vary text formatting, insert images from clipart or files, add animated and timed text, apply slide transitions and sounds, use shapes and diagrams, embed videos, add action buttons for navigation, and link to web content and email addresses. The goal is to serve as both a user guide and example presentation to learn PowerPoint features.
El documento resume los principales tipos de energía utilizados en Europa entre 1970 y 2000, así como los cambios en el panorama energético durante ese período. Se describen cinco tipos principales de energía en Europa, con países dependientes del carbón, la energía nuclear, el agua u otros recursos. La crisis del petróleo de 1973 impulsó el desarrollo de la energía nuclear, pero también generó resistencia debido a preocupaciones ambientales y de seguridad tras varios accidentes notables. Para el siglo XXI, la dependencia del petróleo disminuy
Toro mérida, julián poder político y conflictos sociales en la españa de la ...Antonio Vives
Este documento es una introducción a una tesis doctoral sobre el poder político y los conflictos sociales en España durante la Primera República, con un enfoque en la dictadura del General Serrano en 1874. Presenta preguntas sobre el papel de Serrano como presidente de la República en 1874 y sobre si ese periodo debió considerarse de interinidad republicana o monárquica. También cuestiona si la forma unitaria hubiera consolidado más que la federal la viabilidad del sistema republicano durante ese tiempo.
Vijay S. Dhawale is seeking a challenging career in IVF or microbiology. He has a M.Sc. in microbiology from Govt. Institute of Science in Aurangabad. He has over 2 years of experience as a junior embryologist and IVF technician. He is currently working as a junior embryologist at Aditya Birla Memorial Hospital in Pune. He has skills in IVF procedures, embryo culture, ICSI, and microbiological analysis techniques.
This document discusses the importance of 543-39 in the field of mathematics. Specifically, it notes that 543-39 is a prime number that has many unique properties and applications. While complex, 543-39 has been shown to help advance our understanding of numbers, patterns, and mathematical problem-solving techniques.
El documento presenta cuatro preguntas de investigación relacionadas con estrategias para aumentar el hábito de la lectura, la participación de los padres, el uso de las TIC y cómo despertar el interés por la lectura en los estudiantes.
North Tartan - Amsterdam, Belgium & Paris - 2014Anthony Travel
This document describes a 9-day basketball tour organized by Anthony Travel that visits Amsterdam, Belgium, and Paris. The tour includes 3 days in Amsterdam playing 2 games, 1 day each in Antwerp and Bruges in Belgium also playing 1 game each day, and 4 days in Paris playing 2 games, with guided sightseeing in each city including museums, landmarks, and cultural activities. Transportation between cities is included along with hotels, breakfasts, and a tour director.
A empresa está enfrentando desafios financeiros devido à queda nas vendas e precisa cortar custos. O diretor financeiro recomenda demitir funcionários para economizar em folha de pagamento ou negociar reduções salariais para evitar demissões. Também sugere adiar investimentos não essenciais para preservar caixa.
The document discusses quality management for project management. It defines quality, describes quality management goals like customer satisfaction and continuous improvement, and explains key quality management processes like quality planning, quality assurance, and quality control. Quality planning involves activities to develop quality management documentation. Quality assurance ensures processes are adequate, while quality control evaluates work products through inspections and testing to find defects. The document emphasizes the importance of metrics for quality assurance to track testing progress, defects, and determine release readiness.
La Unión Europea ha acordado un paquete de sanciones contra Rusia por su invasión de Ucrania. Las sanciones incluyen restricciones a las transacciones con bancos rusos clave y la prohibición de la venta de aviones y equipos a Rusia. Los líderes de la UE también acordaron excluir a varios bancos rusos del sistema SWIFT de mensajería financiera.
Este documento describe un Punto Libre Canaima, el cual es un espacio físico ubicado en instituciones educativas que provee soporte técnico e infraestructura tecnológica mínima. Explica que el Punto Libre Canaima del CBIT "Dr. Lisandro Lecuna" brinda sus servicios a 6 escuelas de la zona de lunes a viernes por la mañana y tarde.
Este manual proporciona instrucciones para la instalación, uso y mantenimiento de una estufa de pellet canalizable modelo Mesenia. Explica los requisitos de seguridad, las características técnicas del producto, los procedimientos de instalación, operación y mantenimiento, así como la información sobre garantía. El objetivo es asegurar el uso correcto y seguro del producto.
1) The document discusses the use of mobile technology for language learning, referencing several of the author's past projects that utilized mobile and stationary technologies together.
2) It outlines different e-learning models and the role of computers in each. It also discusses communicative competence and tools that can be used for mobile language learning and analysis.
3) Several of the author's projects integrating mobile and stationary technologies for language learning are described, including resources and goals for each. Open questions about designing effective mobile language learning are also posed.
This document discusses using pedagogic corpora in English language teaching. It introduces pedagogic corpora as an alternative to directly transferring corpus linguistics research methods to the classroom. Pedagogic corpora are compiled with thematic relevance and recontextualization for authentication. The document also describes tools for annotating pedagogy in corpora and integrating corpus activities into English language and content-based instruction.
This document provides an overview of the SHEBANQ project, which provides tools for querying annotated Hebrew text data. It describes the data sources and contributors that have built up the underlying text corpus over many years. It also outlines the steps taken to make this data and related tools more accessible, including developing a website, depositing data in archives, running demonstration projects, and integrating the data and tools into broader research environments through additional projects and publications. The goal has been to facilitate wider use of this linguistic resource and foster more digital humanities and data science work based on its contents.
Slides of the paper Curation Technologies for a Cultural Heritage Archive: Analysing and transforming a heterogeneous data set into an interactive curation workbench by Georg Rehm, Martin Lee, Julián Moreno Schneider and Peter Bourgonje at the 3rd Edition of the DATeCH2019 International Conference
This document summarizes a presentation about the history and future of digital repositories and text analysis tools. It discusses how text collections have evolved from non-digital and dispersed, to digitized but dispersed, to full text collections in repositories, and finally to texts organized into corpora. However, many challenges remain, such as incomplete digitization and a lack of tools for combining close and distant reading. The document envisions a future of distributed infrastructure that connects dispersed data and tools. However, careful interpretation of results will still be needed to understand what texts are included or missing and make valid claims.
Presentation - First International Library Staff Exchange Week, ZagrebIva Vrkic
Librarians at the Faculty of Science in Zagreb provide information literacy courses for graduate students and scholars. Topics covered include using plagiarism detection software, changes in scientific publishing, and copyright issues. Plans exist to expand offerings to include workshops for freshmen. Librarians look to colleagues at the University of Zagreb for inspiration on developing robust education programs.
Libraries at Harvard and Oxford offer diverse information literacy instruction through workshops, seminars, and online/hybrid courses. Common topics are using library resources, research skills like literature reviews, data management, reference management software, and open scholarship issues. Both institutions dedicate over 50% of instruction to online formats, with the remainder split between in-person and hybrid
This document discusses techniques for analyzing text and documents. It begins by introducing Angela Zoss and her background in linguistics, human-computer interaction, and natural language processing. It then discusses analyzing text at both the low level of individual words and the high level of full documents. Examples are given of using documents to learn about language patterns over time and across corpora, as well as using language to learn about the structure and topics of documents. The document focuses on describing, organizing, and classifying documents through both manual and automated methods. Examples described include studying pronoun frequencies, mythological networks, literary fingerprints, sentiment analysis, and classification schemes like the UCSD Map of Science and NIH Map Viewer.
Qualitative text analysis and sentiment analysis techniques were used to analyze various types of text data and answer research questions. Specifically, thematic coding was used to analyze interview transcripts on the experiences of pediatric speech language pathologists. Bag of words methods and sentiment analysis using lexicons were applied to study public perception of vaccines by examining tweets. Supervised and unsupervised classification identified Jim Crow laws from North Carolina legislation based on racially-based language.
study or concern about what kinds of things exist
what entities there are in the universe.
the ontology derives from the Greek onto (being) and logia (written or spoken). It is a branch of metaphysics , the study of first principles or the root of things.
Bridging Informal MOOCs & Formal English for Academic Purposes Programmes wit...Alannah Fitzgerald
Presented at the Teaching and Language Corpora (TaLC) Conference in Lancaster on July 23, 2014. Based on collaborative work with the FLAX Language Project (Shaoqun Wu and Ian Witten) and the Language Centre at Queen Mary University of London (Martin Barge, William Tweddle, Saima Sherazi).
An-Exploration-of-scientific-literature-using-Natural-Language-ProcessingTheodore J. LaGrow
The document describes a study that used natural language processing to analyze scientific literature on specific topics and identify trends in technologies and methods used. The researchers extracted text from the first 100 articles returned for searches on Hawkes processes, galaxy evolution, T-cell receptor genomes, and natural language processing. NLP was used to identify noun phrases relating to predefined interesting words, and word clouds were generated to visualize frequencies. Results showed technologies and methods relevant to each topic. The researchers aim to continue improving the software to better connect researchers with useful tools.
Academic English With The Electronic Theses Online Service (EThOS) At The Bri...Martha Brown
This document provides an overview of a workshop on mining open access PhD theses from the British Library using the Electronic Thesis Online Service (EThOS). It discusses the FLAX language research project, EThOS, reuse of digital collections, abstracts, and tools for search, collocations, word lists and lexical bundles. It also links to the Wikipedia corpus in FLAX and discusses Wikification.
https://www.youtube.com/watch?v=5ZUlVlumIQo&list=PLqJzTtkUiq54DDEEZvzisPlSGp_BadhNJ&index=10
Over the last years, deep learning is rapidly advancing with impressive results obtained in several areas including computer vision, machine translation and speech recognition. Deep learning attempts to learn complex function through learning hierarchical representation of data. A deep learning model is composed of non-linear modules that each transforms the representation from lower layer to the higher more abstract one. Very complex functions can be learned using enough composition of the non-linear modules. Furthermore, the need for manual feature engineering can be obviated by learning features themselves through the representation learning. In this talk, we first explain how deep learning architecture in particular and neural networks in general are loosely inspired by mammalian visual cortex and nervous system respectively. We also discuss about the reason for big and successful comeback of neural networks with the deep learning models. Finally, we give a brief introduction of various deep structures and their applications to several domains.
References:
LeCun, Yann, Yoshua Bengio, and Geoffrey Hinton. "Deep learning." Nature 521.7553 (2015): 436-444.
Socher, Richard, Yoshua Bengio, and Chris Manning. "Deep learning for NLP." Tutorial at Association of Computational Logistics (ACL), 2012, and North American Chapter of the Association of Computational Linguistics (NAACL) (2013).
Lee, Honglak. "Tutorial on deep learning and applications." NIPS 2010 Workshop on Deep Learning and Unsupervised Feature Learning. 2010.
LeCun, Yann, and M. Ranzato. "Deep learning tutorial." Tutorials in International Conference on Machine Learning (ICML’13). 2013.
Socher, Richard, et al. "Recursive deep models for semantic compositionality over a sentiment treebank." Proceedings of the conference on empirical methods in natural language processing (EMNLP). Vol. 1631. 2013.
https://www.youtube.com/channel/UC9OeZkIwhzfv-_Cb7fCikLQ
https://www.udacity.com/course/deep-learning--ud730
http://www.wildml.com/2015/09/recurrent-neural-networks-tutorial-part-1-introduction-to-rnns/
Presentation on eMargin given to BCU Learning & Teaching CommitteeRDUES
The document describes a collaborative textual annotation tool called rdues.bcu.ac.uk that was developed with funding from JISC to support close reading of literary texts. It discusses limitations of traditional close reading methods and other existing annotation tools. It outlines the features of the new tool, which was tested successfully in a pilot study with students and allows annotation at the word level to facilitate close analysis and discussion of texts.
Resources at the Interface of Openness for Academic EnglishAlannah Fitzgerald
Presentation given at the Global OER Graduate Network (GO-GN) Research Track at the OpenCourseWare Consortium Global Conference 2014 in Ljubljana, Slovenia on April 24, 2014.
Digital Futures in Teacher Education workshopDEFToer3
This workshop was delivered by Anna Gruszczynska and Richard Pountney as part of the HEA-funded workshop "Promoting Digital Literacy through OER: the release, use and reuse of open educational resources" which took place at Oxford University on 5 July 2012.
This document is a curriculum vitae for Rutu Mulkar-Mehta that outlines her education, research interests, experience, publications, achievements, research projects, and academic projects. She has a PhD from the University of Southern California and research interests in natural language processing. Her experience includes positions as a graduate research assistant and invited lecturer at USC.
This play explores themes of women's legal and social status during the Renaissance period in England. Specifically, it examines the decline in women's rights that resulted from the dissolution of monasteries and convents. The play uses the story of the Duchess of Malfi to show how upper class women had little choice in marriage and were often seen as property to be exchanged between men. It also shows how women's bodies and reproductive rights were policed and controlled within patriarchal systems.
The Wife of Bath's Prologue and Tale discusses the antifeminist tradition, the Wife of Bath's experience with marriage over her 5 husbands, and her use of biblical authority to justify women's sexuality. It then summarizes a tale where a knight must complete a quest by learning what women truly desire, which he learns is sovereignty over their husbands. He is given a choice between a beautiful unfaithful wife or an ugly faithful one, and chooses the latter.
Response to Presentations at “Using Digital Humanities Research Tools in the Classroom” by Spencer Keralis, Director for Digital Scholarship and Research Associate Professor with the Digital Scholarship Co-Operative at the University of North Texas
The document provides an overview of the different genres that Shakespeare's play The Winter's Tale could be classified under, including comedy, tragedy, romance, tragicomedy, and pastoral. It examines elements from each genre that are present in the play, such as the happy endings of couples but also the king's terrible error. The document also discusses challenges to authority figures in the play and themes of infection, magic, and the relationship between rulers and subjects.
This document discusses Kenneth Branagh's 1996 film adaptation of Shakespeare's Much Ado About Nothing. It provides context on some of the key scenes and characters from the play. It also discusses different types of Shakespearean adaptations, including historical, modern dress, and translations to modern language. The document analyzes how Branagh's film version depicts the characters of Beatrice and Dogberry. It asks whether the film feels like a typical Shakespeare performance or brings something new through Branagh's cinematic influences.
Early Modern / Medieval Digital Archives: Women, DramaJessica C. Murphy
This document discusses the representation and inclusion of women in digital archives. It describes images on title pages of works by women authors such as Mary Wroth's Urania and Cary's Tragedy of Mariam. It also mentions the Feminism of the Archive project and the Poetess Archive Journal. The document appears to be about exploring digital archives and databases related to literature written by women during the Renaissance period. It includes workshops and discussions about finding information on women writers from that era and analyzing primary sources within relevant archives and databases.
This document compares and contrasts the fairy tales of Cinderella and Yeh-Shen. Both stories feature kind young girls who are mistreated by their stepmothers and stepsisters but find happiness in the end. Key elements that are discussed include the contrast between the virtuous protagonists and their vain oppressors, the magical items or animals that help the girls, and the fitting punishments that befall the wicked step relatives.
Strategies for Effective Upskilling is a presentation by Chinwendu Peace in a Your Skill Boost Masterclass organisation by the Excellence Foundation for South Sudan on 08th and 09th June 2024 from 1 PM to 3 PM on each day.
How to Fix the Import Error in the Odoo 17Celine George
An import error occurs when a program fails to import a module or library, disrupting its execution. In languages like Python, this issue arises when the specified module cannot be found or accessed, hindering the program's functionality. Resolving import errors is crucial for maintaining smooth software operation and uninterrupted development processes.
A workshop hosted by the South African Journal of Science aimed at postgraduate students and early career researchers with little or no experience in writing and publishing journal articles.
How to Setup Warehouse & Location in Odoo 17 InventoryCeline George
In this slide, we'll explore how to set up warehouses and locations in Odoo 17 Inventory. This will help us manage our stock effectively, track inventory levels, and streamline warehouse operations.
How to Make a Field Mandatory in Odoo 17Celine George
In Odoo, making a field required can be done through both Python code and XML views. When you set the required attribute to True in Python code, it makes the field required across all views where it's used. Conversely, when you set the required attribute in XML views, it makes the field required only in the context of that particular view.
LAND USE LAND COVER AND NDVI OF MIRZAPUR DISTRICT, UPRAHUL
This Dissertation explores the particular circumstances of Mirzapur, a region located in the
core of India. Mirzapur, with its varied terrains and abundant biodiversity, offers an optimal
environment for investigating the changes in vegetation cover dynamics. Our study utilizes
advanced technologies such as GIS (Geographic Information Systems) and Remote sensing to
analyze the transformations that have taken place over the course of a decade.
The complex relationship between human activities and the environment has been the focus
of extensive research and worry. As the global community grapples with swift urbanization,
population expansion, and economic progress, the effects on natural ecosystems are becoming
more evident. A crucial element of this impact is the alteration of vegetation cover, which plays a
significant role in maintaining the ecological equilibrium of our planet.Land serves as the foundation for all human activities and provides the necessary materials for
these activities. As the most crucial natural resource, its utilization by humans results in different
'Land uses,' which are determined by both human activities and the physical characteristics of the
land.
The utilization of land is impacted by human needs and environmental factors. In countries
like India, rapid population growth and the emphasis on extensive resource exploitation can lead
to significant land degradation, adversely affecting the region's land cover.
Therefore, human intervention has significantly influenced land use patterns over many
centuries, evolving its structure over time and space. In the present era, these changes have
accelerated due to factors such as agriculture and urbanization. Information regarding land use and
cover is essential for various planning and management tasks related to the Earth's surface,
providing crucial environmental data for scientific, resource management, policy purposes, and
diverse human activities.
Accurate understanding of land use and cover is imperative for the development planning
of any area. Consequently, a wide range of professionals, including earth system scientists, land
and water managers, and urban planners, are interested in obtaining data on land use and cover
changes, conversion trends, and other related patterns. The spatial dimensions of land use and
cover support policymakers and scientists in making well-informed decisions, as alterations in
these patterns indicate shifts in economic and social conditions. Monitoring such changes with the
help of Advanced technologies like Remote Sensing and Geographic Information Systems is
crucial for coordinated efforts across different administrative levels. Advanced technologies like
Remote Sensing and Geographic Information Systems
9
Changes in vegetation cover refer to variations in the distribution, composition, and overall
structure of plant communities across different temporal and spatial scales. These changes can
occur natural.
How to Build a Module in Odoo 17 Using the Scaffold MethodCeline George
Odoo provides an option for creating a module by using a single line command. By using this command the user can make a whole structure of a module. It is very easy for a beginner to make a module. There is no need to make each file manually. This slide will show how to create a module using the scaffold method.
Exploiting Artificial Intelligence for Empowering Researchers and Faculty, In...Dr. Vinod Kumar Kanvaria
Exploiting Artificial Intelligence for Empowering Researchers and Faculty,
International FDP on Fundamentals of Research in Social Sciences
at Integral University, Lucknow, 06.06.2024
By Dr. Vinod Kumar Kanvaria
The simplified electron and muon model, Oscillating Spacetime: The Foundation...RitikBhardwaj56
Discover the Simplified Electron and Muon Model: A New Wave-Based Approach to Understanding Particles delves into a groundbreaking theory that presents electrons and muons as rotating soliton waves within oscillating spacetime. Geared towards students, researchers, and science buffs, this book breaks down complex ideas into simple explanations. It covers topics such as electron waves, temporal dynamics, and the implications of this model on particle physics. With clear illustrations and easy-to-follow explanations, readers will gain a new outlook on the universe's fundamental nature.
The simplified electron and muon model, Oscillating Spacetime: The Foundation...
Laurel Stvan dh ant_conc 2/27/13
1. Digital Tools in Pedagogy:
AntConc for
Humanities Students
DFW-area Digital Humanities Colloquium
UT Dallas
February 27th, 2013
Laurel Smith Stvan
Department of Linguistics and TESOL
UT Arlington
stvan@uta.edu
@Ling_Lass
3. Digital aspects
From paper to electronic texts
Digitize
-- via scanning and OCR
-- via downloading
-- via typing
+ coding or tagging
3
4. Humanities aspects
• Tools for analysis
Concordancing software
-- quicker gathering and skimming
-- frequency, collocations
• What are the text’s characteristics?
• What are the text’s themes?
Find support in the text for your claims!
4
10. Pedagogy aspects
Assignments:
• Compile your own set of texts (LSP
corpus)
• OCR training
• Try out POS tagging
(see what annotation buys you)
• Search for particular words:
the, you, but or now, health, whisper
• Commentary and reflection 1
0
11. Pedagogy aspects
Assignments:
Compile your own set of texts
•Project Gutenberg
http://www.gutenberg.org/catalog/
•Oxford Text Archive
http://ota.ahds.ac.uk/
•The Corpus of Late Modern English Texts
https://perswww.kuleuven.be/~u0044428/
clmetev.htm
11
14. Reflection
Using tools, but also using the tools to
answer research questions…
Frequency info or knowing what words
collocate with others is not a
conclusion, but those can be a good
steps along the way to understanding
the language use you can find in large
sets of texts.
1
4
15. References to share
Anthony, Laurence. (2012). AntConc (3.2.4). Tokyo, Japan:
Waseda
University.
http://www.antlab.sci.waseda.ac.jp/antconc_index.html
Bowker, Lynne. (2002). "Capturing Data in Electronic Form."
Computer-
Aided Translation Technology. Ottawa: Univ. of Ottawa Press.
Bowker, Lynne and Jennifer Pearson. (2002).Working with
Specialized
Language: A Practical Guide toUsing Corpora. London/New
York:
Routledge.
ReadIRIS OCR. https://www.irislink.com/
DH Subject and Course Guide at UTA:
http://libguides.uta.edu/digitalhumanities
15
Editor's Notes
Good afternoon, and thanks for inviting me to talk about AntConc as a DH tool.
Three issues are relevant, and are intertwined…. Digital aspects, Humanities aspects, and pedagogy aspects.As I said, I’m talking today about AntConc, which I’ve used several times in a class for linguistics students, but which is very applicable to text analysts from lots of humanities disciplines.
One main way that this is digital, is that the students learn to move materials from paper into digital form to manipulate.When we digitize, it can take the form of inputting via scanning and then running OCR; it can involve capturing online material; or in some cases retyping older texts. In all these cases we also start to hint at ways that tagging material, or coding to capture metadata can also be of use.
As for how this is considered to be humanities, with AntConc students still do many of the traditional tasks of analysis, but now on larger sets of data…
Have used many: Conc, old hypercard stack driven for Mac. Then WordSmith tools, feature rich software that runs only on Windows machines. Have settled on AntConc.It’s appealing for several reasons… free, cross-platform, so students can download it at home as well as in the lab at school.It works with txt or xml.
Here’s the origin of its odd name… Let me do a little demo of the features students can try that are relevant as a tool for humanities projects…
One cool aspect of using lots of texts is to see which words are used the most frequently.Little functions words at the top….Juicier words are further down the list.You can see in the sidebar that I am using a bunch of 19th century essays.
Once you’ve chosen one of the juicier words from your text, or from the set of texts in your genre, you can home in on all its uses and examine them in together in their contexts. Here I’m looking at whispered. But that may lead you to look at how a certain part of speech is used. Say I wanted to find all the past tense verbs. Or, all the past participles.
If I had text files tagged for parts of speech, I could look for all the verbs marked as past participle and pull them all up together. This list is from a different set of texts that seems to involve a lot of speaking….Now that you can see how easily students can find patterns in large sets of texts, let’s think about the pedagogical aspects.
4) For linguists it’s useful to compare the distribution of function words like articles, pronouns, and conjunctions with content words like adverbs, nouns and verbs. But most likely you’ll already have a word in mind for your own reasons that you want to the check for to find a bunch of examples and their patterns.5) Lastly, it’s important that the task not just be how to run the software, but also emphasize discussing the results and their ramifications.
As far as compiling your own texts to look at. A couple sources of existing public domain texts are useful—Project Gutenberg and the Oxford Text archive. Hendrik De Smet at the university of Leuven had gathered together texts from these first two sources into a very neat corpus of late modern English text that you can obtain for free. In the case of more modern texts, my students often work on compiling the texts of understudied languages that they are exploring, or, work with more modern sources like Tweets and texts. This last source offers some available Twitter corpora.
Lots of times students have a particular language or a particular era they are interested in studying. To get material, they may need to scan in paper pages. This is a Scottish-English dictionary that I chose for an exercise for my students. As a lesson in OCR issues, It has text that curves into the gutter, and small letters in the pronunciation that make it trickier than it first appears to use OCR on.
For a real challenge in how OCR works, here’s a bit of realia to work with. These receipts have a variety of fonts including dot matrix printing, bleed through from the back of the paper, and wrinkly pages.
Lastly, a note on how the using the tool is not the totality of the research. I want them to use to use tools like AntConc, but besides knowing how to use the tools, I want them to be able to use it to answer research questions. Frequency information, or knowing what words in a text collocate with others in not itself a conclusion. But those can be good steps along the way to understanding the language use one can find in large collections of texts.
As references I have not so much works cited, as useful sources I wanted to share today. The first three are sources I’m using in my corpus class this semester. For the fourth, there are many OCR programs out there. But the interactive “learn” feature in ReadIris programs are pretty fun and instructive.The last is a plug for the DH site maintained by our subject librarian at UTA, Rafia Mirza.Thank you!