Harnessing the “data deluge” is promoting new conversations between disciplines. Prof. Marciano and his collaborators have been pursuing research in a number of areas including: big cultural data, access to big heterogeneous data, records in the cloud, federated grid/cloud storage, visual interfaces to large collections, policy-based frameworks to automate content management, and distributed cyberinfrastructure to enable data sharing. But more importantly, innovative technical approaches require the convergence of creative insights across computer science, the social sciences, and the humanities. This talk touches on these topics and highlights a new collaboration with partners at Duke.
Richard Marciano is a professor in the School of Information and Library Science at the University of North Carolina at Chapel Hill, Director of the Sustainable Archives and Leveraging Technologies (SALT) lab, and co-director of the Digital Innovation Lab (DIL). He leads development of "big data" projects funded by Mellon, NSF, NARA, NHPRC, IMLS, DHS, NIEHS, and UNC. Recent 2012 grants include a JISC Digging into Data award with UC Berkeley and the U. of Liverpool, called "Integrating Data Mining and Data Management Technologies for Scholarly Inquiry," a Mellon / UNC award called "Carolina Digital Humanities Initiative," which involves the translating of big data challenges into curricular opportunities, and an NSF award on big heterogeneous data integration.
He holds a B.S. in Avionics and Electrical Engineering, and an M.S. and Ph.D. in Computer Science, and has worked as a postdoc in Computational Geography. He conducted interdisciplinary research at the San Diego Supercomputer at UC San Diego, working with teams of scholars in sciences, social sciences, and humanities.
Presentation from the official launch event for Pulse Lab Jakarta, held in Indonesia on 1 October 2012. Presentation includes a background on "Big Data for Development," a showcase of Pulse Lab Jakarta's initial social media analysis research results, and roadmap for the Lab. http://www.unglobalpulse.org/PLJLaunch
Designing intelligent social systems 121205Ramesh Jain
With emerging technologies and big data, it is now possible to design intelligent social systems. In this presentation, ideas related to designing such systems are presented
Buy Embedded Systems Projects,B tech Final Year Projects OnlineTechnogroovy
Get In Touch:
Technogroovy Systems India Pvt. Ltd.
www.technogroovy.com
http://www.technogroovy.com/index.php/student-zone/final-year-project
Email Id: technogroovy@gmail.com
Connect with us On Facebook:
https://www.facebook.com/Technogroovyindia
Presentation from the official launch event for Pulse Lab Jakarta, held in Indonesia on 1 October 2012. Presentation includes a background on "Big Data for Development," a showcase of Pulse Lab Jakarta's initial social media analysis research results, and roadmap for the Lab. http://www.unglobalpulse.org/PLJLaunch
Designing intelligent social systems 121205Ramesh Jain
With emerging technologies and big data, it is now possible to design intelligent social systems. In this presentation, ideas related to designing such systems are presented
Buy Embedded Systems Projects,B tech Final Year Projects OnlineTechnogroovy
Get In Touch:
Technogroovy Systems India Pvt. Ltd.
www.technogroovy.com
http://www.technogroovy.com/index.php/student-zone/final-year-project
Email Id: technogroovy@gmail.com
Connect with us On Facebook:
https://www.facebook.com/Technogroovyindia
How to plan and conduct hypotheis based science projects for A/L school project.
The project can be presented to National Science and Engineering Fair or to Google Science fair projects
This Slide was collected from a seminar "Machine Learning for Data Mining" which was arranged in Daffodil International University.The Chief Guest was Dr. Dewan Md. Farid. He made this wonderful Slide for described to us about Data Mining. He also shared his research experience which was just amazing.Totally unpredictable speech it was from Dr. Dewan Md. Farid Sir. He is one of the famous researcher.I hope , you will enjoy this slide. Details about Dr. Dewan Md. Farid sir is given below in this link
https://ai.vub.ac.be/members/dewan-md-farid
Prediction APIs are democratizing Machine Learning. They make it easier for developers to build smart features in their apps by abstracting away some of the complexities of building and deploying predictive models. In this talk we’ll look at the possibilities and limitations of ML, how to use Prediction APIs, how to prepare data to send to them, and how to assess performance.
Keynote on Crowd Computing presented at The 5th International Joint Conference on Natural Language Processing (IJCNLP2011) on November 10th in Chiang Mai, Thailand
In this presentation from the Hurricane Electric Carrier Event, Rich Brueckner from insideBIGDATA describes what's really behind this phenomenon and why you should care.
Watch the video presentation: http://wp.me/p3RLEV-1r1
Sirris innovate2011 - Smart Products with smart data - introduction, Dr. Elen...Sirris
This lecture highlights current trends, challenges and opportunities related to the emergence of large amounts of data. It also presents Sirris’s recent research activities in this domain.
How to plan and conduct hypotheis based science projects for A/L school project.
The project can be presented to National Science and Engineering Fair or to Google Science fair projects
This Slide was collected from a seminar "Machine Learning for Data Mining" which was arranged in Daffodil International University.The Chief Guest was Dr. Dewan Md. Farid. He made this wonderful Slide for described to us about Data Mining. He also shared his research experience which was just amazing.Totally unpredictable speech it was from Dr. Dewan Md. Farid Sir. He is one of the famous researcher.I hope , you will enjoy this slide. Details about Dr. Dewan Md. Farid sir is given below in this link
https://ai.vub.ac.be/members/dewan-md-farid
Prediction APIs are democratizing Machine Learning. They make it easier for developers to build smart features in their apps by abstracting away some of the complexities of building and deploying predictive models. In this talk we’ll look at the possibilities and limitations of ML, how to use Prediction APIs, how to prepare data to send to them, and how to assess performance.
Keynote on Crowd Computing presented at The 5th International Joint Conference on Natural Language Processing (IJCNLP2011) on November 10th in Chiang Mai, Thailand
In this presentation from the Hurricane Electric Carrier Event, Rich Brueckner from insideBIGDATA describes what's really behind this phenomenon and why you should care.
Watch the video presentation: http://wp.me/p3RLEV-1r1
Sirris innovate2011 - Smart Products with smart data - introduction, Dr. Elen...Sirris
This lecture highlights current trends, challenges and opportunities related to the emergence of large amounts of data. It also presents Sirris’s recent research activities in this domain.
Curriculum Development at the Tetherless World Constellation - Peter Fox - RD...ASIS&T
Curriculum development at the Tetherless World Constellation – the days after the “Day One” initiative
Peter Fox (RPI and WHOI)
Tetherless World Constellation
Training Data Management Practitioners panel
Presentation at Research Data Access & Preservation Summit 23 March 2012
Workshop session given at the Institutional Web Management Workshop 2012 (IWMW 2012) event held at the University of Edinburgh on 18th - 20th June 2012.
JMeter webinar - integration with InfluxDB and GrafanaRTTS
Watch this recorded webinar about real-time monitoring of application performance. See how to integrate Apache JMeter, the open-source leader in performance testing, with InfluxDB, the open-source time-series database, and Grafana, the open-source analytics and visualization application.
In this webinar, we will review the benefits of leveraging InfluxDB and Grafana when executing load tests and demonstrate how these tools are used to visualize performance metrics.
Length: 30 minutes
Session Overview
-------------------------------------------
During this webinar, we will cover the following topics while demonstrating the integrations of JMeter, InfluxDB and Grafana:
- What out-of-the-box solutions are available for real-time monitoring JMeter tests?
- What are the benefits of integrating InfluxDB and Grafana into the load testing stack?
- Which features are provided by Grafana?
- Demonstration of InfluxDB and Grafana using a practice web application
To view the webinar recording, go to:
https://www.rttsweb.com/jmeter-integration-webinar
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...Ramesh Iyer
In today's fast-changing business world, Companies that adapt and embrace new ideas often need help to keep up with the competition. However, fostering a culture of innovation takes much work. It takes vision, leadership and willingness to take risks in the right proportion. Sachin Dev Duggal, co-founder of Builder.ai, has perfected the art of this balance, creating a company culture where creativity and growth are nurtured at each stage.
Transcript: Selling digital books in 2024: Insights from industry leaders - T...BookNet Canada
The publishing industry has been selling digital audiobooks and ebooks for over a decade and has found its groove. What’s changed? What has stayed the same? Where do we go from here? Join a group of leading sales peers from across the industry for a conversation about the lessons learned since the popularization of digital books, best practices, digital book supply chain management, and more.
Link to video recording: https://bnctechforum.ca/sessions/selling-digital-books-in-2024-insights-from-industry-leaders/
Presented by BookNet Canada on May 28, 2024, with support from the Department of Canadian Heritage.
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...DanBrown980551
Do you want to learn how to model and simulate an electrical network from scratch in under an hour?
Then welcome to this PowSyBl workshop, hosted by Rte, the French Transmission System Operator (TSO)!
During the webinar, you will discover the PowSyBl ecosystem as well as handle and study an electrical network through an interactive Python notebook.
PowSyBl is an open source project hosted by LF Energy, which offers a comprehensive set of features for electrical grid modelling and simulation. Among other advanced features, PowSyBl provides:
- A fully editable and extendable library for grid component modelling;
- Visualization tools to display your network;
- Grid simulation tools, such as power flows, security analyses (with or without remedial actions) and sensitivity analyses;
The framework is mostly written in Java, with a Python binding so that Python developers can access PowSyBl functionalities as well.
What you will learn during the webinar:
- For beginners: discover PowSyBl's functionalities through a quick general presentation and the notebook, without needing any expert coding skills;
- For advanced developers: master the skills to efficiently apply PowSyBl functionalities to your real-world scenarios.
Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...Jeffrey Haguewood
Sidekick Solutions uses Bonterra Impact Management (fka Social Solutions Apricot) and automation solutions to integrate data for business workflows.
We believe integration and automation are essential to user experience and the promise of efficient work through technology. Automation is the critical ingredient to realizing that full vision. We develop integration products and services for Bonterra Case Management software to support the deployment of automations for a variety of use cases.
This video focuses on the notifications, alerts, and approval requests using Slack for Bonterra Impact Management. The solutions covered in this webinar can also be deployed for Microsoft Teams.
Interested in deploying notification automations for Bonterra Impact Management? Contact us at sales@sidekicksolutionsllc.com to discuss next steps.
Key Trends Shaping the Future of Infrastructure.pdfCheryl Hung
Keynote at DIGIT West Expo, Glasgow on 29 May 2024.
Cheryl Hung, ochery.com
Sr Director, Infrastructure Ecosystem, Arm.
The key trends across hardware, cloud and open-source; exploring how these areas are likely to mature and develop over the short and long-term, and then considering how organisations can position themselves to adapt and thrive.
GraphRAG is All You need? LLM & Knowledge GraphGuy Korland
Guy Korland, CEO and Co-founder of FalkorDB, will review two articles on the integration of language models with knowledge graphs.
1. Unifying Large Language Models and Knowledge Graphs: A Roadmap.
https://arxiv.org/abs/2306.08302
2. Microsoft Research's GraphRAG paper and a review paper on various uses of knowledge graphs:
https://www.microsoft.com/en-us/research/blog/graphrag-unlocking-llm-discovery-on-narrative-private-data/
"Impact of front-end architecture on development cost", Viktor TurskyiFwdays
I have heard many times that architecture is not important for the front-end. Also, many times I have seen how developers implement features on the front-end just following the standard rules for a framework and think that this is enough to successfully launch the project, and then the project fails. How to prevent this and what approach to choose? I have launched dozens of complex projects and during the talk we will analyze which approaches have worked for me and which have not.
Epistemic Interaction - tuning interfaces to provide information for AI supportAlan Dix
Paper presented at SYNERGY workshop at AVI 2024, Genoa, Italy. 3rd June 2024
https://alandix.com/academic/papers/synergy2024-epistemic/
As machine learning integrates deeper into human-computer interactions, the concept of epistemic interaction emerges, aiming to refine these interactions to enhance system adaptability. This approach encourages minor, intentional adjustments in user behaviour to enrich the data available for system learning. This paper introduces epistemic interaction within the context of human-system communication, illustrating how deliberate interaction design can improve system understanding and adaptation. Through concrete examples, we demonstrate the potential of epistemic interaction to significantly advance human-computer interaction by leveraging intuitive human communication strategies to inform system design and functionality, offering a novel pathway for enriching user-system engagements.
Epistemic Interaction - tuning interfaces to provide information for AI support
Socializing Big Data: Collaborative Opportunities in Computer Science, the Social Sciences, and the Humanitiesno
1. "Socializing 'Big Data':
Collaborative Opportunities in
Computer Science, the Social Sciences, and the Humanities"
Richard Marciano
UNC Chapel Hill
richard_marciano@unc.edu
http://salt.unc.edu
http://digitalinnovation.unc.edu
2. Current research Areas
•records in the cloud,
•big cultural data,
•access to big heterogeneous data,
•federated grid/cloud storage,
•visual interfaces to large collections,
•policy-based frameworks to automate content management,
•distributed cyberinfrastructure to enable data sharing.
3. Records in the Cloud
Kickoff meeting on Feb. 5, 2013
•UBC iSchool, Faculty of Law, School of Bus.
•UW iSchool
•Mid-Sweden Info. Tech and Media,
Delegating to cloud providers the responsibility for security,
accessibility, disposition and preservation.
4. • Grids in Context
1998 •
• Larry Smarr
Computational Grids
• Ian Foster and Carl Kesselman
• Distributed Supercomputing Applications
• Paul Messina
• Realtime Widely Distributed Instrumentation
• William E. Johnston
• Data-Intensive Computing
• Reagan Moore, … Richard Marciano, …
• Teleimmersion
• Tom DeFanti and Rick Stevens
• Application-Specific Tools
• Henri Casanova, Jack Dongarra, …
• Compilers, Languages, and Libraries
• Ken Kennedy
• Object-Based Approaches
• Dennis Gannon, Andrew Grimshaw
• High-Performance Commodity Computing
• Geoffrey Fox, Wojtek Furmanski
• The Globus Toolkit
• Ian Foster, Carl Kesselman
• High-Performance Schedulers
• Francine Berman
• High-Throughput Resource Management
• Miron Livny, Rajesh Raman
• Instrumentation and Measurement
• Jeffrey Hollingsworth, Bart Miller
• Performance Analysis and Visualization
• Daniel Reed, Randy Ribler
• Security, Accounting, and Assurance
• Clifford Neuman
•
2003 Computing Platforms
Tony Hey: • Andrew Chien
“The Data Deluge: An e-Science Perspective” • Network Protocols
• P.M. Melliar-Smith, Louise Moser
• Network Quality of Service
• Roch Guerin, Henning Schultzrinne
• Operating Systems and Network Interfaces
• Peter Druschel, Larry Peterson
• Network Infrastructure
Collaborative Science
2004 •
• Jon Postel, Joe Touch
Testbeds: Bridges from Research to Infrastructure
• Charlie Catlett, John Toole
5. Big Data is a Big Deal
White House announcement:
http://www.whitehouse.gov/blog/2012/03/29/big-data-big-deal
Big Data Across the Federal Government:
http://www.whitehouse.gov/sites/default/files/microsites/ostp/big_data_fact_sheet_final_1.pdf
More then $200M in new commitments (NSF, HHS/NIH, DOE, DOD, DARPA, USGS)
Goal: “improve the ability to extract knowledge and insights from large and complex
collections of digital data”.
DataNet
Long-term preservation and access of data
Software Infrastructure for Sustained Innovation (SI2)
Digging Into Data Challenge (NSF/NEH/IMLS & JISC)
Computational Humanities
Cyber-Enabled Discovery and Innovation (CDI)
Data enabled science and engineering
Core Techniques and Technologies for Advancing Big
Data Science & Engineering (BIGDATA)
Data Infrastructure Building Blocks (DIBBs)
DataWay
National Infrastructure for Heterogeneous Data
6. “Size Matters:
Big Data, New Vistas in the Humanities and Social Sciences”:
DataEdge, UC Berkeley May 31, 2012
Geoffrey Nunberg Panel:
“Something seems to happen, people feel, when you
get to that 13th zero, or 15th zero, or 18th zero, or 21st
zero, wherever it is, and bingo it’s the petabyte age, it’s
the age of big data.
It’s like combing your hair, you just comb, and comb,
and comb, and all of a sudden it’s like big hair.”
“The question is whether the advent of big data
changes the way we do social science and also what
role social scientists will play…”
12/31/2012 Forbes article by Edd Dumbill: “Big Data, Big Hype: Big Deal”
“Big data is an imprecise term. As such it’s a huge boon to marketers… not
everyone is pleased with the “bigger is better” argument. “Big data” really means
“smart use of data”.
7. Allistair Croll: “Big Data is our Generation’s civil rights issue, an we don’t know it.”
“Personalization” is another word for discrimination. We’re not discriminating if we tailor
things to you based on what we know about you — right? That’s just better service.
When bank managers tried to restrict loans to residents of
certain areas (known as redlining) Congress stepped in to
stop it (with the Fair Housing Act of 1968). They were able
to legislate against discrimination, making it illegal to change
loan policy based on someone’s race.
Home Owners’ Loan Corporation map showing redlining of “hazardous”
districts in 1936. see: DURHAM MAPS for T-RACES –project
Music selection and sharing with friends could allow to guess a person’s
racial background and deny a loan.
Publicly available last name information can
be used to generate racial boundary maps.
From the Mapping London project
10. May 2007
Socializing CI:
Networking the Humanities,
Arts, and Social Sciences
11. TUCASI data-Infrastructure Project (TIP)
TUCASI data-Infrastructure Project (TIP)
Managing Digital Research Data in Federated Storage
Managing Digital Research Data in Federated Storage
Clouds
Clouds
• Project Lead: Richard Marciano (UNC/SALT)
• Project Manager: Amy Shoop (UNC ITS)
• Oversight Council
– CIOs -- Head Librarians
• Tracy Futhey -- Duke CIO Deborah Jakubs -- Duke Librarian
• Marc Hoit – NCSU CIO Susan Nutter – NCSU Librarian
• Larry Conrad – UNC CIO Sara Michalak – UNC Librarian
– RENCI
• Alan Blatecky -- RENCI Stan Ahalt -- RENCI
– DICE Center
• Reagan Moore – DICE
– SALT Lab
• Richard Marciano -- SALT
12. Focus Group Membership
University Teams
Focus
Duke Chapel Hill NC State
Groups
Suzanne Cadwell (ITS-Academic
Classroom Samantha Earp (CC Outreach & Engagement) Lou Harrison (DELTA)
lead) (OIT-Academic Charlie Greene (ITS-Teaching & Hal Meeks (OIT-Outreach,
Capture Services) Learning) Communications and Consulting)
Pam Sessoms (Lib-e-Reference)
Amy Brooks (OIT-Systems)
Klara Jelinkova (OIT- Reagan Moore (S lead) (DICE)
Shared Services & Leesa Brieger (RENCI-Data)
Infrastructure) Brent Caison (ITS-Storage) Steve Morris (Lib-Systems)
Storage David Kennedy (Lib-Info. Dave Pcolar (Lib-Systems) Eric Sills (OIT-Research Computing)
Sys. Support) Bill Schulz (Lib-Systems)
Molly Tamarkin (Lib- Lisa Stillwell (RENCI-Data)
Systems)
Jim Tuttle (Lib-Systems)
Future Data & Paolo Mangiafico (Provost- Ruth Marinshaw (ITS-Research Kristin Antelman (FD&P lead)
Computing)
Dig. Info. Strategy) (Lib)
Policy Tim Pyatt (Lib-Archives) Will Owen (Lib-Systems) Susan Nutter (Lib-Head Librarian)
Rich Szary (Lib-Special Collections)
14. “Public Scholarship”
Kathy Woodward, UW Simpson Center for the Humanities
UNC, Duke, Asheville collaboration
• University of North Carolina Asheville (UNCA): staff (provost, head librarian, head of
special collections, library staff, departments of computer science / history / political
science), centers (National Environmental Modeling and Analysis Center / Center for
Diversity Education), and students
• community-based development organizations (Green Opportunities Corps, Asheville
Design Center)
• neighborhood community group leaders and residents (Southside, Burton Street, East
End)
• city of Asheville officials (Housing Authority of the City of Asheville, Planning &
Development Department, West Asheville Public Library, Chamber of Commerce)
• county (head of Buncombe County Register of Deeds, Land-Of-Sky Regional Council)
• other groups including the North Carolina Humanities Council, Mountain Housing
Opportunities Inc.
• “Twilight of a Neighborhood: Asheville’s East End, 1970” project. This project
examined the process and aftermath of urban renewal and collected voices of residents,
after the 2007 transfer of records to UNC Asheville. We have secured support
and commitment from the community groups relevant to tackling this project.
• Asheville’s African-American Community Historical Bus Tour, June 19, 2012 (35
people)
15.
16.
17.
18. UNCA & Asheville Partners:
• Dwight Mullen, UNCA Political Science
• Priscilla Ndiaye, chair of Asheville's Southside Advisory Commi
19. Big Heterogeneous Data (with Duke)
Mapping historical residential segregation in the
US
Researching the cyberinfrastructure implications of
supporting large scale content based indexing of highly
heterogeneous digital collections potentially embodying non-
uniform or sparse metadata architectures…
Intellectual Merit:
Demonstrating the creation of national collections through automation and citizen-
scientist crowdsourcing efforts is the focus of this task.
Broader Impacts:
This case-study will bring heterogeneous content from a variety of sources:
census, economic, historic, planning, insurance, financial, and scientific.
Outcomes:
Worfklows & Visual prototype
20. From Crowdsourcing to Citizen-led Sourcing
• Neighborhood community group leaders and residents (Southside, Burton Street, East End)
• University of North Carolina Asheville staff (provost, head of special collections, library staff, departments of
computer science / history / political science), centers (Renaissance Computing Institute / Center for Diversity
Education), and students
• Community-based development organizations (Green Opportunities Corps, Asheville Design Center)
• City of Asheville officials (Housing Authority of the City of Asheville, Register of Deeds, GIS, Planning &
Development Department, , West Asheville Public Library, Chamber of Commerce, Regional Council)
• Other groups including the North Carolina Humanities Council, Mountain Housing Opportunities Inc., Twilight
of a Neighborhood.
21. SALT
SALT
“We define the ‘discipline of data curation’ as the practice of collection,
annotation, conditioning , and preservation of data for both current and future
use”
– Helen Tibbo & Bryan H eidorn
Governance Policy
conditioning annotation
Content
collection
Infrastructure
preservation
current & future use
Evolution
Vectors – Annenberg Center for Communication SDSC: SALT
Editor's Notes
Thank you for having me. Hood Canal… by Union on the other side of Bremerton… Tacoma & Ballard. South Lake Union by Amazon. Thank you for your hospitality. I understand you have a number of searches going on… This is quite a mouthfull… of trendy terms…
Obama administration’s Open Government Initiative, which encourages public participation and collaboration. “ citizen sourcing” which has been defined as the “government adoption of crowdsourcing techniques for the purposes of (1) enlisting citizens in the design and execution of government services and to (2) tapping into the citizenry’s collective intelligence.” Vivek Kundra, Chief Information Officer of the United States from March 2009, to August 2011 under President Obama, described citizen sourcing as a way of driving “innovation by tapping into the ingenuity of the American people to solve those problems that are too big for government to solve on its own.” In the International Journal of Public Participation article, “ Citizensourcing: Applying the Concept of Open Innovation to the Public Sector, ” the authors present “ a structural overview of how external collaboration and innovation between citizens and public administrations can offer new ways of citizen integration and participation, enhancing public value creation and even the political decision-making process. ” Citizen sourcing is derived from the term crowdsourcing and emphasizes the type of civic engagement typically enabled through Web 2.0 participatory technologies, over a more impersonal crowd-based distributed problem-solving and production model. There are many excellent studies on the value of crowdsourcing for libraries, archives and museums. . The Archivist of the United States, David Ferriero, introduced the concept of “citizen archivists” in 2010. He made a parallel with citizen scientists and spoke of increasing public engagement in the archives given the National Archives and Records Administration’s over-abundance of paper records and need to digitize and transcribe them. He concluded that it wasn’t clear yet what types of citizen archivist projects were possible. at the August 2011 Society of American Archivists (SAA) annual meeting in Chicago Kate Theimer offered the following definition: Participatory Archive : An organization, site or collection in which people other than archives professionals contribute “knowledge or resources, resulting in increased understanding about archival materials, usually in an online environment.”