Draft current state of digital forensic and data science Damir Delija
In this presentation we will introduce current state of digital forensics, its positioning in general IT security and relations with data science and data analyses. Many strong links exist among this technical and scientific fields, usually this links are not taken into consideration. For data owners, forensic researchers and investigators this connections and data views presents additional hidden values.
Draft current state of digital forensic and data science Damir Delija
In this presentation we will introduce current state of digital forensics, its positioning in general IT security and relations with data science and data analyses. Many strong links exist among this technical and scientific fields, usually this links are not taken into consideration. For data owners, forensic researchers and investigators this connections and data views presents additional hidden values.
A talk describing the field of computer forensics and its relation with incident response. Live forensics, timelines, registry, smartphones, cloud forensic, ethics, writing and defending reports are issues that will be covered.
Digital forensics research: The next 10 yearsMehedi Hasan
Today’s Golden Age of computer forensics is quickly coming to an end. Without a clear strategy for enabling research efforts that build upon one another, forensic research will fall behind the market, tools will become increasingly obsolete, and law enforcement, military and other users of computer forensics products will be unable to rely on the results of forensic analysis. This article summarizes current forensic research directions and argues that to move forward the community needs to adopt standardized, modular approaches for data representation and forensic processing.
@2010 Digital Forensic Research Workshop. Published by Elsevier Ltd. All rights reserved
As our digital records are likely to be cyber-breached several times and/or we all have to deal with legal proceedings, learn how to use digital forensics experts efficiently.
For better or worse, electronic data is at the heart of many legal investigations. Therefore, it is becoming increasingly important for lawyers to have a basic understanding of computer forensics including:
- what computer forensics is and what types of things can a computer forensic expert do;
- types of mistakes lawyers or IT professionals make that can corrupt, alter, or destroy evidence that is key to investigations;
what types of electronic evidence exists;
- ways to work efficiently and effectively with a computer forensic expert; and
- when to consider hiring and how to choose a computer forensic expert as part of an investigation
Learn more from Winston & Strawn and listen to the presentation here: https://www.winston.com/en/thought-leadership/computer-forensics-what-every-lawyer-needs-to-know.html.
HTML5 and the dawn of rich mobile web applicationsJames Pearce
HTML5 and its related technologies are enabling new ways to build beautiful sites and applications for contemporary mobile devices. Native mobile developers can now use web technologies to surmount cross-platform headaches, and desktop web developers can reach mobile users in familiar, app-like ways. This session explores the state of the art in HTML5-based mobile web frameworks, and demonstrates the practical possibilities that this powerful and standards-based approach can bring.
A talk describing the field of computer forensics and its relation with incident response. Live forensics, timelines, registry, smartphones, cloud forensic, ethics, writing and defending reports are issues that will be covered.
Digital forensics research: The next 10 yearsMehedi Hasan
Today’s Golden Age of computer forensics is quickly coming to an end. Without a clear strategy for enabling research efforts that build upon one another, forensic research will fall behind the market, tools will become increasingly obsolete, and law enforcement, military and other users of computer forensics products will be unable to rely on the results of forensic analysis. This article summarizes current forensic research directions and argues that to move forward the community needs to adopt standardized, modular approaches for data representation and forensic processing.
@2010 Digital Forensic Research Workshop. Published by Elsevier Ltd. All rights reserved
As our digital records are likely to be cyber-breached several times and/or we all have to deal with legal proceedings, learn how to use digital forensics experts efficiently.
For better or worse, electronic data is at the heart of many legal investigations. Therefore, it is becoming increasingly important for lawyers to have a basic understanding of computer forensics including:
- what computer forensics is and what types of things can a computer forensic expert do;
- types of mistakes lawyers or IT professionals make that can corrupt, alter, or destroy evidence that is key to investigations;
what types of electronic evidence exists;
- ways to work efficiently and effectively with a computer forensic expert; and
- when to consider hiring and how to choose a computer forensic expert as part of an investigation
Learn more from Winston & Strawn and listen to the presentation here: https://www.winston.com/en/thought-leadership/computer-forensics-what-every-lawyer-needs-to-know.html.
HTML5 and the dawn of rich mobile web applicationsJames Pearce
HTML5 and its related technologies are enabling new ways to build beautiful sites and applications for contemporary mobile devices. Native mobile developers can now use web technologies to surmount cross-platform headaches, and desktop web developers can reach mobile users in familiar, app-like ways. This session explores the state of the art in HTML5-based mobile web frameworks, and demonstrates the practical possibilities that this powerful and standards-based approach can bring.
A talk I gave on to the brilliant young people currently on the Squared course, about the social web, marketing, and collaborative ways of working around this. Learn more about Squared here - https://plus.google.com/+WeareSquared/posts
Managing Professional Information Overload (K12 Version)Heather Braum
This presentation was given at an inservice for Kaw Valley USD 321 School District in January 2012 on Managing Information Overload. It is targeted at educators, but most people in any profession would find it useful!
Interlinking educational data to Web of Data (Thesis presentation)Enayat Rajabi
This is a thesis presentation about interlinking educational data to Web of Data. I explain how I used the Linked Data approach to expose and interlink educational data to the Linked Open Data cloud
DBpedia - An Interlinking Hub in the Web of DataChris Bizer
Given and overview about the DBpedia project and the role of DBpedia in the Web of Data and outlines the next steps from the Dbpedia project as well as ideas for using DBpedia data within the BBC.
For PDF go here: http://www.inkblurt.com/2008/04/15/linkosophy/
Please view full screen to be able to see notes! This was the closing "Plenary" for IA Summit 2008. Hope it starts some new, even better conversations.
The deep web is the world wide web content, but that is not the part of the surface web. Which is indexed by standard search engines. Which can’t be accessed by the conventional search engines. 400 to 500 time more public information are included in the deep web than the surface web. The total quantity of the deep web is 1000 to 2000 time greater than the surface web.
Lectio Praecursoria: Search Interfaces on the Web: Querying and Characterizin...Denis Shestakov
Lectio Praecursoria on my PhD dissertation titled "Search Interfaces on the Web: Querying and Characterizing" given in ICT building, Turku, Finland on June 12, 2008
Thesis contributions:
* Querying search interfaces
* Deep Web characterization
* Finding web databases
The text of thesis is available at http://www.slideshare.net/denshe/shestakov2008-search-interfacesonthewebqueryingandcharacterizing
Internet and its Applications.
@ Kindly Follow my Instagram Page to discuss about your mental health problems-
-----> https://instagram.com/mentality_streak?utm_medium=copy_link
@ Appreciate my work:
-----> behance.net/burhanahmed1
Thank-you !
The Next Web of Linked Data -- University of St Thomas SEIS 708Jay Myers
With hundreds of millions of active web sites on the web and millions of
new web pages added every day, the knowledge contained on the internet far
surpasses even the largest cache of big data. Today, many web developers
still publish web pages and services solely for human consumption.
However, there is a growing movement to put machine friendly data on the
web in various forms to create a web of Linked Data rather than a web
comprised of simple documents.
This talk will examine the history, technology and examples around Linked
Data, onotologies, and the schema.org movement that will enable a smarter
web -- not just for search, but also next-gen data tools and applications
that will connect entities and facts from across the internet, all enabled
by developers publishing data directly to the web.
Uvođenje novih sadržaja u nastavu digitalne forenzike i kibernetičke sigurnos...Damir Delija
Sažetak - U ovom radu razmatramo načine kontinuiranog uvođenje novih sadržaja u predmete s područja kibernetičke sigurnosti. Kao primjer navodimo „Osnove računalne forenzike“ u koji se novi sadržaji uvode korištenjem studentskih praktičnih i teoretskih radova, ideje za radove predlažu studenti i predavači. Predloženi postupak se sastoji iz testiranja kroz studentski rad, te ugradnje rezultata u nastavne materijale. Da bi se studentski rad uspješno koristio mora zadovoljiti niz zahtjeva: prilagođenost stupnju znanja studenta i raspoloživoj opremi, raspoloživost alata i sustava, jednostavna implementacija i prenosivost, upotreba alata otvorenog koda i slobodnih alata, te minimalna cijena.
Concepts and Methodology in Mobile Devices Digital Forensics Education and Tr...Damir Delija
One of draft versios of "Concepts and Methodology in Mobile Devices Digital Forensics Education and Training",
Abstract - This paper presents various issues in digital forensics of mobile devices and how to address these issues in the related education and training process. Mobile devices forensics is a new, very fast developing field which lacks standardization, compatibility, tools, methods and skills. All this drawbacks have impact on the results of forensic process and also have deep influence in training and education process. In this paper real life experience in training is presented, with tools, devices, procedures and organization with purpose to improve process of mobile devices forensics and mobile forensic training and education
Operation “Blue Star” is the only event in the history of Independent India where the state went into war with its own people. Even after about 40 years it is not clear if it was culmination of states anger over people of the region, a political game of power or start of dictatorial chapter in the democratic setup.
The people of Punjab felt alienated from main stream due to denial of their just demands during a long democratic struggle since independence. As it happen all over the word, it led to militant struggle with great loss of lives of military, police and civilian personnel. Killing of Indira Gandhi and massacre of innocent Sikhs in Delhi and other India cities was also associated with this movement.
June 3, 2024 Anti-Semitism Letter Sent to MIT President Kornbluth and MIT Cor...Levi Shapiro
Letter from the Congress of the United States regarding Anti-Semitism sent June 3rd to MIT President Sally Kornbluth, MIT Corp Chair, Mark Gorenberg
Dear Dr. Kornbluth and Mr. Gorenberg,
The US House of Representatives is deeply concerned by ongoing and pervasive acts of antisemitic
harassment and intimidation at the Massachusetts Institute of Technology (MIT). Failing to act decisively to ensure a safe learning environment for all students would be a grave dereliction of your responsibilities as President of MIT and Chair of the MIT Corporation.
This Congress will not stand idly by and allow an environment hostile to Jewish students to persist. The House believes that your institution is in violation of Title VI of the Civil Rights Act, and the inability or
unwillingness to rectify this violation through action requires accountability.
Postsecondary education is a unique opportunity for students to learn and have their ideas and beliefs challenged. However, universities receiving hundreds of millions of federal funds annually have denied
students that opportunity and have been hijacked to become venues for the promotion of terrorism, antisemitic harassment and intimidation, unlawful encampments, and in some cases, assaults and riots.
The House of Representatives will not countenance the use of federal funds to indoctrinate students into hateful, antisemitic, anti-American supporters of terrorism. Investigations into campus antisemitism by the Committee on Education and the Workforce and the Committee on Ways and Means have been expanded into a Congress-wide probe across all relevant jurisdictions to address this national crisis. The undersigned Committees will conduct oversight into the use of federal funds at MIT and its learning environment under authorities granted to each Committee.
• The Committee on Education and the Workforce has been investigating your institution since December 7, 2023. The Committee has broad jurisdiction over postsecondary education, including its compliance with Title VI of the Civil Rights Act, campus safety concerns over disruptions to the learning environment, and the awarding of federal student aid under the Higher Education Act.
• The Committee on Oversight and Accountability is investigating the sources of funding and other support flowing to groups espousing pro-Hamas propaganda and engaged in antisemitic harassment and intimidation of students. The Committee on Oversight and Accountability is the principal oversight committee of the US House of Representatives and has broad authority to investigate “any matter” at “any time” under House Rule X.
• The Committee on Ways and Means has been investigating several universities since November 15, 2023, when the Committee held a hearing entitled From Ivory Towers to Dark Corners: Investigating the Nexus Between Antisemitism, Tax-Exempt Universities, and Terror Financing. The Committee followed the hearing with letters to those institutions on January 10, 202
2024.06.01 Introducing a competency framework for languag learning materials ...Sandy Millin
http://sandymillin.wordpress.com/iateflwebinar2024
Published classroom materials form the basis of syllabuses, drive teacher professional development, and have a potentially huge influence on learners, teachers and education systems. All teachers also create their own materials, whether a few sentences on a blackboard, a highly-structured fully-realised online course, or anything in between. Despite this, the knowledge and skills needed to create effective language learning materials are rarely part of teacher training, and are mostly learnt by trial and error.
Knowledge and skills frameworks, generally called competency frameworks, for ELT teachers, trainers and managers have existed for a few years now. However, until I created one for my MA dissertation, there wasn’t one drawing together what we need to know and do to be able to effectively produce language learning materials.
This webinar will introduce you to my framework, highlighting the key competencies I identified from my research. It will also show how anybody involved in language teaching (any language, not just English!), teacher training, managing schools or developing language learning materials can benefit from using the framework.
Introduction to AI for Nonprofits with Tapp NetworkTechSoup
Dive into the world of AI! Experts Jon Hill and Tareq Monaur will guide you through AI's role in enhancing nonprofit websites and basic marketing strategies, making it easy to understand and apply.
A Strategic Approach: GenAI in EducationPeter Windle
Artificial Intelligence (AI) technologies such as Generative AI, Image Generators and Large Language Models have had a dramatic impact on teaching, learning and assessment over the past 18 months. The most immediate threat AI posed was to Academic Integrity with Higher Education Institutes (HEIs) focusing their efforts on combating the use of GenAI in assessment. Guidelines were developed for staff and students, policies put in place too. Innovative educators have forged paths in the use of Generative AI for teaching, learning and assessments leading to pockets of transformation springing up across HEIs, often with little or no top-down guidance, support or direction.
This Gasta posits a strategic approach to integrating AI into HEIs to prepare staff, students and the curriculum for an evolving world and workplace. We will highlight the advantages of working with these technologies beyond the realm of teaching, learning and assessment by considering prompt engineering skills, industry impact, curriculum changes, and the need for staff upskilling. In contrast, not engaging strategically with Generative AI poses risks, including falling behind peers, missed opportunities and failing to ensure our graduates remain employable. The rapid evolution of AI technologies necessitates a proactive and strategic approach if we are to remain relevant.
Francesca Gottschalk - How can education support child empowerment.pptxEduSkills OECD
Francesca Gottschalk from the OECD’s Centre for Educational Research and Innovation presents at the Ask an Expert Webinar: How can education support child empowerment?
Francesca Gottschalk - How can education support child empowerment.pptx
Deep Web and Digital Investigations
1. Deep Web and Digital
Investigations
Damir Delija
Milano 2014
1
2. What we will talk about
• Web and “Deep Web”
• Web and documents
• Definitions
• Technical issues
• Forensic issues
• I’m not an expert on deep or dark web
• Discussion based on many sources and
references
3. Inaccessible Web
• Deep Web is a name for data inaccessible by
regular search engines on the Internet
• Deep Web sounds much better than
inaccessible
• Searchable / Accessible web is also called
surface web
• Dark web is part of www with illegal or
immoral content
• Dark web is not Deep Web it is part of it, but
dark pages are on the surface web too
4. Inaccessible Resources
• Inaccessible resources
– it exists but we don’t know about it or it’s location
– we can’t use it
• It is an old problem
– you have it, even in your own room
• Is there any solution ?
– idea from Gopher days, Veronica
– it works well with static pages and data
– abandoned in web days, becomes a source of tremendous
power and wealth for Search Engines
5. Web and Internet and Documents
• WWW is not the Internet ☺
– also full data or document space of each networked
computer is not part of the Internet
• WWW is hypertext document based structure
– we have links among documents
– a document is not necessarily a web page
– documents must have a presentation ability to be visible
through the web interface (transcription layer, often
dynamicaly generated)
– Links, web pages and documents can be static or
dynamically generated
– Dynamic documents are here because of volume of data
(can’t be organised in static pages)
Definitions are crucial in understandig deep and
surface web
6. Volume of Data
• For each document there is in average of 11
copies in the system
– enterprise measurements pre SAN calculation
• Shows how document space expands rapidly
• Even simple mail can cause data avalanches
• From sourface web point of view ?
• Mostly invisible
• From Deep Web point of view ?
• Data/documents copies are probably floating
around, inaccessible to us
7. Web and Search Engines
• Web can access material which is only
referenced by a link and is not access
protected
• Today mostly we assumes search engine span
equals web and Internet
• To be effective search engines must have pre
organised data to answer query
• Enormous changing volume of collected data
and propagation lag
http://en.wikipedia.org/wiki/List_of_search_engines
8. Deep Resources
• Deep Web depends on the method of how
search engines acquire and store data
• Web can be crawled or explored as link space
• Hints are cache, proxy, protocol traffic
• No clear boundary between deep resources
and surface resources
9. Uncollectible Resources
Deep Web Resources
• Dynamic Web Pages
– returns in response to a query or accessed only through a form
• Unlinked Contents
– Pages without any backlinks
• Private Web
– sites requiring registration and login (password-protected resources)
• Limited Access web
– Sites with captchas, no-cache pragma http headers
• Scripted Pages
– Page produced by javascript, Flash, AJAX etc
• Non HTML contents
– Multimedia files e.g. images or videos
10. Uncollectible Resources
Documents and Disk Space
• This comes close to e-discovery field
• Is this part of Deep Web ?
• Documents not in the web tree
• accessible only by direct filesystem access
• or by dedicated script effort
• Files generally on the web servers and no-web
servers machines
– accessible only by direct filesystem access
11. Forgotten Data
• From the security aspect, forgotten data is a
very interesting part of Deep Web
• What is forgotten data – maybe data without
custodian ?
• Verizon reported about big data breach from
2008,
– unknown data being part of data breach in 66% of
incidents
12. Data Lifecycle
• Data creation and circulation
• How to find data and correlate it
• Search engines
• Proxies
• Metadata, Logs , Feeds
• Very interesting ideas in “Programming
Collective Intelligence” By: Toby Segaran,
O'Reilly Media, August 16, 2007
13. Hidden Data in Surface web ?
• Web handles data available trough html and
extensions
• What about metadata and embedded data which
is not accessible for search engines ?
14. Surface Web and Deep Issues
• “Hidden Data in Internet Published Documents”
– deep forensic impact
• Specific data formats can have embedded
elements which is not visible to search engine
– like thumb views embeded in pictures
– exif data in images
– metadata in documents
– stego
15. Idea of Treasure Island
• What is not on the map is unknown
• Hiden as treasure island
• Idea of unexplored, uncharted with big gains ..
• Because of size idea of Iceberg
16. Why Deep Web Exists ?
• Why search engine fails?
– Technology
• Most of the web data is behind dynamically
generated pages (web gateways)
– Web crawler cannot reach them or data not announced
– Can only be obtained if we have access to the system
containing the information
– Forms have to populated with values
– understanding the semantic of the web gateway and
data behind it
17. Measuring the Deep Web
• How to measure – estimates are based on known
examples
• Try to generate pages based on known home pages
and explore the link space, based on hop distances
• First Attempt: Bergman (2000)
– Size of surface web is around 19 TB
– Size of Deep Web is around 7500 TB
– Deep Web is nearly 400 times larger than the Surface Web
• 2004 Mitesh classified the Deep Web more accurately
– Most of the html forms are two hops from the home page
18. Deep Web Size
Current Estimates 2014
• Deep Web about 7500 Terabytes
• Surface Web about 19 terabytes
• Deep Web has between 400 and 550 times more
public information than the Surface Web.
• 95% of the Deep Web is publically accessible
• More than 200,000 Deep Web sites currently exist.
• 550 billion documents on Deep Web
• 1 billion documents on Surface Web
19. History of Deep Web
• Start: static html pages, web crawlers can easily
reach, only few cgi-scripts
• In mid-90’s: Introduction of dynamic pages, page
generated as a result of a query or link access
• In 1994: Jill Ellsworth used the term “Invisible
Web” to refer to these websites.
• In 2001, Bergman coined it as “Deep Web”
• Dark web goes in parallel as crime start to spread
over the Internet
20. Rough Timeline
• 2001: Raghavan et al -> Hidden Web Exposure
– domain specific human assisted crawler
• 2002: Stumbleupon used Human Crawler
– human crawlers can find relevant links that algorithmic crawlers miss.
• 2003: Bergman introduced LexiBot
– used for quantifying the Deep Web
• 2004: Yahoo! Content Acquisition Program
– paid inclusion for webmasters
• 2005: Yahoo! Subscriptions
– Yahoo started searching subcription only sites
• 2005: Noulas et. al. -> Hidden Web Crawler
– automatically generated meaningful queries to issue against search form
• 2005: Google site map
– Allows webmasters to inform search engines about urls on their websites that
are available for crawling.
– Web 2.0 infrastructure
– Today Mobile device and Internet of things
– each gadget can have (and has) web server for configuration
22. From Digital Forensic Viewpoint
• Is there a way to carry out forensically sound
actions on Deep Web ?
• Can we apply standard digital forensic
procedures and best practices ?
• In both cases yes,
– we are always limited in digital forensics, but that
does not prevent reliable results
23. Web and Digital Forensic
• Web is web ☺
• Web artifacts are web artifacts
• The type of investigation determines how we
handle web data
– key element is: legal
• Many possible scenarios and situations
– follow the forensic principles and best practices as
in any other situation
– use scientific method
– test and experiment to prove method
24. Deep Web and Forensic Tasks
• How to prove access to Deep Web resources
– same as ordinary resources, because it is mostly
through browsers
– advantage over blind Deep Web access since there
are history, cache, log artifacts which shows which
Deep Web resource was accessed
• Deep Web artifacts
– Mostly like any other web artifacts
– Hidden Data in Internet Published Documents
– Dark web as a specific subrange
25. Forensic Tools Issues
• Forensics of specialised browsers and access tools
– Thor / onion
– Unusual browsers/accessing tools links, lynx, wget
– Other browsers 12P Freenet
• Key Question: Does our forensic framework
support such tools?
– Internet Evidence Finder
– Encase
– FTK
– If not how to handle artifacts and data ?
• What about mobile devices?
26. Conclusion and Questions
• Challenging field
• Size will grow with IPv6 take over and
“Internet of things” concept
• Cloud concept is important (size, acces, legal
isuses)
• Each new tehnology will add a new layer of
invisibility eg. complexity
• Size of available data simply force use of
dynamic web pages
27. References
Too many links ...
• http://papergirls.wordpress.com/2008/10/07/timeline-deep-
web
• http://deepwebtechblog.com/federated-search-finds-content-
that-google-can’t-reach-part-i-of-iii
• http://deepwebtechblog.com/a-federated-search-primer-
part-ii-of-iii
• http://googleblog.blogspot.com/2008/07/we-knew-web-
was-big.html
• http://www.online-college-blog.com/features/100-
useful-tips-and-tools-to-research-the-deep-web/