Presentation given at the 2nd International Workshop on Web Intelligence & Virtual Enterprises (WIVE'10) held at the 11th IFIP Working Conference on Virtual Enterprises (PRO-VE'10)
http://www.emse.fr/wive/
"Hadoop and NoSQL: Scalable Back-end Clusters Orchestration in Real-world Systems" was presented at CloudCon2012, BIT’s 1st Annual World Congress of Cloud Computing, held August 28-30, 2012 in Dalian, China
IBM held its first SmartCamp event in July in Germany. It was also the first SmartCamp with a specific focus on Big Data and Business Analytics.
Keynote Speaker Philippe Souidi, Founder of echofy.me and tecpunk, summarized this topic perfectly when he called Big Data the “Oil of the next Century”… fitting, isn’t it?
Bringing 3D, Ultra-Resolution, and Virtual Reality into the Global LambdaGrid ...
Larry Smarr
07.11.07
Keynote
ACM Virtual Reality Software and Technology (VRST)
Title: Bringing 3D, Ultra-Resolution, and Virtual Reality into the Global LambdaGrid Collaboratory
Newport, CA
This presentation is part of a lecture on networking and networking etiquette.
To request this lecture for your group send an email to info@evolutioncbl.com
Here's a medley including a description and a sample slide or two from my 7 most popular astronomy / night sky related presentations. Contact me if you'd like a live performance of any of these. I'll go anywhere, ride in coach-class, and sleep on a cot if it means I can heighten awareness about night sky preservation and/or climate change! NOTE: the title slide appears several times because it's a navigation slide -- it allows you to hyperlink to any show in the medley at any time. Enjoy!
BCC (2012): Federal Panel Identifying Future Government Needs
Duane Blackburn
The federal government held its annual Biometric Consortium Conference 18-20 September 2012. MITRE hosted a workshop during this conference to highlight FFRDC support to the federal biometrics enterprise. One panel in this workshop focused on identifying priorities that the federal government will not be able to address and/or sponsor, and that should be considered for attention by non-federal entities. This paper summarizes the priorities identified during this panel.
Cloud computing has been a growing research topic in recent years. The key concept of cloud computing is to provide a resource sharing model based on virtualization, distributed file systems, parallel algorithms and web services. But how can we provide a testbed for cloud computing related training courses? In this talk we will share our experience of building a cloud computing testbed for virtualization, high throughput computing and bioinformatics applications. It covers many open source projects, such as DRBL, Xen, Hadoop and bioinformatics related applications.
In short, Diskless Remote Boot in Linux (DRBL) provides a diskless or systemless environment for client machines. It works on Debian, Ubuntu, Mandriva, Red Hat, Fedora, CentOS and SuSE. DRBL uses distributed hardware resources and makes it possible for clients to fully access local hardware.
Xen is one of the open source hypervisors for the Linux kernel. It has been used in the Amazon EC2 production environment to provide cloud service model (1), "Infrastructure as a Service (IaaS)". In this talk, we will show you how DRBL can help with fast deployment of a Xen playground in the classroom.
Hadoop is becoming a well-known open source cloud computing technology developed by the Apache community. It is a very powerful tool for data mining. It has been used in Yahoo and Facebook production environments to provide cloud service model (2), "Platform as a Service (PaaS)". It’s easy to set up a single Hadoop node but difficult to manage a Hadoop cluster. In this talk, we will show you how DRBL can help with fast deployment and management.
Most bioinformatics applications are open source, such as R, Bioconductor, BLAST, Clustal, PipMaker, Phylip, etc. But they also require traditional cluster job submission. In this talk we will show you how DRBL can help to build a testbed for bioinformatics research and provide cloud service model (3), "Software as a Service (SaaS)". In this talk, we will cover how to:
- 1. Use DRBL to deploy Xen virtual cluster (drbl-xen)
- 2. Use DRBL to deploy Hadoop cluster (drbl-hadoop)
- 3. Use DRBL to deploy bioinformatics cluster (drbl-biocluster)
A live demonstration of drbl-hadoop and drbl-biocluster will also be given during the talk.
Dr. Edward (Eddie) Bortnikov (Senior Director of Research) @ Verizon Media:
Ingestion and queries of real-time data in Druid are performed by a core software component named Incremental Index (I^2).
I^2’s scalability is paramount to the speed of the ingested data becoming queryable as well as to the operational efficiency of the Druid cluster.
The current I^2 Implementation is based on the traditional ordered JDK key-value (KV-)map.
We present an experimental I^2 implementation that is based on a novel data structure named OakMap - a scalable thread-safe off-heap KV-map for Big Data applications in Java.
With OakMap, I^2 can ingest data at almost 2x speed while using 30% less RAM.
The project is expected to become GA in 2020.
Towards CloudML, a Model-Based Approach to Provision Resources in the Clouds
Sébastien Mosser
The Cloud-computing paradigm advocates the use of resources available “in the clouds”. Faced with the multiplicity of cloud providers, it becomes cumbersome to tackle this heterogeneity manually. In this paper, we propose to define an abstraction layer used to model resources available in the clouds. This cloud modelling language (CloudML) allows cloud users to focus on their needs, i.e., modelling the resources they expect to retrieve in the clouds. An automated provisioning engine is then used to analyse these requirements and actually provision resources in clouds. The approach is implemented, and was tested on prototypical examples to provision resources in major public clouds (e.g., Amazon EC2 and Rackspace).
Kave Salamatian, Universite de Savoie and Eiko Yoneki, University of Cambridg...
i_scienceEU
Network of Excellence Internet Science Summer School. The theme of the summer school is "Internet Privacy and Identity, Trust and Reputation Mechanisms".
More information: http://www.internet-science.eu/
The Art of the Pitch: WordPress Relationships and Sales
Laura Byrne
Clients don’t know what they don’t know. What web solutions are right for them? How does WordPress come into the picture? How do you make sure you understand scope and timeline? What do you do if something changes?
All these questions and more will be explored as we talk about matching clients’ needs with what your agency offers without pulling teeth or pulling your hair out. Practical tips, and strategies for successful relationship building that leads to closing the deal.
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Inflectra
In this insightful webinar, Inflectra explores how artificial intelligence (AI) is transforming software development and testing. Discover how AI-powered tools are revolutionizing every stage of the software development lifecycle (SDLC), from design and prototyping to testing, deployment, and monitoring.
Learn about:
• The Future of Testing: How AI is shifting testing towards verification, analysis, and higher-level skills, while reducing repetitive tasks.
• Test Automation: How AI-powered test case generation, optimization, and self-healing tests are making testing more efficient and effective.
• Visual Testing: Explore the emerging capabilities of AI in visual testing and how it's set to revolutionize UI verification.
• Inflectra's AI Solutions: See demonstrations of Inflectra's cutting-edge AI tools like the ChatGPT plugin and Azure Open AI platform, designed to streamline your testing process.
Whether you're a developer, tester, or QA professional, this webinar will give you valuable insights into how AI is shaping the future of software delivery.
Key Trends Shaping the Future of Infrastructure.pdf
Cheryl Hung
Keynote at DIGIT West Expo, Glasgow on 29 May 2024.
Cheryl Hung, ochery.com
Sr Director, Infrastructure Ecosystem, Arm.
The key trends across hardware, cloud and open-source; exploring how these areas are likely to mature and develop over the short and long-term, and then considering how organisations can position themselves to adapt and thrive.
Securing your Kubernetes cluster_ a step-by-step guide to success !
KatiaHIMEUR1
Today, after several years of existence, an extremely active community and an ultra-dynamic ecosystem, Kubernetes has established itself as the de facto standard in container orchestration. Thanks to a wide range of managed services, it has never been so easy to set up a ready-to-use Kubernetes cluster.
However, this ease of use means that the subject of security in Kubernetes is often left for later, or even neglected. This exposes companies to significant risks.
In this talk, I'll show you step-by-step how to secure your Kubernetes cluster for greater peace of mind and reliability.
Accelerate your Kubernetes clusters with Varnish Caching
Thijs Feryn
A presentation about the usage and availability of Varnish on Kubernetes. This talk explores the capabilities of Varnish caching and shows how to use the Varnish Helm chart to deploy it to Kubernetes.
This presentation was delivered at K8SUG Singapore. See https://feryn.eu/presentations/accelerate-your-kubernetes-clusters-with-varnish-caching-k8sug-singapore-28-2024 for more details.
DevOps and Testing slides at DASA Connect
Kari Kakkonen
Slides by Rik Marselis and me from the DASA Connect conference on 30.5.2024. We discuss what testing is, then what agile testing is, and finally what testing in DevOps is. We closed with a lovely workshop in which the participants tried to find different ways to think about quality and testing in different parts of the DevOps infinity loop.
Connector Corner: Automate dynamic content and events by pushing a button
DianaGray10
Here is something new! In our next Connector Corner webinar, we will demonstrate how you can use a single workflow to:
Create a campaign using Mailchimp with merge tags/fields
Send an interactive Slack channel message (using buttons)
Have the message received by managers and peers along with a test email for review
But there’s more:
In a second workflow supporting the same use case, you’ll see:
Your campaign sent to target colleagues for approval
If the “Approve” button is clicked, a Jira/Zendesk ticket is created for the marketing design team
But—if the “Reject” button is pushed, colleagues will be alerted via Slack message
Join us to learn more about this new, human-in-the-loop capability, brought to you by Integration Service connectors.
And...
Speakers:
Akshay Agnihotri, Product Manager
Charlie Greenberg, Host
Epistemic Interaction - tuning interfaces to provide information for AI support
Alan Dix
Paper presented at SYNERGY workshop at AVI 2024, Genoa, Italy. 3rd June 2024
https://alandix.com/academic/papers/synergy2024-epistemic/
As machine learning integrates deeper into human-computer interactions, the concept of epistemic interaction emerges, aiming to refine these interactions to enhance system adaptability. This approach encourages minor, intentional adjustments in user behaviour to enrich the data available for system learning. This paper introduces epistemic interaction within the context of human-system communication, illustrating how deliberate interaction design can improve system understanding and adaptation. Through concrete examples, we demonstrate the potential of epistemic interaction to significantly advance human-computer interaction by leveraging intuitive human communication strategies to inform system design and functionality, offering a novel pathway for enriching user-system engagements.
Essentials of Automations: Optimizing FME Workflows with Parameters
Safe Software
Are you looking to streamline your workflows and boost your projects’ efficiency? Do you find yourself searching for ways to add flexibility and control over your FME workflows? If so, you’re in the right place.
Join us for an insightful dive into the world of FME parameters, a critical element in optimizing workflow efficiency. This webinar marks the beginning of our three-part “Essentials of Automation” series. This first webinar is designed to equip you with the knowledge and skills to utilize parameters effectively: enhancing the flexibility, maintainability, and user control of your FME projects.
Here’s what you’ll gain:
- Essentials of FME Parameters: Understand the pivotal role of parameters, including Reader/Writer, Transformer, User, and FME Flow categories. Discover how they are the key to unlocking automation and optimization within your workflows.
- Practical Applications in FME Form: Delve into key user parameter types including choice, connections, and file URLs. Allow users to control how a workflow runs, making your workflows more reusable. Learn to import values and deliver the best user experience for your workflows while enhancing accuracy.
- Optimization Strategies in FME Flow: Explore the creation and strategic deployment of parameters in FME Flow, including the use of deployment and geometry parameters, to maximize workflow efficiency.
- Pro Tips for Success: Gain insights on parameterizing connections and leveraging new features like Conditional Visibility for clarity and simplicity.
We’ll wrap up with a glimpse into future webinars, followed by a Q&A session to address your specific questions surrounding this topic.
Don’t miss this opportunity to elevate your FME expertise and drive your projects to new heights of efficiency.
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
BookNet Canada
The publishing industry has been selling digital audiobooks and ebooks for over a decade and has found its groove. What’s changed? What has stayed the same? Where do we go from here? Join a group of leading sales peers from across the industry for a conversation about the lessons learned since the popularization of digital books, best practices, digital book supply chain management, and more.
Link to video recording: https://bnctechforum.ca/sessions/selling-digital-books-in-2024-insights-from-industry-leaders/
Presented by BookNet Canada on May 28, 2024, with support from the Department of Canadian Heritage.
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Albert Hoitingh
In this session I delve into the encryption technology used in Microsoft 365 and Microsoft Purview. Including the concepts of Customer Key and Double Key Encryption.
State of ICS and IoT Cyber Threat Landscape Report 2024 preview
Prayukth K V
The IoT and OT threat landscape report has been prepared by the Threat Research Team at Sectrio using data from Sectrio, cyber threat intelligence farming facilities spread across over 85 cities around the world. In addition, Sectrio also runs AI-based advanced threat and payload engagement facilities that serve as sinks to attract and engage sophisticated threat actors, and newer malware including new variants and latent threats that are at an earlier stage of development.
The latest edition of the OT/ICS and IoT security Threat Landscape Report 2024 also covers:
State of global ICS asset and network exposure
Sectoral targets and attacks as well as the cost of ransom
Global APT activity, AI usage, actor and tactic profiles, and implications
Rise in volumes of AI-powered cyberattacks
Major cyber events in 2024
Malware and malicious payload trends
Cyberattack types and targets
Vulnerability exploit attempts on CVEs
Attacks on counties – USA
Expansion of bot farms – how, where, and why
In-depth analysis of the cyber threat landscape across North America, South America, Europe, APAC, and the Middle East
Why are attacks on smart factories rising?
Cyber risk predictions
Axis of attacks – Europe
Systemic attacks in the Middle East
Download the full report from here:
https://sectrio.com/resources/ot-threat-landscape-reports/sectrio-releases-ot-ics-and-iot-security-threat-landscape-report-2024/
1. "How I Learned to Stop Worrying and Love the Bomb"
WIVE 2010
2. An unhinged computer scientist, known
as TimBL, has invented the WWW,
plunging the world into an information
vortex…
Now everybody is fighting to prevent
the knowledge apocalypse...
Recently, Dr. StrangeCloud, a former
mainframe virtualization specialist, has
been called to the rescue…
(story inspired by Kubrick's Dr. Strangelove)
3. Is Dr. Strangecloud going to save the
planet from the ever increasing danger of
death by information overload ?
4.
5. Web Intelligence
within/for VE
Virtual Organiza
resource sharing
Cloud based WI for VE
6. Looking for YACCP ?
Yet Another
Cloud Computing
Presentation ?
You'd better check the specialists instead…
8. as a research &
application field
(COMPSAC 2000, Taiwan)
9. Crossing hot topics
Artificial Intelligence
web mining
semantic web
web information retrieval
cloud computing (our focus today)
Information Technology
WEB INTELLIGENCE
10. Hot, mild or cold ?
(based on Wikipedia article popularity)
[Chart: 6 month trends for Wikipedia pages Web Intelligence, Cloud Computing and Lady Gaga; cumulated Wikipedia page views, Jan => June 2010; logarithmic axis from 100 to 10 000 000]
Lady Gaga: 11 344 529
Cloud Computing: 1 911 127
Web Intelligence: 1 632
(source : access statistics from wikipedia's squid cluster as compiled by http://stats.grok.se/)
11. But hot local
Web Intelligence
recipe
www.web-intelligence-rhone-alpes.org
12.
13. cloud based domain knowledge
repository enrichment
(use case in FP7 project proposal)
[Diagram components]
web crawler: millions crawls/month
semantic extractor
manual input: 2.5M triples/year
triple store: initially 90M triples
publication in LOD cloud
14. UKWA
by The British Library
crawl, annotate, preserve
visual analysis & navigation
powered by
IBM BigSheets
on British Library private cloud
(demo on www.webarchive.org.uk/analytics/analytics.htm)
(details on news.cnet.com/8301-13846_3-10459507-62.html)
15. Public Terabyte Dataset
by Bixo Labs
50-200M pages from the 1M top US domains
powered by Hadoop, Bixo, Tika, Avro, Cascading
on AWS cloud: SimpleDB, Elastic MapReduce, S3
not yet available (09/2010)
big corpus ready for AWS based analysis
(WI research, evaluation, ...)
16.
17. the Web Intelligence paradox
All the Web data is at hand, ready for WI research and applications
2 simple steps :
pick up the data
process it with all those marvelous ML algorithms...
Wait a minute, it's not that simple ! What about :
politeness ?
scale ?
heterogeneity ? (aka "crappiness")
copyright ?
18. Use the Semantic Web?
Looking for semantic annotations in 82k web pages
(Squido production systems, 01/2010)
less than 3%
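A measurement like the one on this slide can be approximated with a small script. The sketch below is not the method used on the Squido systems, which the slide does not describe. It assumes that "semantic annotation" means RDFa attributes, microdata attributes, or a few classic microformat class names. The attribute and class lists, and the names `has_semantic_annotations` and `annotated_share`, are illustrative choices only.

```python
from html.parser import HTMLParser

# Assumed markers of embedded semantic markup (illustrative, not exhaustive).
RDFA_ATTRS = {"typeof", "property", "vocab", "about"}
MICRODATA_ATTRS = {"itemscope", "itemtype", "itemprop"}
MICROFORMAT_CLASSES = {"vcard", "vevent", "hentry", "hreview"}


class AnnotationDetector(HTMLParser):
    """Sets `found` to True once any start tag carries an assumed marker."""

    def __init__(self):
        super().__init__()
        self.found = False

    def handle_starttag(self, tag, attrs):
        if self.found:
            return
        for name, value in attrs:
            if name in RDFA_ATTRS or name in MICRODATA_ATTRS:
                self.found = True
            elif name == "class" and value:
                # Microformats are expressed through class names.
                if MICROFORMAT_CLASSES & set(value.lower().split()):
                    self.found = True


def has_semantic_annotations(html):
    """Return True if the page contains at least one assumed marker."""
    detector = AnnotationDetector()
    detector.feed(html)
    detector.close()
    return detector.found


def annotated_share(pages):
    """Fraction of pages (HTML strings) with at least one annotation."""
    pages = list(pages)
    if not pages:
        return 0.0
    return sum(has_semantic_annotations(p) for p in pages) / len(pages)
```

Calling `annotated_share` on a list of crawled HTML strings gives the kind of ratio quoted on the slide. The result depends heavily on which markers are counted.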
19. kind of real world WI process
input: millions pages
steps: crawl, clean (ML, ...), process (ML, ...)
needs: dedicated bandwidth, lots of memory, lots of i/o's, lots of threads, lots of CPU
21. Load may scale up or
down considerably with crawl size
when
testing/calibrating
consider
Cloud Computing
in production
if no crawl limits
22. 1 .45 automatic.
2 boxes of ammunition.
4 days' concentrated emergency rations.
1 drug issue containing antibiotics, morphine, vitamin pills, pep pills, sleeping pills, tranquilizer pills.
1 miniature combination Russian phrase book and Bible.
100 dollars in rubles.
100 dollars in gold.
9 packs of chewing gum.
1 issue of prophylactics.
3 lipsticks.
3 pairs of nylon stockings.
23. Build from other's
Top 10 Lessons Learned from Deploying Hadoop in a Private Cloud
(Rod Cope, OpenLogic's CTO, CloudSlam'10)
24.
25. "Cloud computing is a trap"
warns GNU founder Richard Stallman
"It's stupidity.
It's worse than
stupidity: it's a
marketing hype
campaign."
(www.guardian.co.uk/technology/2008/sep/29/cloud.computing.richard.stallman)
=> we can still consider private cloud+OSS
26. web-scale
distributed crawl OSS
not mature
(Heritrix Cluster Controller build server exception)
Cloud OSS on the rise
(www.blackducksoftware.com/oss/projects/#cloud)
OSS stack for DC/DML
under active
development
28. Crawling
is the launch pad
in Web Intelligence
Don't take it easy !
Get yourself
a decent crawler
29. Crawling by millions
is not trivial...
many large objects in memory : transient ? persistent ?
www crappiness means endless ugly special cases
customizable revisit policy ?
politeness is challenging
30. DDOS is around the corner
with (poor) cloud based crawling
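One way to picture the risk: every extra cloud worker that fetches from the same site multiplies the load on that site unless a per-host delay is enforced crawl-wide. The sketch below is a minimal per-host politeness gate, not the crawler used in this talk. The class name `PoliteScheduler`, the 30 second default delay, and passing the clock in as `now` are assumptions made for illustration.

```python
from urllib.parse import urlsplit


class PoliteScheduler:
    """Allows at most one fetch per host every `min_delay_s` seconds."""

    def __init__(self, min_delay_s=30.0):
        self.min_delay_s = min_delay_s   # assumed delay, tune per site
        self._next_ok = {}               # host -> earliest allowed fetch time

    @staticmethod
    def _host(url):
        return urlsplit(url).hostname or ""

    def wait_time(self, url, now):
        """Seconds to wait before this URL's host may be fetched again."""
        return max(0.0, self._next_ok.get(self._host(url), 0.0) - now)

    def try_acquire(self, url, now):
        """Return True and reserve the host if a fetch is allowed at `now`."""
        host = self._host(url)
        if now < self._next_ok.get(host, 0.0):
            return False
        self._next_ok[host] = now + self.min_delay_s
        return True
```

In a multi-machine crawl this table would have to be shared, or URLs partitioned by host so that each host is owned by exactly one worker. Otherwise each worker applies the delay on its own and the target site still sees the combined load.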
31. Infrastructure is not always key to perfs
Organic effect of politeness on performance:
fetch rate drops over time
(ken-blog.krugler.org)
1,264,539 URLs from 41,978 unique domains
10 slaves cluster
4000 active fetch threads max
brute force
opportunity to scale down !
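A back-of-the-envelope model shows why more hardware does not help once politeness is the limit. This is a simplified sketch and not the analysis behind the cited chart. It assumes one request per domain every `crawl_delay_s` seconds and an average of `fetch_time_s` seconds per request. Both values below are hypothetical. Only the domain and thread counts come from the slide.

```python
import math


def fetch_rate_ceiling(active_domains, threads, crawl_delay_s, fetch_time_s):
    """Upper bound on pages/second under per-domain politeness."""
    by_politeness = active_domains / crawl_delay_s  # one hit per domain per delay
    by_threads = threads / fetch_time_s             # what the threads could sustain
    return min(by_politeness, by_threads)


def threads_needed(active_domains, crawl_delay_s, fetch_time_s):
    """Threads that are enough to reach the politeness ceiling."""
    return math.ceil(active_domains * fetch_time_s / crawl_delay_s)


# Slide figures: 41,978 domains, 4000 threads. Delay and fetch time are assumed.
start = fetch_rate_ceiling(41_978, 4000, crawl_delay_s=30.0, fetch_time_s=1.0)
late = fetch_rate_ceiling(1_000, 4000, crawl_delay_s=30.0, fetch_time_s=1.0)
print(f"ceiling with all domains active: {start:.0f} pages/s")
print(f"ceiling with 1,000 domains left: {late:.0f} pages/s")
print("threads needed late in the crawl:", threads_needed(1_000, 30.0, 1.0))
```

Under these assumptions the ceiling is set by the number of domains that still have URLs to fetch, so it falls as small domains are exhausted. Most of the 4000 threads then sit idle, which is one reading of the "opportunity to scale down".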
32. a. Cloud Computing is worth considering for WI
b. Have a cloud survival kit
c. Consider private cloud & OSS
d. Compare prices
e. Get yourself a decent crawler
f. Don't turn into DDOS
g. Infrastructure is not always key to perfs
33. "SaaS intelligence on web data, for professionals"
collect, filter, analyse, monitor, share
www.squido.fr
35. Photos:
1. National Nuclear Security Administration/Nevada Site Office
2. Dr. Strangelove/Original film poster by Tomi Ungerer
3. Dr. Strangelove/movie still
4. Dr. Strangelove/movie still
6. cloudslam10.com/Gartner keynote slide, National Institute of Standards and Technology web site screenshot
7. cia.gov/OHB lobby seal picture
8. amazon.com/Computational Web Intelligence book cover
10. Wikimedia Commons/Lady Gaga by petercruise
12. Wikimedia Commons/Operation Crossroads Baker in color.jpg
13. Linking Open Data cloud diagram, by Richard Cyganiak and Anja Jentzsch. http://lod-cloud.net/
14. flickr/British Library III/jovike, ibm.com/The_British_Library_and_IBM_Bi.jpg
16. Dr. Strangelove/movie still
21. Wikimedia Commons/Castle Bravo Blast.jpg
22. Dr. Strangelove/movie still
23. cloudslam10.com/OpenLogic slide
24. Dr. Strangelove/movie still
25. Wikimedia Commons/RMS iGNUcius techfest iitb.JPG
27. cloudslam10.com/OpenLogic slide
28. Wikimedia Commons/Peacekeeper_missile_after_silo_launch.jpg
31. kkrugler.files.wordpress.com/2009/05/fetch-performance2.png
32. Dr. Strangelove/movie still
Websites:
wikipedia.org
www.emse.fr/wive/
csrc.nist.gov
cloudslam10.com
www.web-intelligence-rhone-alpes.org
stats.grok.se
www.ibm.com/software/ebusiness/jstart/bigsheets
bixolabs.com/datasets/public-terabyte-dataset-project
www.openlogic.com
www.blackducksoftware.com
crawler.archive.org
www.apache.org
twitter.com
ken-blog.krugler.org