The document summarizes how OpenCorporates built the largest open database of companies in the world. They started with just three countries and 3 million companies, importing public data from official registers. Their goal is to have an entry and URI for every corporate entity globally. The database is open, free to use, and indexed basic company information as well as directors and other linked data for advanced searching and matching capabilities. It is built using open-source tools and aims to contribute back to the open data community while exploring commercial services.
Workshop "Open Data 4 Start-up", organizzato dall'Associazione Luoghi di Relazione in collaborazione con TOP-IX all'interno del Digital Experience Festival - 30 maggio 2012 - Intervento di Massimo Zaglio (Open Data Ninja, Consorzio TOP-IX) e di Saverino Reale (Open Data Specialist, CSI Piemonte)
M12S07 - Retention & ESI - Paths to Success - Part TwoMER Conference
From MER Conference 2012
Speakers: Christine Burns and Carol Stainbrook
This session explains "why" your organization's technology selections impact "how" the updated retention schedules described in part one of this two-part session can be applied to electronically stored information (ESI). Learn reasonable and actionable approaches for embedding retention policies into e-mail, file shares and enterprise applications.
This session will address:
- Why "perfection" is often impractical, when it comes to applying retention policy to ESI and some reasonable alternatives to perfection.
- How the technologies for email, file shares, and other ESI affect the implementation of retention policies.
- When it may be necessary to choose different retention strategies for different technologies such e-mail, file shares and enterprise applications.
- Considerations for applying retention policy to data in enterprise applications.
- Criteria to help prioritize where to begin when applying retention policy.
In this session you will learn how to tailor your organization's approach to retention schedules so they are reasonable, actionable and result in the orderly destruction of eligible information, given your organization's technology selections.
A brief talk about the recent endeavour to introduce and encourage code sharing among participants and the community of the MediaEval Benchmark Initiative.
Looking at INSPIRE from an Open Source obsessed SMEsmespire
Presentation at the INSPIRE Workshop "Concrete steps to implement INSPIRE: synergies between the public and the private sector" - Florence, 24th June 2013
Learn from the Experts: The Do's and Don'ts of Data CollectionIQPC Exchange
George Rudoy, Founder and CEO of Integrated Legal Technology LLC, joins Legal IQ to reflect on some of the discussions at the Information Retention and e-Disclosure
Management Conference 2011 on data privacy, retention and preservation, and destruction in
various jurisdictions.
A complete introduction to open data in the context of local transportation, including definitions, examples, rationales, implementation challenges and guidelines.
Presentation given at Open Knowledge Festival, Helsinki, Sept 2012. Focuses on benefits to business to publishing open data, and examines business model of OpenCorporates, the largest open database of companies in the world
Talk at IEEE Big Data/Cloud conference in Santa Clara, June 28th, 2013.Jari Koister
Sharing why it is hard to succeed with Big Data/Predictive projects in terms of productionalizing them what you can do to reduce risk while take is steps in the right direction.
EDF2014: Ralf-Peter Schaefer, Head of Traffic Product Unit, TomTom, Germany: ...European Data Forum
Industry Keynote Talk by Ralf-Peter Schaefer, Head of Traffic Product Unit, TomTom, Germany at the European Data Forum 2014, 20 March 2014 in Athens, Greece: Probe Data Analytics and Processing for Traffic Information, Traffic Planning and Traffic Management.
Workshop "Open Data 4 Start-up", organizzato dall'Associazione Luoghi di Relazione in collaborazione con TOP-IX all'interno del Digital Experience Festival - 30 maggio 2012 - Intervento di Massimo Zaglio (Open Data Ninja, Consorzio TOP-IX) e di Saverino Reale (Open Data Specialist, CSI Piemonte)
M12S07 - Retention & ESI - Paths to Success - Part TwoMER Conference
From MER Conference 2012
Speakers: Christine Burns and Carol Stainbrook
This session explains "why" your organization's technology selections impact "how" the updated retention schedules described in part one of this two-part session can be applied to electronically stored information (ESI). Learn reasonable and actionable approaches for embedding retention policies into e-mail, file shares and enterprise applications.
This session will address:
- Why "perfection" is often impractical, when it comes to applying retention policy to ESI and some reasonable alternatives to perfection.
- How the technologies for email, file shares, and other ESI affect the implementation of retention policies.
- When it may be necessary to choose different retention strategies for different technologies such e-mail, file shares and enterprise applications.
- Considerations for applying retention policy to data in enterprise applications.
- Criteria to help prioritize where to begin when applying retention policy.
In this session you will learn how to tailor your organization's approach to retention schedules so they are reasonable, actionable and result in the orderly destruction of eligible information, given your organization's technology selections.
A brief talk about the recent endeavour to introduce and encourage code sharing among participants and the community of the MediaEval Benchmark Initiative.
Looking at INSPIRE from an Open Source obsessed SMEsmespire
Presentation at the INSPIRE Workshop "Concrete steps to implement INSPIRE: synergies between the public and the private sector" - Florence, 24th June 2013
Learn from the Experts: The Do's and Don'ts of Data CollectionIQPC Exchange
George Rudoy, Founder and CEO of Integrated Legal Technology LLC, joins Legal IQ to reflect on some of the discussions at the Information Retention and e-Disclosure
Management Conference 2011 on data privacy, retention and preservation, and destruction in
various jurisdictions.
A complete introduction to open data in the context of local transportation, including definitions, examples, rationales, implementation challenges and guidelines.
Presentation given at Open Knowledge Festival, Helsinki, Sept 2012. Focuses on benefits to business to publishing open data, and examines business model of OpenCorporates, the largest open database of companies in the world
Talk at IEEE Big Data/Cloud conference in Santa Clara, June 28th, 2013.Jari Koister
Sharing why it is hard to succeed with Big Data/Predictive projects in terms of productionalizing them what you can do to reduce risk while take is steps in the right direction.
EDF2014: Ralf-Peter Schaefer, Head of Traffic Product Unit, TomTom, Germany: ...European Data Forum
Industry Keynote Talk by Ralf-Peter Schaefer, Head of Traffic Product Unit, TomTom, Germany at the European Data Forum 2014, 20 March 2014 in Athens, Greece: Probe Data Analytics and Processing for Traffic Information, Traffic Planning and Traffic Management.
EDF2014: BIG - NESSI Networking Session: Edward Curry, National University of...European Data Forum
BIG - NESSI Networking Session, Talk by Edward Curry, National University of Ireland Galway at the European Data Forum 2014, 20 March 2014 in Athens, Greece: The Big Data Value Chain.
EDF2014: BIG - NESSI Networking Session: Nuria de Lama, Representative to the...European Data Forum
BIG - NESSI Networking Session, Talk by Nuria de Lama, Representative to the European Commission, Research & Innovation ATOS, Spain at the European Data Forum 2014, 20 March 2014 in Athens, Greece: Towards a Big Data Public Private Partnership
EDF2014: Kush Wadhwa, Senior Partner, Trilateral Research & Consulting: Addre...European Data Forum
Selected Talk by Kush Wadhwa, Senior Partner, Trilateral Research & Consulting at the European Data Forum 2014, 20 March 2014 in Athens, Greece: Addressing risks and opportunities engendered by big data: The BYTE project
EDF2014: Adrian Cristal, Barcelona Supercomputing Center, RETHINK big Project...European Data Forum
Selected Talk by Adrian Cristal, Barcelona Supercomputing Center, Spain at the European Data Forum 2014, 20 March 2014 in Athens, Greece: The RETHINK big Project NEEDS YOU.
EDF2014: Dimitris Vassiliadis, Head of Unit, EXUS Innovation Attractor: From ...European Data Forum
Selected Talk by Dimitris Vassiliadis, Head of Unit, EXUS Innovation Attractor at the European Data Forum 2014, 20 March 2014 in Athens, Greece: From Carbon to Diamonds: Business cases of data value.
EDF2014: Rüdiger Eichin, Research Manager at SAP AG, Germany: Deriving Value ...European Data Forum
Selected Talk by Rüdiger Eichin, Research Manager at SAP AG, Germany at the European Data Forum 2014, 20 March 2014 in Athens, Greece: Deriving Value from Big Data for Enterprise Performance Management.
EDF2014: Paul Groth, Department of Computer Science & The Network Institute, ...European Data Forum
Invited Talk by Paul Groth, Department of Computer Science & The Network Institute, VU University Amsterdam, Netherlands at the European Data Forum 2014, 20 March 2014 in Athens, Greece: Open PHACTS: A Data Platform for Drug Discovery.
EDF2014: Christian Lindemann, Wolters Kluwer Germany & Christian Dirschl, Wol...European Data Forum
Invited Talk by Christian Lindemann, Wolters Kluwer Germany & Christian Dirschl, Wolters Kluwer Germany at the European Data Forum 2014, 20 March 2014 in Athens, Greece: Linked Data and Open Government Data as part of the business strategy of Wolters Kluwer Germany.
EDF2014: Marta Nagy-Rothengass, Head of Unit Data Value Chain, Directorate Ge...European Data Forum
PPP on Data & Executive Panel on Big Data, Introduction by Marta Nagy-Rothengass, Head of Unit Data Value Chain, Directorate General for Communications Networks, Content and Technology at the European Data Forum 2014, 20 March 2014 in Athens, Greece: Towards a Data Value Chain Partership in Europe.
EDF2014: Stefan Wrobel, Institute Director, Fraunhofer IAIS / Member of the b...European Data Forum
Opening Keynote by Stefan Wrobel, Institute Director, Fraunhofer IAIS / Member of the board of BITKOM working group Big Data at the European Data Forum 2014, 19 March 2014 in Athens, Greece: Value of Big Data - From Data-Driven Enterprises to a Data-driven Economy
EDF2014: Michele Vescovi, Researcher, Semantic & Knowledge Innovation Lab, It...European Data Forum
Selected Talk by Michele Vescovi, Researcher, Semantic & Knowledge Innovation Lab, Italy at the European Data Forum 2014, 19 March 2014 in Athens, Greece: Toward Personal Big Data passing through user Transparency, Control and Awareness: a Living-Lab Experience
EDF2014: Allan Hanbury, Senior Researcher, Vienna University of Technology, A...European Data Forum
Selected Talk by Allan Hanbury, Senior Researcher, Vienna University of Technology, Austria at the European Data Forum 2014, 19 March 2014 in Athens, Greece: Conquering Data in Austria: a technology roadmap
EDF2014: Nikolaos Loutas, Manager at PwC Belgium, Business Models for Linked ...European Data Forum
Selected Talk by Nikolaos Loutas, Manager at PwC Belgium at the European Data Forum 2014, 19 March 2014 in Athens, Greece: Business Models for Linked Government Data: What lies beneath?
EDF2014: Vedran Sabol, Head of the Knowledge Visualisation Area, Know-Center,...European Data Forum
Selected Talk by Vedran Sabol, Head of the Knowledge Visualisation Area, Know-Center, Austria at the European Data Forum 2014, 19 March 2014 in Athens, Greece: CODE - Linked Data in Context: Questions Matter
EDF2014: Daniel Vila-Suero, Researcher, Ontology Engineering Group, Universid...European Data Forum
Selected Talk of Daniel Vila-Suero, Researcher, Ontology Engineering Group, Universidad Politecnica de Madrid, Spain at the European Data Forum 2014, 19 March 2014 in Athens, Greece: 3LD: Towards high quality, industry-ready Linguistic Linked Licensed Data
EDF2014: Piek Vossen, Professor Computational Lexicology, VU University Amste...European Data Forum
Invited Talk of Piek Vossen, Professor Computational Lexicology, VU University Amsterdam, Netherlands at the European Data Forum 2014, 19 March 2014 in Athens, Greece: NewsReader: recording history by processing massive streams of daily news
EDF2014: Taru Rastas, Senior Advisor, Ministry of Communications of Finland: ...European Data Forum
Selected Talk of Taru Rastas, Senior Advisor, Ministry of Communications of Finland at the European Data Forum 2014, 19 March 2014 in Athens, Greece: Open data for transport and communications
Climate Impact of Software Testing at Nordic Testing DaysKari Kakkonen
My slides at Nordic Testing Days 6.6.2024
Climate impact / sustainability of software testing discussed on the talk. ICT and testing must carry their part of global responsibility to help with the climat warming. We can minimize the carbon footprint but we can also have a carbon handprint, a positive impact on the climate. Quality characteristics can be added with sustainability, and then measured continuously. Test environments can be used less, and in smaller scale and on demand. Test techniques can be used in optimizing or minimizing number of tests. Test automation can be used to speed up testing.
GridMate - End to end testing is a critical piece to ensure quality and avoid...ThomasParaiso2
End to end testing is a critical piece to ensure quality and avoid regressions. In this session, we share our journey building an E2E testing pipeline for GridMate components (LWC and Aura) using Cypress, JSForce, FakerJS…
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024Albert Hoitingh
In this session I delve into the encryption technology used in Microsoft 365 and Microsoft Purview. Including the concepts of Customer Key and Double Key Encryption.
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...Neo4j
Leonard Jayamohan, Partner & Generative AI Lead, Deloitte
This keynote will reveal how Deloitte leverages Neo4j’s graph power for groundbreaking digital twin solutions, achieving a staggering 100x performance boost. Discover the essential role knowledge graphs play in successful generative AI implementations. Plus, get an exclusive look at an innovative Neo4j + Generative AI solution Deloitte is developing in-house.
The Art of the Pitch: WordPress Relationships and SalesLaura Byrne
Clients don’t know what they don’t know. What web solutions are right for them? How does WordPress come into the picture? How do you make sure you understand scope and timeline? What do you do if sometime changes?
All these questions and more will be explored as we talk about matching clients’ needs with what your agency offers without pulling teeth or pulling your hair out. Practical tips, and strategies for successful relationship building that leads to closing the deal.
Removing Uninteresting Bytes in Software FuzzingAftab Hussain
Imagine a world where software fuzzing, the process of mutating bytes in test seeds to uncover hidden and erroneous program behaviors, becomes faster and more effective. A lot depends on the initial seeds, which can significantly dictate the trajectory of a fuzzing campaign, particularly in terms of how long it takes to uncover interesting behaviour in your code. We introduce DIAR, a technique designed to speedup fuzzing campaigns by pinpointing and eliminating those uninteresting bytes in the seeds. Picture this: instead of wasting valuable resources on meaningless mutations in large, bloated seeds, DIAR removes the unnecessary bytes, streamlining the entire process.
In this work, we equipped AFL, a popular fuzzer, with DIAR and examined two critical Linux libraries -- Libxml's xmllint, a tool for parsing xml documents, and Binutil's readelf, an essential debugging and security analysis command-line tool used to display detailed information about ELF (Executable and Linkable Format). Our preliminary results show that AFL+DIAR does not only discover new paths more quickly but also achieves higher coverage overall. This work thus showcases how starting with lean and optimized seeds can lead to faster, more comprehensive fuzzing campaigns -- and DIAR helps you find such seeds.
- These are slides of the talk given at IEEE International Conference on Software Testing Verification and Validation Workshop, ICSTW 2022.
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...DanBrown980551
Do you want to learn how to model and simulate an electrical network from scratch in under an hour?
Then welcome to this PowSyBl workshop, hosted by Rte, the French Transmission System Operator (TSO)!
During the webinar, you will discover the PowSyBl ecosystem as well as handle and study an electrical network through an interactive Python notebook.
PowSyBl is an open source project hosted by LF Energy, which offers a comprehensive set of features for electrical grid modelling and simulation. Among other advanced features, PowSyBl provides:
- A fully editable and extendable library for grid component modelling;
- Visualization tools to display your network;
- Grid simulation tools, such as power flows, security analyses (with or without remedial actions) and sensitivity analyses;
The framework is mostly written in Java, with a Python binding so that Python developers can access PowSyBl functionalities as well.
What you will learn during the webinar:
- For beginners: discover PowSyBl's functionalities through a quick general presentation and the notebook, without needing any expert coding skills;
- For advanced developers: master the skills to efficiently apply PowSyBl functionalities to your real-world scenarios.
Unlocking Productivity: Leveraging the Potential of Copilot in Microsoft 365, a presentation by Christoforos Vlachos, Senior Solutions Manager – Modern Workplace, Uni Systems
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...James Anderson
Effective Application Security in Software Delivery lifecycle using Deployment Firewall and DBOM
The modern software delivery process (or the CI/CD process) includes many tools, distributed teams, open-source code, and cloud platforms. Constant focus on speed to release software to market, along with the traditional slow and manual security checks has caused gaps in continuous security as an important piece in the software supply chain. Today organizations feel more susceptible to external and internal cyber threats due to the vast attack surface in their applications supply chain and the lack of end-to-end governance and risk management.
The software team must secure its software delivery process to avoid vulnerability and security breaches. This needs to be achieved with existing tool chains and without extensive rework of the delivery processes. This talk will present strategies and techniques for providing visibility into the true risk of the existing vulnerabilities, preventing the introduction of security issues in the software, resolving vulnerabilities in production environments quickly, and capturing the deployment bill of materials (DBOM).
Speakers:
Bob Boule
Robert Boule is a technology enthusiast with PASSION for technology and making things work along with a knack for helping others understand how things work. He comes with around 20 years of solution engineering experience in application security, software continuous delivery, and SaaS platforms. He is known for his dynamic presentations in CI/CD and application security integrated in software delivery lifecycle.
Gopinath Rebala
Gopinath Rebala is the CTO of OpsMx, where he has overall responsibility for the machine learning and data processing architectures for Secure Software Delivery. Gopi also has a strong connection with our customers, leading design and architecture for strategic implementations. Gopi is a frequent speaker and well-known leader in continuous delivery and integrating security into software delivery.
Pushing the limits of ePRTC: 100ns holdover for 100 daysAdtran
At WSTS 2024, Alon Stern explored the topic of parametric holdover and explained how recent research findings can be implemented in real-world PNT networks to achieve 100 nanoseconds of accuracy for up to 100 days.
UiPath Test Automation using UiPath Test Suite series, part 4DianaGray10
Welcome to UiPath Test Automation using UiPath Test Suite series part 4. In this session, we will cover Test Manager overview along with SAP heatmap.
The UiPath Test Manager overview with SAP heatmap webinar offers a concise yet comprehensive exploration of the role of a Test Manager within SAP environments, coupled with the utilization of heatmaps for effective testing strategies.
Participants will gain insights into the responsibilities, challenges, and best practices associated with test management in SAP projects. Additionally, the webinar delves into the significance of heatmaps as a visual aid for identifying testing priorities, areas of risk, and resource allocation within SAP landscapes. Through this session, attendees can expect to enhance their understanding of test management principles while learning practical approaches to optimize testing processes in SAP environments using heatmap visualization techniques
What will you get from this session?
1. Insights into SAP testing best practices
2. Heatmap utilization for testing
3. Optimization of testing processes
4. Demo
Topics covered:
Execution from the test manager
Orchestrator execution result
Defect reporting
SAP heatmap example with demo
Speaker:
Deepak Rai, Automation Practice Lead, Boundaryless Group and UiPath MVP
In his public lecture, Christian Timmerer provides insights into the fascinating history of video streaming, starting from its humble beginnings before YouTube to the groundbreaking technologies that now dominate platforms like Netflix and ORF ON. Timmerer also presents provocative contributions of his own that have significantly influenced the industry. He concludes by looking at future challenges and invites the audience to join in a discussion.
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf91mobiles
91mobiles recently conducted a Smart TV Buyer Insights Survey in which we asked over 3,000 respondents about the TV they own, aspects they look at on a new TV, and their TV buying preferences.
SAP Sapphire 2024 - ASUG301 building better apps with SAP Fiori.pdfPeter Spielvogel
Building better applications for business users with SAP Fiori.
• What is SAP Fiori and why it matters to you
• How a better user experience drives measurable business benefits
• How to get started with SAP Fiori today
• How SAP Fiori elements accelerates application development
• How SAP Build Code includes SAP Fiori tools and other generative artificial intelligence capabilities
• How SAP Fiori paves the way for using AI in SAP apps
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024Neo4j
Neha Bajwa, Vice President of Product Marketing, Neo4j
Join us as we explore breakthrough innovations enabled by interconnected data and AI. Discover firsthand how organizations use relationships in data to uncover contextual insights and solve our most pressing challenges – from optimizing supply chains, detecting fraud, and improving customer experiences to accelerating drug discoveries.
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024
EDF2012 Chris Taggart - How the biggest Open Database of Companies was built
1. How we built the largest
open database of
companies in the world
Thursday, 7 June 2012
2. A simple (huge) goal: an entry (and URI) for
every corporate legal entity in the world
URI is based on the company register
ID, meaning it’s open and IP-free
Also i
trade mpor
marks ting p
officia , gove ublic data
l regis rnme
ters & nt spe –
gazet nding
te not ,
ices..
.
Thursday, 7 June 2012
3. All Op
enly L
free re icens
use, e ed, al
ven c lowin
omm g
ercial
ly
Thursday, 7 June 2012
5. 1. An open identifying system
URIs can be used as common identifiers among a
variety of organisations
Can be used without reference to OpenCorporates
Because they map to the id issued by the company
register the corresponding entry in the registry (and
associated info) can be found, and vice versa
Fits the new EU Business Vocabulary
Can even by used for companies in jurisdiction we
haven’t yet imported
Thursday, 7 June 2012
6. 2. The simple search
Not to be underestimated
Massively reduces friction
(how long will it take you
to find and search
multiple jurisdictions)
Allows what if questions
Potentially generates
stories in its own right
Thursday, 7 June 2012
7. 3. Source for additional info
Addresses, filings,
status, websites...
Intl trademarks, UK
govt spending, official
notices, health & safety
violations...
Other IDs: SEC, CAGE,
etc – allows reverse
mapping queries, e.g.
show me legal entitity
mapped to a CIK code
Thursday, 7 June 2012
8. 4. Reconciliation
(matching names to legal entities)
Clean up messy
company names
(& prev names)
to legal entity,
and from there
to other data
Google Refine
reconciliation
service (specific
to jurisdiction)
Thursday, 7 June 2012
9. 5. The platform
API: allows all
information to be
retrieved as data,
even searches
Users can now
add data too
Coming soon: the
option to match
data to
companies
Thursday, 7 June 2012
10. New feature: directors/officers
We’ve just
started
importing &
indexing
company
directors &
officers,
allowing search
by name, &
other resources
finding links
between them
and other similarly named
companies
Thursday, 7 June 2012
11. How have we done it?
1. Started small,
with just three
countries and
3 million
companies
2. Increasingly
using official
sources, where
this is possible (i.e.
the company
registers are open
and make data
available)
Thursday, 7 June 2012
12. How have we done it?
3. Leveraged the
open data
community and
ScraperWiki to
scrape company
registers around
the world
4. Worked with
governments to
help understand
the problems – EU,
World Bank, G20
Financial Stability
Board, etc
Thursday, 7 June 2012
13. The technology
Vanilla, commodity open-source software, hosted on our
own UK-based servers
Database MySQL
(but considering PostgreSQL)
Search Solr
(but considering ElasticSearch)
Code Ruby
(RubyOnRails main app, Sinatra API,
vanilla Ruby for various internal libraries)
Webserver Nginx (webserver) + Memcached
(caching) + Redis (queue + persistence)
Thursday, 7 June 2012
14. How do we pay for all this?
Unlike many open data projects, we’re a for-profit
company – the open data movement needs successful
companies if it’s going to have a diverse ecosystem
But we’re a company whose business model is
dependent on making more data open, and an
advisory board to make sure we do the right thing
Not yet looking for customers, but...
Thursday, 7 June 2012
15. How do we pay for all this?
Two projected sources of income
Services model, especially around cleansing data/
reconciliation. Of course, you can use our API,
reconciliation service without asking us, but it may be
cheaper to pay us to do it. Ditto custom extracts, and
verticals
Dual-licence model – contribute back to the community
either with data, or financial support, e.g. if you have a
proprietary database you may not want to be bound by
the share-alike attribution restrictions
And we already have some (small) customers
Thursday, 7 June 2012
16. The problems
Getting the data Company registers have forgotten their
main role is as public record, and actively work to prohibit
free and open access to the data
Thursday, 7 June 2012
17. The problems
Understanding the data Language, legal and cultural
issues, not to mention the complexity of the subject
Thursday, 7 June 2012
18. The problems
Normalising the data How do we abstract company
types, status, industry codes, addresses, etc
Thursday, 7 June 2012
19. W3C Business Vocabulary
What are
we doing?
Why are we
doing it?
What does
it mean?
Where is it
going?
Thursday, 7 June 2012
20. The problems
Handling the data Over 150 million rows in some tables
(slow schema changes), heavy reading and writing,
evolving understanding of the problems and solutions
Thursday, 7 June 2012
21. tions
isdic tes
0 jur
nies in 5 23 US sta
compa clud ing
3million In
wo v er 4
No
Thursday, 7 June 2012