IBM Big Data References


Published on

The first installment of IBM Big Data

Published in: Technology
  • Be the first to comment

No Downloads
Total views
On SlideShare
From Embeds
Number of Embeds
Embeds 0
No embeds

No notes for slide

IBM Big Data References

  1. 1. IBM Big DataSuccess Stories
  2. 2. A note from Rob Thomas Big data. By now you have heard the term and it’s easy to grasp what it means as the world continues to create 2.5 quintillion bytes daily. Or, maybe not; can you fathom one quintillion bytes? I can’t. But I can relate to Vestas Wind Systems, a leader in the development of wind energy that uses their IBM big data solution and one of the world’s largest supercomputers to analyze weather information and provide location site data in minutes instead of weeks, even while its wind library is increasing from 2.8 petabytes to as much as 24 petabytes of data - the equivalent of 1420 times the books in America’s Library of Congress. In your business, you have your own big data challenges. You have to turn mountains of data about your customers, products, incidents, etc., into actionable information. While the volume, variety and velocity of big data seem overwhelming, big data technology solutions hold great promise. The way I see it, we are on the mountain top with a vista of opportunity ahead. We have the capacity to understand; to see patterns unfolding in real time across multiple complex systems; to model possible outcomes; and to take actions that produce greater economic growth and societal progress. IBM is marshaling its resources to bring smarter computing to big data. With the IBM big data platform, we are enabling our clients to manage data in ways that were never thought possible before. In this collection of Big Data Success Stories, we share a sample of our customers’ successes including: • [x+1], an end-to-end digital marketing platform provider for advertisers and agencies, is helping their clients realize a 20% growth in digital sales by analyzing massive volumes of advertising data in real-time using IBM Netezza • KTH Royal Institute of Technology in Stockholm, which uses streaming data in their congestion management system, is already reducing traffic in the Swedish capital by 20 percent, lowering average travel times by almost 50 percent and decreasing the amount of emissions by 10 percent • Researchers at the University of Ontario-Institute of Technology who are using streaming analytics to help neonatal care hospitals predict the onset of potentially fatal infections in premature babies We are humbled at “miracles” our clients are achieving and are very proud of the role we are playing in making cities, commerce, healthcare and a full spectrum of additional industries smarter. I hope you will enjoy reading these Big Data Success Stories and consider IBM when you take on big data challenges in your enterprise. Sincerely, Rob Thomas Vice President, Business Development IBM
  3. 3. Contents Bringing smarter computing to big data . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1 IBM Unveils Breakthrough Software and New Services to Exploit Big Data . . . . . . . . . . . . . . . . . . 2 Customer Success Stories Beacon Institute . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 Faces . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 8 Hertz . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 12 KTH – Royal Institute of Technology . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 16 Marine Institute Ireland . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 18 Technovated . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 22 TerraEchos . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 24 University of Ontario Institute of Technology . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 28 Uppsala University . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 34 Vestas . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 37 Watson . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 44 [x+1] . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 47 IBM Business Partner Ecosystem. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 54 Featured Business Partners Datameer . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 60 Digital Reasoning . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 61 Jaspersoft . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 63 Karmasphere. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 67 MEPS . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 69
  4. 4. Smarter computing builds a Smarter PlanetBringing smarter computing to big data.To build a smarter planet, we need smarter computing— CFO Study by the IBM Institute for Business Value showedcomputing that is tuned to the task, managed through the that companies that excel at fi nance effi ciency and havecloud and, importantly, designed for big data. more mature business analytics and optimization outperform their peers, with 49% higher revenue growth, 20 times moreHow big? We’re now creating 2.5 quintillion bytes daily— profit growth, and 30% higher return on invested much that 90% of the data in the world today has beencreated in the last two years alone. With continuously analyzed data, organizations can be what they want to be, at all times. Consider the Memphis PoliceThis data is also big in another way—in its promise. We now Department, which compiles volumes of crime records fromhave the capacity to understand, with greater precision than a variety of sources and systems, and has reduced seriousever before, how our world actually works—to see patterns crime by more than 30%. Fresh food grower Sun Worldunfolding in real time across multiple complex systems; to International is leveraging insights from their data to cutmodel possible outcomes; and to take actions that produce natural resource use by 20%. Research at the University greater economic growth and societal progress. of Ontario Institute of Technology is developing streamingWe can do more than manage information—we can manage analytics to help neonatal care hospitals. By analyzing 43vast information supply chains. They’re made up of not million streaming data points per patient, per day, they canonly the ones and zeros of structured data that traditional improve patient outcomes by using all of the data available.computers love, but streams of unstructured text, images, This list could go on. And at the leading edge of smartersounds, sensor-generated impulses and more. computing, IBM’s Watson—the computer that bested the twoWe can parse the real languages of commerce, processes all-time champions on the television quiz show Jeopardy!—and natural systems—as well as conversations from the demonstrates the power of analytics to provide meaningfulgrowing universe of tweets, blogs and social media. We insights from an ever-increasing volume and variety of data,can also draw on advanced technologies such as stream enabling correct answers and winning actions, in real time.computing, which fi lters gigabytes of data per second, As our world gets smaller, our data keeps getting bigger—analyzes these while still in motion and decides on the which is good news. Information that was once merelyappropriate action for the data, such as a real-time alert or overload now lets us see our planet in entirely new waysstoring an insight in a data warehouse for later analysis. and intervene to make it work better. Because computingBut we can only do all of this if our computing systems are systems designed for big data are systems designed for goodsmart enough to keep up. According to the IBM Business decision making. Which is, after all, what being smarter isAnalytics and Optimization for the Intelligent Enterprise study, all in three business leaders frequently make decisions Let’s build a smarter planet. Join us and see what others arewithout the information they need. Half don’t have access doing at the information they need to do their jobs. And that hassignificant competitive implications. The 2010 IBM Global 1
  5. 5. IBM Unveils Breakthrough Software and New Services to Exploit Big DataCommits $100 Million to Massive Scale Analytics ResearchARMONK, N.Y., - 20 May 2011: As companies seek to “The volume and velocity of information is generated atgain real-time insight from diverse types of data, IBM a record pace. This is magnified by new forms of data(NYSE: IBM) today unveiled new software and services coming from social networking and the explosion of mobileto help clients more effectively gain competitive insight, devices,” said Steve Mills, Senior Vice President and Groupoptimize infrastructure and better manage resources to Executive, IBM Software & Systems. “Through our extensiveaddress Internet-scale data. For the first time, organizations capabilities in business and technology expertise, IBM iscan integrate and analyze tens-of-petabytes of data in its best positioned to help clients not only extract meaningfulnative format and gain critical intelligence in sub-second insight, but enable them respond at the same rate at whichresponse times. the data arrives.”IBM also announced a $100 million investment for New Services Address Analytics for ITcontinued research on technologies and services that will Infrastructureenable clients to manage and exploit data as it continuesto grow in diversity, speed and volume. The initiative will Leveraging years of intellectual capital in managing datafocus on research to drive the future of massive scale centers and IT departments, as well as over 30 patentedanalytics, through advancing software, systems and technologies from IBM Research, the new IT services services capabilities. feature dozens of analytical tools to help IT professionals use server, storage and networking technologies moreThe news comes on the heels of the 2011 IBM Global efficiently, improving security and insight into planning majorCIO Study where 83 percent of 3,000 CIOs surveyed said IT investments. Examples of services that help clients withapplying analytics and business intelligence to their IT analytics include:operations is the most important element of their strategicgrowth plans over the next three to five years. • Cloud Workload Analysis -- The new analysis tool maps your IT workload characteristics and current capabilitiesToday’s news further enables Smarter Computing innovations to prioritize cloud deployment and migrations plans. Thisrealized by designing systems that incorporate Big Data for allows IT managers to identify cloud opportunities 90better decision making, and optimized systems tuned to the percent faster to reduce costs.task and managed in a cloud. • Server and Storage -- New server optimization andAccording to recent IT industry analyst reports, enterprise analysis tools achieve up to 50 percent reduceddata growth over the next five years is estimated to increase transformation costs and up to 80 percent fasterby more than 650 percent. Eighty percent of this data is implementation time. New storage services help createexpected to be unstructured. self-service to provision explosive growth while reducing architects time by 50 percent.The new analytics capabilities pioneered by IBM Research will enable chief information officers (CIOs) to construct • Data Center Lifecycle Cost Analysis Tool -- Identifies howspecific, fact-based financial and business models for their IT to reduce total data center costs by up to 30 percent byoperations. Traditionally, CIOs have had to make decisions assessing total cost plus including environmental impactabout their IT operations without the benefit of tools that can over a 10 to 20 year interpret and model data. • Security Analytic services -- Analytic systems identifyWith today’s news, IBM is expanding its portfolio and known events and automatically handle them; Thisfurthering its investments in analytics with: results in handling of more than 99 percent of critical events without human intervention.• New, patented software capabilities to analyze massive volumes of streaming data with sub-millisecond response IBM Big Data Software Taps into Hadoop times and Hadoop-based analytics software to offer scalable storage to handle tens-of-petabytes level data. IBM is making available new InfoSphere BigInsights and These capabilities complement and leverage existing IT Streams software that allows clients to gain fast insight into infrastructure to support a variety of both structured and information flowing in and around their businesses. The unstructured data types. software, which incorporates more than 50 patents, analyzes traditional structured data found in databases along with• 20 new services offerings, featuring patented analytical unstructured data -- such as text, video, audio, images, social tools for business and IT professionals to infuse media, click streams -- allowing decision makers to act on it predictive analytics throughout their IT operations. The at unprecedented speeds. services enable IT organizations to assess, design and configure their operations to address and take advantage of petabytes of data. 2
  6. 6. IBM Unveils Breakthrough Software and New Services to Exploit Big DataBigInsights software is the result of a four-year effort of University of Ontario Institute of Technologymore than 200 IBM Research scientists and is powered by Expands Neo-Natal Research to Chinathe open source technology, Apache Hadoop. The software provides a framework for large scale parallel processing Dr. Carolyn McGregor, Research Chair in Health Informatics and scalable storage for terabyte to petabytes-level data. It at the University of Ontario Institute of Technology has been incorporates Watson-like technologies, including unstructured exploring new approaches for the last 12 years to providetext analytics and indexing that allows users to analyze specialists in neonatal intensive care units better ways to spotrapidly changing data formats and types on the fly. potentially fatal infections in premature babies.Additional new features include data governance and Changes in streams of real-time data such as respiration,security, developer tools, and enterprise integration to make heart rate and blood pressure are closely monitored in herit easier for clients to build a new class of Big Data analytics work and now she is expanding her research to China.applications. IBM also offers a free downloadable BigInsights “Building upon our work in Canada and Australia, we willBasic Edition for clients to help them explore Big Data apply our research to premature babies at hospitals inintegration capabilities. China. With this new additional data, we can compare the differences and similarities of diverse populations ofAlso born at IBM Research, InfoSphere Streams software premature babies across continents,” said Dr. McGregor.analyzes data coming into an organization and monitors it for “In comparing populations, we can set the rules to optimizeany changes that may signify a new pattern or trend in real the system to alert us when symptoms occur in real time,time. This capability helps organizations to capture insights which is why having the streaming capability that the IBMand make decisions with more precision, providing an platform offers is critical. The types of complexities that we’reopportunity to respond to events as they happen. looking for in patient populations would not be accessible with traditional relational database or analytical approaches.”New advancements to Streams software makes it possibleto analyze Big Data such as Tweets, blog posts, video IBM’s Big Data software and services reinforces IBM’sframes, EKGs, GPS, and sensor and stock market data up analytics initiatives to deliver Watson-like technologiesto 350 percent faster than before. BigInsights complements that help clients address industry specific issues. On theStreams by applying analytics to the organization’s historical heels of The IBM Jeopardy! Challenge, in which the IBMdata as well as data flowing through Streams. This is an Watson system demonstrated a breakthrough capability toongoing analytics cycle that becomes increasingly powerful understand natural language, advanced analytical capabilitiesas more data and real-time analytic results are available to be can now be applied on real client challenges ranging frommodeled for improvement. identifying fraud in tax or healthcare systems, to predicting consumer buying behaviors for retail clients.As a long time proponent of open source technology, IBMhas chosen the Hadoop project as the cornerstone of its Big Over the past five years, IBM has invested more than $14Data Strategy. With a continued focus on building advanced billion in 24 analytics acquisitions. Today, more than 8,000analytics solutions for the enterprise, IBM is building upon IBM business consultants are dedicated to analytics and overthe power of these open source technologies while adding 200 mathematicians are developing breakthrough algorithmsimproved management and security functions, and reliability inside IBM Research. IBM holds more than 22,000 active that businesses demand. Hadoop’s ability to process a broad U.S. patents related to data and information management.set of information across multiple computing platforms,combined with IBM’s analytics capabilities, now makes To hear how IBM clients are using analytics to transformit possible for clients to tackle today’s growing Big Data their business visit: IBM’s portfolio of Hadoop-based offerings also ibmbusinessanalytics.include IBM Cognos Consumer Insight which integrates For more information on IBM Big Data initiatives, visit:social media content with traditional business analytics, IBM Coremetrics Explore which segments consumerbuying patterns and drills down into mobile data. Additionally, For more information on IBM’s full set of new analyticsHadoop is the software framework the IBM Watson computing services, visit: uses for distributing the workload for processinginformation, which supports the systems breakthrough abilityto understand natural language and provide specific answersto questions at rapid speeds. 3
  7. 7. Customer Success Stories Beacon Institute Faces Hertz KTH – Royal Institute of Technology Marine Institute Ireland Technovated TerraEchos University of Ontario Institute of Technology Uppsala University Vestas Watson [x+1] 4
  8. 8. Big Data ProfilesIBM Software Group Beacon Institute, Clarkson University and IBM Managing the environmental impact on rivers by streaming information Most of the world’s population lives near a river or estuary. Yet, there is typically no way to gain a clear understanding of what is happening below Overview the surface of the water to help predict and manage changes in the river The need that could impact local communities that rely on the waterway. Scientists need new technology to study complex environmental interactions to better understand how communities and The River and Estuary Observatory Network (REON) project is a joint ecosystems interact. effort between Beacon Institute for Rivers and Estuaries, Clarkson The solution University and IBM® Research. REON is the first technology-based, IBM InfoSphere Streams software and real-time monitoring network for rivers and estuaries of its kind, and allows high-performance computing system for continuous monitoring of physical, chemical and biological data from collect and analyze data in real time as it points in New York’s Hudson, Mohawk and St. Lawrence Rivers by means streams in from environmental data sources to support predictive analysis of an integrated network of sensors, robotics, mobile monitoring and and decision making. computational technology deployed in the rivers. The benefit Streaming real-time data technology “Imagine predicting environmental impacts the way we forecast and report helps resource management programs the weather,” says John Cronin, Founding Director of Beacon Institute and respond more effectively to chemical, Beacon Institute Fellow at Clarkson University. “With that technological physical and biological alterations to local water resources. capability we can better understand the effects of global warming, the movements of migrating fish or the transport of pollutants. The implications for decision-making and education are staggering.” 5
  9. 9. Big Data ProfilesIBM Software Group Applying real-time technology to help understand Solution components: the environment REON is a test bed for the IBM System S stream computing system. A Software team of IBM engineers and scientists work on the REON collaboration • IBM® InfoSphere® Streams and have access to IBM’s extensive analytical and computational resources from the IBM Watson Research Lab. The IBM Global Engineering Solutions team executed the fundamental design elements of the data “Imagine predicting streaming pilot. This high-performance architecture rapidly analyzes data environmental impacts as it streams in from many sources. the way we forecast A networked array of sensors in the river provides the data necessary and report the weather. to locally observe spatial variations in such variables as temperature, . . . The implications for pressure, salinity, turbidity, dissolved oxygen and other basic water chemistry parameters. All of these sensors, transmitting information in decision-making and real time, results in massive amounts of data. education are staggering.” Using real-time, multi-parameter modeling systems helps develop a — John Cronin, Founding Director of Beacon better understanding of the dynamic interactions within local riverine and Institute for Rivers and Estuaries and Beacon Institute Fellow at Clarkson University estuarine ecosystems. Making real-world data easily accessible to outside systems, researchers, policymakers and educators helps foster increased collaboration. The ultimate benefit is helping resource management programs respond more effectively to chemical, physical and biological alterations to local water resources. REON—New technology for the smarter water management “The Hudson River is the pilot river system for REON, and the 12 million people who live within its watershed will be the first beneficiaries of our work,” says Cronin. Helping to make sense of all that data is IBM InfoSphere® Streams software, part of IBM’s big data platform. InfoSphere Streams provides capabilities that collect and analyze data from thousands of information sources to help scientists better understand what is happening in the world—as it happens. Eventually, REON data could be applied to visualize the movement of chemical constituents, monitor water quality, and protect fish species as they migrate, as well as provide a better scientific understanding of river and estuary ecosystems. 2 6
  10. 10. Big Data ProfilesIBM Software Group “As water resource management expert Doug Miell has said, you can’t manage what you can’t measure. . . Society and business are facing “The Hudson River is increasingly complex challenges when it comes to understanding and the pilot river system managing water resources on this planet,” says John E. Kelly III, Senior for this groundbreaking Vice President and Director, IBM Research. “Getting smart about water is important to all of us for one simple reason: water is too precious a initiative, and the 12 resource to be wasted.” million people who live within its watershed will Positively Impacting the Environment Worldwide be the first beneficiaries Cronin concludes, “This new way of observing, understanding and predicting how large river and estuary ecosystems work ultimately will of our work.” allow us to translate that knowledge into better policy, management and education for the Hudson River and for rivers and estuaries worldwide.” — John Cronin For more information To learn more about IBM InfoSphere Streams, visit: To learn more about IBM big data, visit: To increase your big data knowledge and skills, visit: To get involved in the conversation, visit: For more information on Beacon Institute for Rivers and Estuaries, visit: 3 7
  11. 11. IBM Software Manufacturing and Computer ServicesInformation Management IBM Applies emerging technologies to deliver instantaneous people searches With an enterprise population of over 600,000 people worldwide, how Overview do IBM® employees find and connect with their colleagues? For over a decade, IBM BluePages has been the primary source. This high-demand, The need intranet application provides information on all IBM employees and With over 600,000 names in BluePages, contractors, including areas of expertise and responsibilities. And with IBM’s employee directory, and over 500,000 queries daily, the average search IBM’s focus on innovation and emerging technologies, positive changes session takes two minutes. IBM needed a are always on the horizon. faster, more efficient application. The solution “BluePages is one of the most used applications at IBM,” says Sara Weber, manager of IBM’s CIO Lab Analytics team. “At one time, Using Apache open source technologies, the IBM CIO Lab Analytics team BluePages was state-of-the-art; however, over the years it was not developed a new people-search updated to keep up with new advances in Internet technology. With over application that allows flexible queries 500,000 BluePages searches done every day, and with BluePages accessing and returns as many results as possible, as fast as possible. Additional capabilities huge volumes of data, an average search session can take up to two include quick browsing and photo minutes. When multiple results are returned they do not show individual images. photo images, and incorrect spelling may yield no results. My team was The benefit tasked with addressing the question: ‘How can we build a better and The new Faces application offers faster people search?’” instantaneous response time, saving on average over a minute for each search The goals for this project, aptly named Faces, were to support flexible session—and thousands of hours daily queries and return as many results as possible, as fast as possible. Results for IBM employees. that more closely matched the query would appear first. Additional capabilities would permit quick browsing and photo images. 8
  12. 12. IBM Software Manufacturing and Computer ServicesInformation Management Applying emerging technologies to deliver “At IBM, when we innovation Weber’s CIO Lab Analytics team identifies problems that IBM employees find an open source are experiencing and finds ways to apply emerging technologies to technology that has develop solutions. “We had to process tremendous amounts of data, and then store it in a way that it could be accessed quickly,” says Weber. “For potential, we experiment this project, we selected Apache Hadoop and Apache Voldemort; both are with it to understand open source technologies. My development team has extensive expertise how to use it to bring in using Hadoop technology. The Faces application was developed by two members of our team over a five month period.” the most business value to IBM. For example, Apache Hadoop allows developers to create distributed applications that run on clusters of computers. Organizations can leverage this IBM InfoSphere infrastructure to handle large data sets, by dividing the data into “chunks” BigInsights is a new and coordinating the data processing in the distributed, clustered class of analytics platform environment. Once the data has been distributed to the cluster, it can be processed in parallel. Apache Voldemort is a distributed key-value storage based on Hadoop and system that offers fast, reliable and persistent storage and retrieval. innovation from IBM. It Specific keys return specific values. If no additional query power is needed, a key value store is faster than a database. can store raw data ‘as-is’ and help clients gain “At IBM, when we find an open source technology that has potential, we rapid insight through experiment with it to understand how to use it to bring the most business value to IBM,” says Weber. “For example, IBM InfoSphere® BigInsights large scale analysis.” is a new class of analytics platform based on Hadoop and innovation from IBM. It can store raw data ‘as-is’ and help clients gain rapid insight —Sara Weber, Manager, IBM’s CIO Lab through large scale analysis.” Analytics team For Faces, Hadoop preprocesses data from the IBM Enterprise Directory and Social Networks and sends this information to the Voldemort Person Store (2.2 GB). Voldemort, in turn, sends data to Hadoop processing for the Person ID fetcher, Reports Loader, Query Expander, and Location Expander. These results are saved to Voldemort’s Query Store (5.5 GB). Hadoop also receives images from BluePages that are saved in Voldemort’s image store to remain available for Hadoop’s montage generator. 2 9
  13. 13. IBM Software Manufacturing and Computer ServicesInformation Management “We placed all 600,000 names into memory for immediate access,” says Solution components Weber. “Preprocessing with Hadoop directly improves performance. Each time you type a letter in a name, results are immediate. We have Servers precomputed the search process to retrieve every employee name that ● IBM® BladeCenter® servers matches what is entered. Every time you type another letter, scoring Software retrieves people who are more relevant to the search criteria. The ● Apache Hadoop information is available and, from a performance perspective, everything ● Apache Voldemort Key Value is ready to go. Memory and storage are inexpensive and nightly Storage System processing takes only a few hours.” Weber adds, “We run Hadoop on ten, five-year-old IBM BladeCenter® “We could not have servers. These Blades are low powered, but Hadoop distributes the workload and takes advantage of the hardware to the fullest. If more developed Faces without computation is needed, we can add machines and improve performance the distributed processing without modifying the code.” capabilities Hadoop Measuring business value provides. The Faces According to Weber, the new Faces application enables employees to application has really receive instantaneous search results. “Conservatively speaking, we are saving on average over a minute for each search session,” says Weber. highlighted the power of “Searches are faster and easier. The information is timely and accurate. Hadoop and has helped With over 500,000 searches daily, IBMers are saving thousands of hours us address a major pain each day.” point for all IBMers.” For IBM employees, the improvement is noticeable. “To gain user acceptance or change user behavior, we know any new solution we create —Sara Weber has to be significantly faster and better,” says Weber. “As far as I know, Faces is the fastest growing innovation ever introduced at IBM. In the first two weeks, Faces went from zero to 85,000 users with continued viral growth throughout the entire IBM organization. What used to take minutes now takes milliseconds. We provide a feedback button on all our applications so users can report errors or issues. With Faces, IBMers were using the feedback button to say, ‘Thank you for making my job so much easier.’” Weber concludes, “We could not have developed Faces without the distributed processing capabilities Hadoop provides. The Faces application has really highlighted the power of Hadoop and has helped us address a major pain point for all IBMers.” 3 10
  14. 14. For more informationTo learn more about IBM Information Management solutions, pleasecontact your IBM sales representative or IBM Business Partner, or visitthe following website: learn more about IBM InfoSphere BigInsights,, financing solutions from IBM Global Financing can enableeffective cash management, protection from technology obsolescence,improved total cost of ownership and return on investment. Also, ourGlobal Asset Recovery Services help address environmental concerns withnew, more energy-efficient solutions. For more information onIBM Global Financing, visit:© Copyright IBM Corporation 2011IBM CorporationSoftware GroupRoute 100Somers, NY 10589U.S.A.Produced in the United States of AmericaOctober 2011All Rights ReservedIBM, the IBM logo,, InfoSphere, and BladeCenter are trademarks ofInternational Business Machines Corporation in the United States, other countriesor both. If these and other IBM trademarked terms are marked on their first occurrencein this information with a trademark symbol (® or ™), these symbols indicate U.S.registered or common law trademarks owned by IBM at the time this informationwas published. Such trademarks may also be registered or common law trademarks inother countries. A current list of IBM trademarks is available on the web at“Copyright and trademark information” at company, product and service names may be trademarks or service marksof others.References in this publication to IBM products or services do not imply thatIBM intends to make them available in all countries in which IBM operates. Please Recycle IMC14698-USEN-00 11
  15. 15. Big Data ProfilesIBM Software Group Hertz, Mindshare Technologies and IBM Analyzing huge volumes of customer comments in real time delivers a competitive edge As the world’s largest airport car rental brand with more than 8,300 Overview locations in 146 countries, Hertz continually requests and receives feedback from its customers. To retain a competitive edge, the feedback The need Improving service means listening to is analyzed so that issues can be identified in real-time and problems customers and gathering thousands can be addressed and resolved quickly. of comments via web, email and text messages. Each comment is viewed and categorized manually for customer “Hertz gathers an amazing amount of customer insight daily, including service reporting. Inconsistencies were thousands of comments from web surveys, emails and text messages. at an unacceptable level. We wanted to leverage this insight at both the strategic level and the The solution local level to drive operational improvements,” says Joe Eckroth, Chief Using feedback management and content analytics software, customer Information Officer, The Hertz Corporation. comments are captured in real time to be transformed into actionable intelligence. Linguistic rules automatically analyze Leveraging unstructured data to improve and tag unstructured content into customer satisfaction meaningful service reporting categories. Hertz and Mindshare Technologies, a leading provider of enterprise The benefit feedback solutions, are using IBM® Content Analytics software to Automated tagging increased report examine customer survey data, including text messages. The goal is consistency, freed Hertz field managers from tagging comments, and roughly to identify car and equipment rental performance levels to enable doubled what the managers had pinpointing issues and making the necessary adjustments to improve achieved manually. customer satisfaction levels. IBM Content Analytics allows for deep, rich text analysis of information, helping organizations gain valuable insight from enterprise content regardless of source or format. This technology can help reveal undetected problems, improve content-centric process inefficiencies, and take customer service and revenue opportunities to new levels, while helping to reduce operating costs and risks. 12
  16. 16. Big Data ProfilesIBM Software Group Using Content Analytics together with a sentiment-based tagging Solution components: solution from Mindshare Technologies, Hertz introduced a “Voice of the Customer” analytics system that automatically captures large Software volumes of information reflecting customer experiences in real-time, • IBM® Content Analytics and helps transform the information into actionable intelligence. Using a series of linguistic rules, the “Voice of the Customer” system categorizes comments received via email and online with descriptive terms, such as “Hertz gathers an Vehicle Cleanliness, Staff Courtesy and Mechanical Issues. The system amazing amount of also flags customers who request a callback from a manager or those who customer insight daily, mention #1 Club Gold, Hertz’s customer loyalty program. including thousands of “Working closely with the IBM-Mindshare team, we are able to better comments from web focus on improvements that our customers care about, while removing a surveys, emails and text time-consuming burden from our location managers. This has greatly messages. We wanted to improved the effectiveness of our ‘Voice of the Customer’ program and has leverage this insight at helped build on our reputation for delivering superior customer service.” both the strategic level Improving speed and accuracy of processing and the local level to drive customer feedback operational improvements.” In the ultra-competitive world of vehicle and equipment rental, Hertz recognizes that understanding customer feedback and adapting the — Joe Eckroth, Chief Information Officer, business accordingly is what drives market share and success. However, The Hertz Corporation most of this valuable information is trapped inside free-form customer feedback surveys. Prior to working with IBM and Mindshare Technologies, Hertz location managers read each customer comment submitted online via email or by phone, and then manually categorized it for basic reporting and analysis. This approach proved to be labor-intensive and inconsistent, as comments were categorized based on a manager’s personal interpretation. Automating the task of tagging customer comments has increased report consistency and roughly doubled what the managers had achieved manually. 2 13
  17. 17. Big Data ProfilesIBM Software Group IBM Content Analytics software has improved the accuracy and speed of the tagging and analyzing process, setting the stage for more reliable “Working closely with analytics. Free from manually tagging comments, Hertz field managers the IBM-Mindshare can now focus attention on performing deep-dive analysis on the team, we are able to information, quickly identifying trends or issues and adjusting better focus on operational service levels accordingly. improvements that For instance, wait times at car rental locations can be a contentious our customers care issue. The faster and more efficient the car rental/return process, about, while removing the more likely the customer will do repeat business. Using analytics a time-consuming software, Hertz location managers are able to effectively monitor burden from our customer comments to deliver top customer satisfaction scores for this critical level of service. In Philadelphia, survey feedback led managers location managers.” to discover that delays were occurring at the returns area during certain parts of the day. They quickly adjusted staffing levels and ensured a – Joe Eckroth manager was always present in the area during these specific times. Hertz remains focused on customers and providing superior service The Internet and new social media technologies have made consumers more connected, empowered and demanding. The average online user is three times more likely to trust peer opinions over retailer advertising, underlining the importance for retailers to tap new technologies that pay close attention to what customers are saying. This effort with Hertz reflects IBM’s focus on helping organizations use analytics to get the most value from their information. IBM has a Business Analytics & Optimization services organization, with 7,000 consultants who can help clients get up and running with deep analytics capabilities. 3 14
  18. 18. For more informationTo learn more about IBM Content Analytics, learn more about IBM Business Optimization andAnalytics services, visit: increase your big data knowledge and skills, visit:www.BigDataUniversity.comTo get involved in the conversation, more information on Hertz,© Copyright IBM Corporation 2011IBM CorporationSoftware GroupRoute 100Somers, NY 10589U.S.A.Produced in the United States of AmericaOctober 2011All Rights ReservedIBM, the IBM logo and are trademarks or registered trademarks ofInternational Business Machines Corporation in the United States, other countries,or both. If these and other IBM trademarked terms are marked on their firstoccurrence in this information with a trademark symbol (® or ™), these symbolsindicate U.S. registered or common law trademarks owned by IBM at the time thisinformation was published. Such trademarks may also be registered or common lawtrademarks in other countries. A current list of IBM trademarks is available on theWeb at “Copyright and trademark information” at company, product and service names may be trademarks or service marksof others.References in this publication to IBM products or services do not imply that IBMintends to make them available in all countries in which IBM operates. Please Recycle IMC14706-USEN-00 15
  19. 19. Let’s build a smarter planet Education KTH – Royal Institute of Technology Analyzes real-time data streams to identify traffic patterns The Royal Institute of Technology (abbreviated KTH) is a university Stockholm, Sweden in Stockholm, Sweden. KTH was founded in 1827 as Sweden’s first polytechnic and is with Aalto University School of Science and Technology in Espoo, depending on definition, Scandinavia’s largest institution of higher education in technology and one of the leading technical universities in Europe. “ Analyzing large volumes of streaming data in real The Opportunity time is leading to smarter, Researchers at KTH, Sweden’s leading technical university, gather real-time traffic data from a variety of sources such as GPS from large more efficient and numbers of vehicles, radar sensors on motorways, congestion charging, environmentally friendly weather, etc. The integration and analysis of the data in order to better traffic in urban areas.” manage traffic is a difficult task. — Haris N. Koutsopoulos, Head of Transportation and Logistics, What Makes It Smarter Royal Institute of Technology, Collected data is now flowing into IBM InfoSphere Streams software—a Stockholm, Sweden unique software tool that analyzes large volumes of streaming, real-time data, both structured and unstructured. The data is then used to help intelligently identify current conditions, and estimate how long it would take to travel from point to point in the city, offer advice on various travel alternatives, such as routes, and eventually help improve traffic in a metropolitan area. Real Business Results • Uses diverse data, including GPS locations, weather conditions, speeds and flows from sensors on motorways, incidents and roadworks • Enters data into the InfoSphere Streams software, which can handle all types of data, both structured and unstructured • Handles, in real time, the large traffic and traffic-related data streams to enable researchers to quickly analyze current traffic conditions and develop historical databases for monitoring and more efficient management of the system 16
  20. 20. For more informationSolution Components Please contact your IBM sales representative or IBM Business Partner. Visit us at:• IBM® InfoSphere™ Streams• IBM BladeCenter® HS22 To learn more about KTH – Royal Institute of Technology visit:• IBM BladeCenter H Chassis• IBM System Storage® DS3400• Red Hat Linux® © Copyright IBM Corporation 2011 IBM Corporation 1 New Orchard Road Armonk, NY 10504 U.S.A. Produced in the United States March 2011 All Rights Reserved IBM, the IBM logo,, BladeCenter and InfoSphere are trademarks of International Business Machines Corporation, registered in many jurisdictions worldwide. A current list of IBM trademarks is available on the Web at “Copyright and trademark information” at Linux is a registered trademark of Linus Torvalds in the United States, other countries, or both. Other company, product or service names may be trademarks or service marks of others. The information contained in this documentation is provided for informational purposes only. While efforts were made to verify the completeness and accuracy of the information contained in this documentation, it is provided “as is” without warranty of any kind, express or implied. In addition, this information is based on IBM’s current product plans and strategy, which are subject to change by IBM without notice. IBM shall not be responsible for any damages arising out of the use of, or otherwise related to, this documentation or any other documentation. Nothing contained in this documentation is intended to, nor shall have the effect of, creating any warranties or representations from IBM (or its suppliers or licensors), or altering the terms and conditions of the applicable license agreement governing the use of IBM software. Please Recycle BLC03060-USEN-00 17
  21. 21. Marine Institute Ireland Putting real-time data to work and providing a platform for technology development When sensors become pervasive, entirely new and unexpected uses forOverview the flood of information they produce often arise, yielding benefits far beyond those originally envisioned. Seeing the world in a new way—The need via technology—generates an inventive spark, prompting people toThe Marine Institute sought to establishSmartBay as a research, test and devise new uses for information that they may never have considereddemonstration platform for new envi- before.ronmental technologies—paving theway to commercialization and thedevelopment of new markets for Irish- That’s exactly what is happening in Ireland’s Galway Bay, as part of thebased companies. SmartBay project initiated by the Marine Institute Ireland. In support of its advanced technology platform, which seeks to make Ireland aThe solution major player in the development of smart ocean technologies, theThe Institute, working with IBM, devel-oped a pilot information system to feed project’s initial purpose was to develop a platform for testing environ-environmental data into a data ware- mental monitoring technologies, and the idea was simple: Deploy ahouse, where it is processed, analyzed series of radio-equipped “smart buoys” in the bay containing sensorsand displayed in new ways. that could collect data such as sea state (wave height and action) andWhat makes it smarter other weather conditions, water data such as salinity, and similar envi-The project yields greater insight into ronmental information.the bay environment, as well as provid-ing practical value—from understandinghow water quality impacts fisheries to A basis for economic transformationpredicting hazard locations and more. When the Marine Institute learned of the IBM Big Green Innovations initiative to find ways to use technology to promote and enable envi- ronmental science, the idea of a collaboration on the SmartBay project was born. The IBM Advanced Water Management Centre Dublin built upon the domain expertise of the Marine Institute, complimenting it with its deep computing intelligence. While the synergy with the IBM Smarter Planet™ strategy’s drive towards Smart Green technology was clear, the real impetus behind the decision to expand SmartBay is largely economic. Beginning in the 1990s, the Irish economy became a global growth powerhouse. Wise policy decisions and forward-thinking investment had transformed Ireland into a manufacturing phenomenon. More recently, with the global economy encountering difficulty, Ireland’s prosperity began to wane. The government saw the need to change course, moving the country towards a knowledge- based economy. Investment in projects that showcase Ireland as a tech- nological leader would not only create new commercial opportunities, 18
  22. 22. attract talent and additional capital investment, but also prompt a new Business benefits generation of Irish citizens to pursue careers in knowledge-based industries. ● Enables the creation of a vast array of diverse applications that goes far beyond the original purpose of the Taking SmartBay to a new level project, from technical research to The Marine Institute, working in conjunction with government tourism promotion agencies, research institutions and the private sector, is working ● Real-time access via the web together to leverage the significant R+D capacity that exists in Ireland delivers valuable insight quickly to remote users to help drive economic development. There is clear potential to expand SmartBay into an international platform demonstrating new ● Open architecture enables new appli- cations to be brought on line easily, approaches to environmental challenges and delivering new technolog- combining data from both SmartBay ical solutions for a range of global markets. sensors and other sources, such as geographical information systems IBM is working with the Marine Institute to speed the process of inno- ● Add-on effect of the project promotes vation, starting with an assessment of existing capabilities. The team education and stimulates economic development in the Irish economy saw that if the data could be centralized, processed and accessed in the right way, it could become far more useful—the information already available could be turned into intelligence and put to work to create real practical value that impacts the lives of citizens directly. IBM designed and deployed an enterprise-scale data warehouse using IBM InfoSphere™ Warehouse, that is connected to the SmartBay sensors, as well as external sources such as mapping databases and sensors beyond the bay. An open-standards application layer processes and analyzes the data in a variety of ways, making it available via a Web interface enabled by IBM WebSphere® Portal and WebSphere Application Server. Additional WebSphere products, including WebSphere MQ and WebSphere Sensor Events, provide a key middleware layer that integrates the sensors with the data warehouse. To ensure reliability and scalability, the system is housed on IBM System x® 3950 servers.Smarter water: Creating new value from environmental data Instrumented Sensors deployed on buoys in Galway Bay transmit key data on ocean conditions and water quality. Interconnected Sensor data is fed into a central data warehouse for aggregation and processing, and can be accessed by diverse groups using customized web applications to generate targeted value. Intelligent Combining real-time data with a flexible technology platform creates near-limitless new uses for information—from environmental research to predictive monitoring, technology validation and much more. 2 19
  23. 23. The system design makes it easy to combine data from the sensors Solution components with other online databases—such as geographical information—as needed to create new functionality. Rapid development, enabled by Software IBM DB2® Alphablox® is an important feature, giving project man- ● IBM DB2® Alphablox® v9.5 ● IBM DB2 Enterprise Server agers the ability to deploy new applications quickly and easily. Edition v9.5 ● IBM InfoSphere™ Streams The project yields greater insight into the bay environment and can ● IBM WebSphere® Application Server v6.1 provide real-time information feeds to a range of stakeholders, while at ● IBM WebSphere MQ v5 the same time enabling commercial technology developers to test new ● IBM WebSphere Sensor Events environmental product and service offerings. The project is now mov- IBM WebSphere Portal Server v6.1 ing into a new phase, with higher bandwidth and powered cabled ● Servers sensors being deployed that will enable more information to be ● IBM System x® 3950 gathered. IBM is also working with Irish-based companies on an advanced initiative to add stream (i.e., real-time) computing capabilities Services ● IBM Global Business Services® to the project, with the goal of increasing its capacity utilizing the real- time analytical processing capacity of InfoSphere Streams. Applications limited only by imagination“The immediate benefits As the IBM and Marine Institute team began to map out the of SmartBay, whether possibilities for delivering information and services via the SmartBay portal, more and more potential new uses began to spring up. it’s helping and support- Stakeholders—the harbormaster, fishermen, researchers, tourism offi- ing industrial develop- cials and others—were all part of the brainstorming process. The ment or promoting SmartBay vision was quickly expanding far beyond its initial goals. marine safety, are The variety of applications either deployed or under consideration for tangible, direct and SmartBay is strong testament to the power of creative thinking enabled by the right technological tools. The critical element is the ability to worthwhile.” analyze, process and present the data in a useful form, tailored to the needs of specific users. For example: —John Gaughan, project coordinator, SmartBay ● Technology developers can conduct a variety of sophisticated studies remotely and in near real time, instead of retroactively. Climate researchers, using sensors on land paired with sensors in the bay, can learn about the exchange of CO2 across the land-sea interface, and marine biologists can use acoustic sensors deployed throughout the bay to assess marine mammal populations. ● Alternative energy developers can access real-time wave data and use it to determine the effectiveness of prototype wave-energy gener- ators, and developers of new sensor technologies can deploy proto- types on the buoys to find out how well the hardware holds up in a harsh marine environment, with continuous monitoring. ● The project can also promote commercial interests. Fishermen can use environmental data to tell them when to put to sea. Fishery managers can monitor and track water quality issues, gaining a com- prehensive view of actual conditions throughout the bay. 3 20
  24. 24. ● Applications developed as part of the SmartBay project can also help increase public safety. Mariners who spot floating objects that pose a hazard to navigation can report the location, and the system will combine this information with geographic data, real-time weather, current, and tide data to predict the path and position of the hazard hours in advance. Collaboration with the Galway harbormaster has also enabled the creation of an expert system based on human expert- ise that can issue flood warnings more promptly and accurately than he can himself, based on real-time weather, sea state and tidal information.Gaughan says the project provides a positive benefit in many areas.“The immediate benefits of SmartBay, whether it’s helping and sup-porting industrial development or promoting marine safety, aretangible, direct and worthwhile.”For more informationTo learn more about how IBM can help you transform your business,please contact your IBM sales representative or IBM Business Partner.Visit us at:●●© Copyright IBM Corporation 2010IBM Corporation1 New Orchard RoadArmonk, NY 10504U.S.A.Produced in the United States of AmericaNovember 2010All Rights ReservedIBM, the IBM logo,, Let’s Build A Smarter Planet, the planet icons,AlphaBlox, DB2, Global Business Services, InfoSphere, System x and WebSphereare trademarks of International Business Machines Corporation, registered in manyjurisdictions worldwide. Other product and service names might be trademarks ofIBM or other companies. A current list of IBM trademarks is available on the web case study illustrates how one IBM customer uses IBM products. There is noguarantee of comparable results.References in this publication to IBM products or services do not imply thatIBM intends to make them available in all countries in which IBM operates. Please Recycle ODC03150-USEN-00 21
  25. 25. a jStart™ using Big Data to identify Big Opportunities in retail case study helping companies deliver the web experience their customers want. At a Glance There is a ―Big Data‖ challenge in the e-commerce industry with the explo- sive growth of social networking sites. With 700 million users on Face- book—expected to reach 1 billion in 2011, and Twitter up to 140 million tweets per day, retailers are trying to reach their customers and understand their shopping habits better using these channels. Without social analytics, online retailers risk becoming a victim to this deluge of data – unable to“We are able to vastly improve the make sense out of the massive volume of product data and customer feed- online shopping experience by back, or even able to respond to it in a timely way. responding almost instantly to customers and delivering the Working with IBM’s jStart™ team, Technovated created a system that uses products they want to purchase IBM BigSheets to reduce manual processes while simultaneously tackling at a very attractive price point.” the ―Big Data‖ challenges that many online retailers experience. -Gareth Knight Providing a Big Data Edge CEO, Technovated Technovated is able to respond to shoppers instantly based on customers’ latest product searches, blog posts and tweets about recent purchases. Using this valuable consumer insight, Technovated can automatically set up new online stores in a matter of days to deliver shoppers with the prod- ucts they are searching for at a competitive price point. It used to take six See how IBM using analytics to create Smarter Retail 22
  26. 26. a jStart™ case study weeks to put products up for sale online. Now, using IBM technology com-About Techovated bined with Technovated’s know-how, it takes a few days.jStart works with a wide variety of clients and custom-ers, but frequently, we find some of the best partner-ships to be with startups. Technovated is very much Enter Big Data Analyticsa partner in that vein. With offices in London andJohannesburg, Technovated describes itself this By using IBM BigSheets, Technovated plans to jump-start its businessway: ―we are able to vastly improve the online shop- growth. Starting off its Web stores with a few thousand product stock-ping experi- keeping units (SKUs), Technovated will quickly be able toence by cull through terabytes of data to set up niche e-commercerespondingalmost instantly to customers and delivering the sites ranging from office chairs to running shoes.products they want to purchase at a very attractive IBM BigSheets is a system developed by IBM’s Emerging Internet Technol-price point.‖ The Technovated team is focused on ogies group to allow for the easy and quick exploration of big data. If you’releveraging the latest technologies to give them—andtheir customers—a competitive edge. In this case, wondering what your data may be trying to tell you, BigSheets is a greatutilizing IBM Big Data technologies, like BigSheets, to place to start—since any line-of-business professional can manipulate theprovide capabilities and business opportunities that tool to identify and take action onsimply didn’t exist for SMB’s until today. opportunities which may reside in the data, itself. Since BigSheetsget started with jStart: can merge data from numerousDavid Sink sources, your company can obtainProgram Director, jStart Team a high level overview of what’sIBM Emerging possible with the data available—Tel: 919.254.4648 and the opportunity to act on thoseEd Elze insights.Manager, Bus. Dev., Strategy & Client EngagementjStart Team, IBM Emerging Technologies The jStart team also has 360.866.0160 experience with IBM data analytics technologies and solutions as well.Jim Smith By leveraging these technologies, your business could extract informationManager, Client Engagements, Chief ArchitectjStart Team, IBM Emerging Technologies from publicly available sources, internal data sources, and partner sources, and use them to identify patterns, markets, and opportunities toTel: 919.387.6653 make the sale. In the end, big data can help identify big opportunities forJohn Feller retail. Ready to get started? jStart is. Contact us today.Manager, DevelopmentjStart Team, IBM Emerging 919.543.7971 Who is jStart?Learn More: jStart is a highly skilled team focused on providing fast, smart, and business solutions leveraging the latest technologies. The team typically cuses on emerging technologies which have commercial potential within 18 months. This allows the team to keep ahead of the adoption curve, being prepared for client engagements and partnerships. The team’s focus in 2011 includes: big data, text analytics, and the commercialization of IBM’s Watson technologies. © Copyright IBM Corporation 2010, IBM Corporation Software Group, Route 100, Somers, NY 10589, USA. Produced in the United States of America, 06- 10, All Rights Reserved. IBM, the IBM logo, and jStart, are trademarks of International Business Machines Corporation in the United States, other coun- tries, or both. Other company, product, and service names may be trademarks or service marks of others. 23
  27. 27. Big Data ProfilesIBM Software Group TerraEchos and IBM Streaming data technology supports covert intelligence and surveillance sensor systems A leading provider of covert intelligence and surveillance sensor systems, TerraEchos, Inc., helps organizations protect and monitor critical infrastructure Overview and secure borders. One T erraEchos client is a science-based, applied The need engineering national laboratory dedicated to supporting the U.S. Department U.S. Department of Energy (DOE) of Energy in nuclear and energy research, science and national defense. Research lab needed a solution to protect and monitor critical infrastructure and secure its perimeters and border areas. One of the lab’s initiatives is to be the first to develop safe, clean and The solution reliable nuclear power. Another is to investigate and test emerging IBM Business Partner, TerraEchos, capabilities for the production, manufacturing, conveyance, transmission implemented an advanced security and consumption of renewable energy, such as solar and wind power. and covert surveillance system based Securing the scientific intelligence, technology and resources related to on the TerraEchos Adelos S4 System with IBM InfoSphere Streams software these initiatives is vital. Protecting and sustaining the resiliency and and IBM BladeCenter hardware. operational reliability of the country’s power infrastructures—from natural The benefit disasters, cyber attacks and terrorism—are matters of national and Captures and analyzes huge volumes homeland security. of real-time, streaming, acoustical data from sensors around research Protecting its work and securing America’s energy future are responsibilities lab perimeters and borders, providing unprecedented insight to detect, classify, the lab takes seriously. To this end, it needed a technology solution that would locate, track, and deter potential threats. detect, classify, locate and track potential threats—both mechanical and biological; above and below ground—to secure the lab’s perimeters and border areas. This solution would provide scientists with more situational awareness and enable a faster and more intelligent response to any threat. Distinguishing the sound of a whisper from the wind even from miles away The requirements of the ideal solution were considerable. The solution would have to continuously consume and analyze massive amounts of information-in-motion, including the movements of humans, animals and the atmosphere, such as wind. In addition, because scientists lacked time to record the data and listen to it later, the solution had to gather and analyze information simultaneously. 24
  28. 28. Big Data ProfilesIBM Software Group Once analyzed, scientists could extract meaningful intelligence, as well as Solution components: verify and validate the data, such as distinguishing between the sounds of a trespasser versus a grazing animal. T put the sophistication of the needed o Software technology into perspective, the data consumption and analytical requirements • IBM® InfoSphere® Streams would be akin to listening to 1,000 MP3 songs simultaneously and successfully Server discerning the word “zero” from every song—within a fraction of a second. • IBM BladeCenter® servers The solution would also serve as the lab’s central nervous system and would have to meet strict technical requirements, including: • Interoperability, allowing sensors to work with other sensor types— such as video data—and enabling scientists to collect an array of data and create a holistic view of a situation. • Scalability to support new requirements as the lab’s fiber-optic arrays, surveillance areas, and security perimeters change. • Extensibility, serving as a framework to fit into the lab’s existing IT architecture and integrating with signal processors and mobile and mapping applications. To meet these requirements, the lab sought to implement and deploy an advanced security and surveillance system. Advanced fiber-optics combine with real-time streaming data The lab turned to IBM® Business Partner, T erraEchos, to implement an advanced security and covert surveillance system based on its TerraEchos Adelos S4 System, IBM InfoSphere® Streams software and IBM BladeCenter® servers. InfoSphere Streams is part of the IBM big data platform. TerraEchos selected InfoSphere Streams as the engine that processes approximately 1,600 megabytes of data in motion continually generated from fiber optic sensor arrays. The processing capacity of InfoSphere Streams enables Adelos to analyze all of the data streaming from the sensors. In addition, the technology enables Adelos to match the sound patterns against an extensive library of algorithms, giving TerraEchos the most robust classification system in the industry. The Adelos S4 solution is based on advanced fiber-optic acoustic sensor technology licensed from the United States Navy. Using InfoSphere Streams as the underlying analytics platform, the Adelos S4 solution analyzes highly unstructured audio data in real time before the audio signals are stored in the database. InfoSphere Streams allows multiple sensor types and associated streams of structured and unstructured data to be integrated into a fused intelligence system for threat detection, classification, correlation, prediction and communication by means of a service-oriented architecture (SOA). 2 25