Whatever the size or type of organization, Big Data has permeated our transportation industry. It is no longer a question of IF Big Data will be useful, but instead WHY is it useful and HOW can we best apply it. This presentation aims to address how we can leverage existing services and available partnerships in transportation, consider new and emerging technologies, and determine strategy for what’s to come in transportation, including connected and autonomous vehicles. While it may be a huge challenge to solve transportation problems with Big Data, it can help us make better travel decisions today and plan for better infrastructure tomorrow.
CITE Start Thinking Big Data 2019 01-30 FINALJon Kostyniuk
Whatever the size or type of organization, Big Data has permeated our transportation industry. It is no longer a question of IF Big Data will be useful, but instead WHY is it useful and HOW can we best apply it. This presentation aims to address how we can leverage existing services and available partnerships in transportation, consider new and emerging technologies, and determine strategy for what’s to come in transportation, including connected and autonomous vehicles. While it may be a huge challenge to solve transportation problems with Big Data, it can help us make better travel decisions today and plan for better infrastructure tomorrow.
Sotiris is currently working as Research Director with the Institute of Computer Science at the Foundation for Research and Technology - Hellas, where his research interests include systems, networks, and security. He is also a member of the European Union Agency for Network and Information Security (ENISA) Permanent Stakeholders Group! During Data Science Conference, Sotiris will talk about how data sharing between private companies and research facilities may lead to monetization.
The document reviews several standards for smart cities from organizations like ISO, ITU, and BSI. It summarizes 6 standards and specifications in detail, including the ISO 37120 standard for measuring city services and quality of life, ISO/DIS 37101 for planning and managing smart city initiatives, and technical reports from ITU on smart sustainable cities and key performance indicators. It also mentions other standards from BIS on smart city terminology, overview documents, and planning guidelines. The document concludes that while metrics are becoming standardized, a comprehensive map of all possible smart city interventions is still needed.
3 Business Cases on top of the Lynx Legal Knowledge GraphLynx Project
The main objective of the Lynx research and innovation project is to create an ecosystem of smart cloud services to better manage compliance, based on a Legal Knowledge Graph (LKG) that integrates and links multilingual and heterogeneous compliance data sources including legislation, case law, standards, regulations and other private contracts, besides others.
This webinar will provide insights into (i) problem statement and requirements, (ii) business cases (iii) and technical solutions as well as (iv) showcase demos of the 3 compliance related Pilots of the Lynx project that are based and implemented on the Lynx Services Platform (LySP). These are (a) question-answering solution in the field of labour law by the Spanish law firm Cuatrecasas, (b) contract analysis and management by the Austrian legaltech startup Cybly, and finally (c) the geothermal energy compliance recommender by the Norwegian consulting company DNV.GL.
L'economia europea dei dati. Politiche europee e opportunità di finanziamento...Data Driven Innovation
L'economia europea dei dati: soluzioni politiche e giuridiche per realizzare un'economia dei dati a livello di Unione Europea, nell'ambito della strategia per il mercato unico digitale. La consultazione pubblica 'Building the European Data Economy'. Il paternariato pubblico privato (PPP) Big Data Value ed opportunità di finanziamento in Horizon 2020. L'incubatore Data Pitch: opportunità per Start-up e Piccole e Medie Imprese.
This document provides an overview of the state of open data and open knowledge in Belgium. It discusses Open Knowledge Belgium's mission to promote openness through advocacy, research and technology. It then outlines some of the progress made in open data policies and portals in Belgium, Flanders, Wallonia and Brussels. It also notes that while much low hanging fruit has been achieved, new challenges remain around issues like algorithm ethics, open science business models, real-time data and linked data.
La telefonía móvil como fuente de información para el estudio de la movilidad...Esri España
Existe una multitud de sectores donde es necesario disponer de datos que permitan entender los patrones de comportamiento de la población: la planificación y la operación de los sistemas de transporte requiere información precisa, fiable y actualizada sobre la demanda de viajes; los patrones de actividad y movilidad de los turistas tienen profundas implicaciones para la planificación de infraestructuras, el desarrollo de la oferta turística y las estrategias de marketing turístico; entender el comportamiento espacial de los clientes es clave para optimizar las estrategias de distribución, comercialización y publicidad, determinar la localización de un nuevo comercio o punto de venta, o maximizar el retorno de la inversión en acciones de marketing. Las fuentes de datos tradicionales, basadas fundamentalmente en encuestas y registros administrativos, proporcionan información muy valiosa, pero no están exentas de inconvenientes. En general, las encuestas resultan caras y lentas de realizar, lo que limita el tamaño de la muestra y la frecuencia de actualización de la información, a lo que hay que añadir otras limitaciones intrínsecas, como las respuestas incorrectas e imprecisas, o la dependencia de la disposición a responder de los entrevistados. En los últimos años, la generalización del uso de dispositivos móviles ha abierto nuevas oportunidades para superar muchas de estas limitaciones. La posibilidad de recoger datos geolocalizados sobre la actividad de las personas, de manera dinámica y a un coste sensiblemente inferior al de los métodos tradicionales, abre la puerta a infinidad de aplicaciones. Las más evidentes son quizá las relacionadas con el transporte y la movilidad, pero el abanico es mucho más amplio, abarcando casi cualquier área que requiera información sobre los patrones de actividad y movilidad de la población. Las nuevas fuentes de datos plantean asimismo importantes retos, desde la necesidad de desarrollar nuevas metodologías de análisis, hasta la protección de la privacidad.
Vídeo de la ponencia: https://youtu.be/5PKC5Qm0eHM
CITE Start Thinking Big Data 2019 01-30 FINALJon Kostyniuk
Whatever the size or type of organization, Big Data has permeated our transportation industry. It is no longer a question of IF Big Data will be useful, but instead WHY is it useful and HOW can we best apply it. This presentation aims to address how we can leverage existing services and available partnerships in transportation, consider new and emerging technologies, and determine strategy for what’s to come in transportation, including connected and autonomous vehicles. While it may be a huge challenge to solve transportation problems with Big Data, it can help us make better travel decisions today and plan for better infrastructure tomorrow.
Sotiris is currently working as Research Director with the Institute of Computer Science at the Foundation for Research and Technology - Hellas, where his research interests include systems, networks, and security. He is also a member of the European Union Agency for Network and Information Security (ENISA) Permanent Stakeholders Group! During Data Science Conference, Sotiris will talk about how data sharing between private companies and research facilities may lead to monetization.
The document reviews several standards for smart cities from organizations like ISO, ITU, and BSI. It summarizes 6 standards and specifications in detail, including the ISO 37120 standard for measuring city services and quality of life, ISO/DIS 37101 for planning and managing smart city initiatives, and technical reports from ITU on smart sustainable cities and key performance indicators. It also mentions other standards from BIS on smart city terminology, overview documents, and planning guidelines. The document concludes that while metrics are becoming standardized, a comprehensive map of all possible smart city interventions is still needed.
3 Business Cases on top of the Lynx Legal Knowledge GraphLynx Project
The main objective of the Lynx research and innovation project is to create an ecosystem of smart cloud services to better manage compliance, based on a Legal Knowledge Graph (LKG) that integrates and links multilingual and heterogeneous compliance data sources including legislation, case law, standards, regulations and other private contracts, besides others.
This webinar will provide insights into (i) problem statement and requirements, (ii) business cases (iii) and technical solutions as well as (iv) showcase demos of the 3 compliance related Pilots of the Lynx project that are based and implemented on the Lynx Services Platform (LySP). These are (a) question-answering solution in the field of labour law by the Spanish law firm Cuatrecasas, (b) contract analysis and management by the Austrian legaltech startup Cybly, and finally (c) the geothermal energy compliance recommender by the Norwegian consulting company DNV.GL.
L'economia europea dei dati. Politiche europee e opportunità di finanziamento...Data Driven Innovation
L'economia europea dei dati: soluzioni politiche e giuridiche per realizzare un'economia dei dati a livello di Unione Europea, nell'ambito della strategia per il mercato unico digitale. La consultazione pubblica 'Building the European Data Economy'. Il paternariato pubblico privato (PPP) Big Data Value ed opportunità di finanziamento in Horizon 2020. L'incubatore Data Pitch: opportunità per Start-up e Piccole e Medie Imprese.
This document provides an overview of the state of open data and open knowledge in Belgium. It discusses Open Knowledge Belgium's mission to promote openness through advocacy, research and technology. It then outlines some of the progress made in open data policies and portals in Belgium, Flanders, Wallonia and Brussels. It also notes that while much low hanging fruit has been achieved, new challenges remain around issues like algorithm ethics, open science business models, real-time data and linked data.
La telefonía móvil como fuente de información para el estudio de la movilidad...Esri España
Existe una multitud de sectores donde es necesario disponer de datos que permitan entender los patrones de comportamiento de la población: la planificación y la operación de los sistemas de transporte requiere información precisa, fiable y actualizada sobre la demanda de viajes; los patrones de actividad y movilidad de los turistas tienen profundas implicaciones para la planificación de infraestructuras, el desarrollo de la oferta turística y las estrategias de marketing turístico; entender el comportamiento espacial de los clientes es clave para optimizar las estrategias de distribución, comercialización y publicidad, determinar la localización de un nuevo comercio o punto de venta, o maximizar el retorno de la inversión en acciones de marketing. Las fuentes de datos tradicionales, basadas fundamentalmente en encuestas y registros administrativos, proporcionan información muy valiosa, pero no están exentas de inconvenientes. En general, las encuestas resultan caras y lentas de realizar, lo que limita el tamaño de la muestra y la frecuencia de actualización de la información, a lo que hay que añadir otras limitaciones intrínsecas, como las respuestas incorrectas e imprecisas, o la dependencia de la disposición a responder de los entrevistados. En los últimos años, la generalización del uso de dispositivos móviles ha abierto nuevas oportunidades para superar muchas de estas limitaciones. La posibilidad de recoger datos geolocalizados sobre la actividad de las personas, de manera dinámica y a un coste sensiblemente inferior al de los métodos tradicionales, abre la puerta a infinidad de aplicaciones. Las más evidentes son quizá las relacionadas con el transporte y la movilidad, pero el abanico es mucho más amplio, abarcando casi cualquier área que requiera información sobre los patrones de actividad y movilidad de la población. Las nuevas fuentes de datos plantean asimismo importantes retos, desde la necesidad de desarrollar nuevas metodologías de análisis, hasta la protección de la privacidad.
Vídeo de la ponencia: https://youtu.be/5PKC5Qm0eHM
- ConTaaS is a novel contextualization architecture and technique for scaling up contextualization of internet-of-things data to internet scales.
- It employs prime factorization to efficiently contextualize large volumes of data from many IoT devices.
- The approach was implemented on Amazon EC2 cloud infrastructure and evaluated using synthetic data from Melbourne city datasets. It provides a way to represent, contextualize, and query large-scale IoT data.
Inspire Helsinki 2019 - Keynote Hanna Niemi HugaertsHannaHorppila
The Inspire Helsinki 2019 event brought together around 170 people from 29 countries to foster discussion and new ideas on how to realise the full potential of spatial data. The three-day event featured data challenges, practical hands-on workshops and future-oriented keynote presentations. The event was summed up in a panel discussion, in which perspectives on tackling remaining challenges were brought up.
The Digital Journey - A Local Government PerspectiveSocitm
This document discusses the digital journey for a local authority CIO. It outlines several technology disruptions like digital, big data, and the internet of things that are impacting local authorities. The CIO's role is shifting from tightly managing the ICT service to facilitating data sharing and being a community digital leader. Some principles for the CIO include standardizing systems, using open APIs and cloud architecture, and ensuring initiatives are customer-driven. The document cautions that the baseline for local government digital systems has yet to be established and suppliers do not fully recognize the implications of open-by-default approaches.
Tomáš Maršálek - Optimal course of IXP development – NIX.CZPROIDEA
This document discusses the optimal development of internet exchange points (IXPs), using NIX.CZ as a case study. It outlines that IXPs should be a neutral platform for interconnecting competitors and sharing experiences, not a battlefield or means to fulfill individual ambitions. NIX.CZ has transformed since being established in 1996 from a voluntary association to a commercial one with full-time staff. It now handles over 70Gbps of traffic monthly and has 89 members. Its success is attributed to professionalizing management, cooperation with other IXPs, and maintaining strategic location and neutrality. Risks include unsuitable people, market changes, and lack of technology advances, but NIX.CZ plans to build reserves and attract foreign networks while
The goal of TODE’2017 is to look into the future of RegTech and discuss key developments within the 10+year horizon. Participants will learn and discuss the requirements, challenges and solutions necessary to achieve transparent, efficient and global, trusted, open data ecosystems, responding to today’s market, regulatory, legal and technological developments. The conference sessions and panels will cross the industries of banking, insurance, pensions funds, investment firms, securities and other to enable connected view and analysis across legal, data and technological perspectives.
The Italian business graph: fueling innovation in financeCerved Group SpA
Cerved is an Italian data company that collects and analyzes complex data from over 50 sources to provide credit risk information, marketing solutions, and credit management services. Cerved began developing a graph database in 2011 to better analyze relationships between entities like companies, people, locations, and properties. Their graph platform now contains over 35 million nodes and 70 million relationships and is used for customer solutions, internal applications, and machine learning projects that apply graph algorithms to large datasets. Moving forward, Cerved aims to support more customer use cases and integrate additional customer data while also using their graph for internal search, feature creation, and machine learning applications.
Cerved is an Italian data and analytics company. It uses graph databases and network analysis to better understand complex relationships within data from over 50 sources. Cerved started using graph databases in 2011 to improve an algorithm, and has expanded usage to better link corporate data and power solutions through visualization and machine learning algorithms on graph data. Graph databases allow flexible and connected modeling of Cerved's data, and native graph storage and processing improves querying, integration, and analysis compared to relational databases.
The Open Data Institute (ODI) connects commercial, non-commercial, and government actors to address global challenges through the use of open data and a robust data infrastructure. The ODI works with sectors to identify how the web of data can impact businesses and the economy. It inspires innovation through various programs including training, startup acceleration, research, and events. The goal is to build a strong data infrastructure that enables open innovation on a global scale.
The document discusses Croatia's efforts towards e-government and a digital administration. It outlines the integration of base registries in 2012 to allow citizens to access official documents from any registry office. It also details the Central Salary System which calculates salaries for 250,000 public sector employees using the integrated RegZap registry. Further, it explains laws established to coordinate IT projects through the ProDII office and improve data sharing between systems using the Metaregister. Statistics are provided on growing use of the e-Citizens portal and its top 5 most used e-services. Finally, it outlines strategies in the e-Croatia 2020 plan to improve interoperability, develop cloud infrastructure and sectoral e-services, as
Presentation about how individuals can manage their own personal data for use in dealing with the organisations they have interactions with. Presented to the annual conference of the public sector IT management organisation, Socitm, on 11 October 2010
This document provides information about Haluk Demirkan's background and experience. It summarizes his educational and professional qualifications, including over 10 years of research and higher education experience, as well as over 15 years of consulting and executive education experience. It also lists some of his academic accomplishments such as over 150 publications and research funded by several major companies. Finally, it provides details on some of the education topics he teaches related to services, information technology, and project management.
Service oriented architecture (SOA) deserves service oriented dataShahid Shah
Centralized, monolithic databases primarily built using relational approaches have ruled for decades; they’ve given us tremendous advances such as vertically scaled business-critical transactional systems and web applications. The next generation of microapps, microservices, and web widgets demand a scale that vertical scale application-centric relational databases are having difficulty with so we need to move to a more service-oriented database approach in which even small services like those that service patients in a patient portal or specific modules of EHRs can and should have their own databases.
This talk encourages the idea of service-focused databases and how they differ from application-centric databases; using this new approach allows faster delivery of applications, less coupling, and better scalability. Healthcare and biomedical databases are notoriously complex and no single database technology can serve its needs so we need a more service-oriented approach to database design.
You’ll learn how to choose the right database technology for each service, how to model service-oriented databases differently than application-oriented ones, and how to keep service databases running smoothly.
Open-IX: Improving interconnection through industry standardsInternet Society
The Open-IX Association (OIX) develops common standards for internet exchanges (IXPs) and data centers to improve global interconnection. It establishes committees to develop standards for technical requirements, operations, and certification. The OIX-1 and OIX-2 standards cover infrastructure, operations, and transparency requirements. Companies can apply for certification by implementing the standards, which helps network operators identify compliant organizations. Several international IXPs and data centers have already achieved OIX certification.
This document discusses service system engineering from a computer science perspective. It describes modeling service systems using the Linked USDL language to provide formal yet readable descriptions. It outlines developing Linked USDL to model different aspects of service systems like pricing, SLAs, and interactions. The document also discusses using Linked USDL for applications like cloud service aggregation and developing a service cloud platform. Finally, it lists resources and next steps around areas like service analytics, service network analysis, and developing a textbook on service systems.
myTask - crowdsourcing for field marketing | EasyData - Denis Slabakov
The presentation introduces two SaaS solutions from myTask Ltd: myTask, an online platform for crowdsourcing information collection across locations, and EasyData, an innovative mobile workforce management system. MyTask allows companies to assign tasks to a network of over 5,000 agents across Russia to gather field data, eliminating multiple intermediaries. EasyData provides cloud-based task assignment and monitoring of remote employees. Both platforms offer cost-effective alternatives to traditional field research and workforce systems. Major companies have adopted the solutions to streamline data collection and management of mobile staff.
The document discusses the growth of connectivity and the internet of everything. It notes that connectivity is growing 5 times faster than electricity or telephony. By 2020 there will be over 50 billion connected devices. The internet of everything will drive value through improved supply chains, customer experience, employee productivity and more. Trillions of dollars in value are expected to be created as more things and people get connected. The document discusses how Cisco is positioned to help enable cities, businesses and other organizations leverage connectivity and the internet of everything through its platforms and solutions.
The Digital Side Of Startup Ecosystem Development GEC 2018 istanbulGrow VC Group
The digital economy requires economic development and digital development to be understood and be operated closely together for ecosystem orchestration.
In this session, we explore how to unbundle and connect application silos, to build connectivity between applications to make valuable data to flow within and between ecosystems. What practical steps are required and who should be involved?
We explore learning from other industries to help imagine use and concepts of digital in ecosystem development and orchestration context and share our own key learnings of digital from several ecosystems around the world.
Big Data PPP Industrial Data Platforms - Towards cross-sectorial optimization and traceability
To start identifying synergies and to learn how different projects will address key data collection, sharing, integration, and exploitation challenges, a series of webinars have been organized under the umbrella of this Big Data Value PPP. These webinars are also organized by BDVA, BDVe project, and other projects which are part of this PPP.
This presentation gives a brief introduction to blockchain and proposes a unified analytical framework for trustable machine learning and automation running with blockchain.
Neo4j GraphTalks - Einführung in GraphdatenbankenNeo4j
The document announces a Neo4j GraphTalks event in October 2016 in Berlin. It includes an agenda with presentations on ADAMA's use of Neo4j for data sharing and knowledge management, and their experiences implementing and demoing Neo4j. There will also be an open networking session with NeoTechnology and PRODYNA representatives.
This document provides an overview of Neo4j, including:
- Neo4j is a graph database company with over 260 employees and $80M in funding.
- It has over 10M downloads and 275+ enterprise customers, including top retail, financial, and software firms.
- Neo4j is used for common use cases like recommendations, fraud detection, knowledge graphs, and master data management by customers like NASA, eBay, Walmart, and Marriott.
- The document describes several customer case studies and how Neo4j helped organizations like NASA, ICIJ, and others solve problems and gain insights.
This document summarizes key findings from research on smart city best practices in 22 cities. It identifies three common routes cities take to becoming smart - the anchor, platform, and beta city models. It also outlines common technology enablers, challenges cities face, and examples of smart living, safety, and sustainability applications seen across different cities.
- ConTaaS is a novel contextualization architecture and technique for scaling up contextualization of internet-of-things data to internet scales.
- It employs prime factorization to efficiently contextualize large volumes of data from many IoT devices.
- The approach was implemented on Amazon EC2 cloud infrastructure and evaluated using synthetic data from Melbourne city datasets. It provides a way to represent, contextualize, and query large-scale IoT data.
Inspire Helsinki 2019 - Keynote Hanna Niemi HugaertsHannaHorppila
The Inspire Helsinki 2019 event brought together around 170 people from 29 countries to foster discussion and new ideas on how to realise the full potential of spatial data. The three-day event featured data challenges, practical hands-on workshops and future-oriented keynote presentations. The event was summed up in a panel discussion, in which perspectives on tackling remaining challenges were brought up.
The Digital Journey - A Local Government PerspectiveSocitm
This document discusses the digital journey for a local authority CIO. It outlines several technology disruptions like digital, big data, and the internet of things that are impacting local authorities. The CIO's role is shifting from tightly managing the ICT service to facilitating data sharing and being a community digital leader. Some principles for the CIO include standardizing systems, using open APIs and cloud architecture, and ensuring initiatives are customer-driven. The document cautions that the baseline for local government digital systems has yet to be established and suppliers do not fully recognize the implications of open-by-default approaches.
Tomáš Maršálek - Optimal course of IXP development – NIX.CZPROIDEA
This document discusses the optimal development of internet exchange points (IXPs), using NIX.CZ as a case study. It outlines that IXPs should be a neutral platform for interconnecting competitors and sharing experiences, not a battlefield or means to fulfill individual ambitions. NIX.CZ has transformed since being established in 1996 from a voluntary association to a commercial one with full-time staff. It now handles over 70Gbps of traffic monthly and has 89 members. Its success is attributed to professionalizing management, cooperation with other IXPs, and maintaining strategic location and neutrality. Risks include unsuitable people, market changes, and lack of technology advances, but NIX.CZ plans to build reserves and attract foreign networks while
The goal of TODE’2017 is to look into the future of RegTech and discuss key developments within the 10+year horizon. Participants will learn and discuss the requirements, challenges and solutions necessary to achieve transparent, efficient and global, trusted, open data ecosystems, responding to today’s market, regulatory, legal and technological developments. The conference sessions and panels will cross the industries of banking, insurance, pensions funds, investment firms, securities and other to enable connected view and analysis across legal, data and technological perspectives.
The Italian business graph: fueling innovation in financeCerved Group SpA
Cerved is an Italian data company that collects and analyzes complex data from over 50 sources to provide credit risk information, marketing solutions, and credit management services. Cerved began developing a graph database in 2011 to better analyze relationships between entities like companies, people, locations, and properties. Their graph platform now contains over 35 million nodes and 70 million relationships and is used for customer solutions, internal applications, and machine learning projects that apply graph algorithms to large datasets. Moving forward, Cerved aims to support more customer use cases and integrate additional customer data while also using their graph for internal search, feature creation, and machine learning applications.
Cerved is an Italian data and analytics company. It uses graph databases and network analysis to better understand complex relationships within data from over 50 sources. Cerved started using graph databases in 2011 to improve an algorithm, and has expanded usage to better link corporate data and power solutions through visualization and machine learning algorithms on graph data. Graph databases allow flexible and connected modeling of Cerved's data, and native graph storage and processing improves querying, integration, and analysis compared to relational databases.
The Open Data Institute (ODI) connects commercial, non-commercial, and government actors to address global challenges through the use of open data and a robust data infrastructure. The ODI works with sectors to identify how the web of data can impact businesses and the economy. It inspires innovation through various programs including training, startup acceleration, research, and events. The goal is to build a strong data infrastructure that enables open innovation on a global scale.
The document discusses Croatia's efforts towards e-government and a digital administration. It outlines the integration of base registries in 2012 to allow citizens to access official documents from any registry office. It also details the Central Salary System which calculates salaries for 250,000 public sector employees using the integrated RegZap registry. Further, it explains laws established to coordinate IT projects through the ProDII office and improve data sharing between systems using the Metaregister. Statistics are provided on growing use of the e-Citizens portal and its top 5 most used e-services. Finally, it outlines strategies in the e-Croatia 2020 plan to improve interoperability, develop cloud infrastructure and sectoral e-services, as
Presentation about how individuals can manage their own personal data for use in dealing with the organisations they have interactions with. Presented to the annual conference of the public sector IT management organisation, Socitm, on 11 October 2010
This document provides information about Haluk Demirkan's background and experience. It summarizes his educational and professional qualifications, including over 10 years of research and higher education experience, as well as over 15 years of consulting and executive education experience. It also lists some of his academic accomplishments such as over 150 publications and research funded by several major companies. Finally, it provides details on some of the education topics he teaches related to services, information technology, and project management.
Service oriented architecture (SOA) deserves service oriented dataShahid Shah
Centralized, monolithic databases primarily built using relational approaches have ruled for decades; they’ve given us tremendous advances such as vertically scaled business-critical transactional systems and web applications. The next generation of microapps, microservices, and web widgets demand a scale that vertical scale application-centric relational databases are having difficulty with so we need to move to a more service-oriented database approach in which even small services like those that service patients in a patient portal or specific modules of EHRs can and should have their own databases.
This talk encourages the idea of service-focused databases and how they differ from application-centric databases; using this new approach allows faster delivery of applications, less coupling, and better scalability. Healthcare and biomedical databases are notoriously complex and no single database technology can serve its needs so we need a more service-oriented approach to database design.
You’ll learn how to choose the right database technology for each service, how to model service-oriented databases differently than application-oriented ones, and how to keep service databases running smoothly.
Open-IX: Improving interconnection through industry standardsInternet Society
The Open-IX Association (OIX) develops common standards for internet exchanges (IXPs) and data centers to improve global interconnection. It establishes committees to develop standards for technical requirements, operations, and certification. The OIX-1 and OIX-2 standards cover infrastructure, operations, and transparency requirements. Companies can apply for certification by implementing the standards, which helps network operators identify compliant organizations. Several international IXPs and data centers have already achieved OIX certification.
This document discusses service system engineering from a computer science perspective. It describes modeling service systems using the Linked USDL language to provide formal yet readable descriptions. It outlines developing Linked USDL to model different aspects of service systems like pricing, SLAs, and interactions. The document also discusses using Linked USDL for applications like cloud service aggregation and developing a service cloud platform. Finally, it lists resources and next steps around areas like service analytics, service network analysis, and developing a textbook on service systems.
myTask - crowdsourcing for field marketing | EasyData - Denis Slabakov
The presentation introduces two SaaS solutions from myTask Ltd: myTask, an online platform for crowdsourcing information collection across locations, and EasyData, an innovative mobile workforce management system. MyTask allows companies to assign tasks to a network of over 5,000 agents across Russia to gather field data, eliminating multiple intermediaries. EasyData provides cloud-based task assignment and monitoring of remote employees. Both platforms offer cost-effective alternatives to traditional field research and workforce systems. Major companies have adopted the solutions to streamline data collection and management of mobile staff.
The document discusses the growth of connectivity and the internet of everything. It notes that connectivity is growing 5 times faster than electricity or telephony. By 2020 there will be over 50 billion connected devices. The internet of everything will drive value through improved supply chains, customer experience, employee productivity and more. Trillions of dollars in value are expected to be created as more things and people get connected. The document discusses how Cisco is positioned to help enable cities, businesses and other organizations leverage connectivity and the internet of everything through its platforms and solutions.
The Digital Side Of Startup Ecosystem Development GEC 2018 istanbulGrow VC Group
The digital economy requires economic development and digital development to be understood and be operated closely together for ecosystem orchestration.
In this session, we explore how to unbundle and connect application silos, to build connectivity between applications to make valuable data to flow within and between ecosystems. What practical steps are required and who should be involved?
We explore learning from other industries to help imagine use and concepts of digital in ecosystem development and orchestration context and share our own key learnings of digital from several ecosystems around the world.
Big Data PPP Industrial Data Platforms - Towards cross-sectorial optimization and traceability
To start identifying synergies and to learn how different projects will address key data collection, sharing, integration, and exploitation challenges, a series of webinars have been organized under the umbrella of this Big Data Value PPP. These webinars are also organized by BDVA, BDVe project, and other projects which are part of this PPP.
This presentation gives a brief introduction to blockchain and proposes a unified analytical framework for trustable machine learning and automation running with blockchain.
Neo4j GraphTalks - Einführung in GraphdatenbankenNeo4j
The document announces a Neo4j GraphTalks event in October 2016 in Berlin. It includes an agenda with presentations on ADAMA's use of Neo4j for data sharing and knowledge management, and their experiences implementing and demoing Neo4j. There will also be an open networking session with NeoTechnology and PRODYNA representatives.
This document provides an overview of Neo4j, including:
- Neo4j is a graph database company with over 260 employees and $80M in funding.
- It has over 10M downloads and 275+ enterprise customers, including top retail, financial, and software firms.
- Neo4j is used for common use cases like recommendations, fraud detection, knowledge graphs, and master data management by customers like NASA, eBay, Walmart, and Marriott.
- The document describes several customer case studies and how Neo4j helped organizations like NASA, ICIJ, and others solve problems and gain insights.
This document summarizes key findings from research on smart city best practices in 22 cities. It identifies three common routes cities take to becoming smart - the anchor, platform, and beta city models. It also outlines common technology enablers, challenges cities face, and examples of smart living, safety, and sustainability applications seen across different cities.
A presentation on how technology will impact on the future of East Sussex. Why build more roads when self-drive vehicles are coming? How will 3D printing impact on local production? How will the world of work look and do we have the work spaces to accomodate it? What jobs will be replaced by technology, and where will the new jobs come from? Should we start preparing for 5G now?
This presentation was given to the Team East Sussex board in July 2017 (TES are part of the South East Local Enterprise Partnership - SELEP)
Smart City: A Call for a Shift in MindsetCharles Mok
Charles Mok argues that Hong Kong needs a shift in mindset to become a truly smart city. He outlines opportunities that open data presents for improving transportation systems by allowing real-time traffic information. However, Hong Kong currently lacks open data and data sharing between government departments. Mok calls for increased coordination, updated laws, and a review of current infrastructure to allow innovation. The priorities for developing Hong Kong as a smart city include talent, funding, culture, infrastructure, markets, and reducing barriers between government and innovation.
IoT Semantic Interoperability: Keynote at Haystack Connect 2017Milan Milenkovic
Title: "IoT Semantic Interoperability and Project Haystack: Beginning of a Beautiful Friendship"
Definition and types of of interoperability, importance, standards, proposed cross-domain approach and feasibility POC.
Advanced Analytics and Machine Learning with Data VirtualizationDenodo
Watch here: https://bit.ly/3719Bi7
Advanced data science techniques, like machine learning, have proven an extremely useful tool to derive valuable insights from existing data. Platforms like Spark, and complex libraries for R, Python and Scala put advanced techniques at the fingertips of the data scientists. However, these data scientists spent most of their time looking for the right data and massaging it into a usable format. Data virtualization offers a new alternative to address these issues in a more efficient and agile way.
Attend this webinar and learn:
-How data virtualization can accelerate data acquisition and massaging, providing the data scientist with a powerful tool to complement their practice
- How popular tools from the data science ecosystem: Spark, Python, Zeppelin, Jupyter, etc. integrate with Denodo
- How you can use the Denodo Platform with large data volumes in an efficient way
-About the success McCormick has had as a result of seasoning the Machine Learning and Blockchain Landscape with data virtualization
The document discusses digital transformation and innovation. It covers 6 sessions: (1) the building blocks of digital transformation, (2) data and the data revolution, (3) digital economics, (4) decision making, (5) innovation, and (6) data storytelling. The agenda focuses on understanding clients and stakeholders to improve knowledge, leverage transactions, and measure efficiency and effectiveness. Digital transformation is driven by properties, platforms, people and practices changing organizational conversations.
Bria Francesca. BCN Open Source, Agile Digital Transformation strategyFrancesca Bria
The document outlines Barcelona's digital city roadmap for 2017-2020. The objectives are to empower citizens through open source and agile transformation of city hall, develop a city data infrastructure to drive innovation, and diversify and strengthen the tech economy. Key initiatives include adopting agile methods, ensuring data and technological sovereignty for citizens, and launching flagship pilots like using big data for affordable housing and a central data analytics office. The roadmap aims to transform government and foster an open, participatory digital innovation ecosystem in Barcelona.
Using FME to Automate Data Integration in a CitySafe Software
Learn how the City of Coquitlam uses FME to solve diverse data integration challenges across multiple departments and projects, improving data sharing and accessibility between staff and contractors.
How Government Agencies are Using MongoDB to Build Data as a Service SolutionsMongoDB
The document discusses how government agencies are using MongoDB to build Data as a Service (DaaS) solutions. It provides examples of the Veterans Affairs using MongoDB for its VLER program to share veteran records, the Consumer Financial Protection Bureau using it for an open data platform, and the FCC using it for a mobile broadband speed test program. It also mentions the city of Chicago's use of MongoDB for a predictive analytics program called Windy Grid.
TripChain: A Peer-to-Peer Trip Generation DatabaseJon Kostyniuk
This is the presentation given by Jon Kostyniuk at the #ITEToronto2017 conference on August 1, 2017.
The TripChain framework represents an innovation in how trip generation data points can be stored and propagated for use within the transportation industry. This is accomplished through the implementation of a distributed, open peer-to-peer database for transportation professionals.
For more information, please visit http://tripchain.org/.
This document discusses methods for harnessing big data. It describes how sensors collect Internet of Things (IoT) data and how Volvo applies analytics. It also summarizes three methods: 1) The US Air Force uses an integrated data warehouse and geospatial analysis to track assets globally. 2) Siemens uses data discovery processes to predict train failures by analyzing sensor and failure report data. 3) Yahoo uses Hadoop as a data lake to store and analyze large amounts of user data from various sources like social media and clickstreams. The document emphasizes that no single technology is a silver bullet for big data.
This document discusses streaming data processing and the adoption of scalable frameworks and platforms for handling streaming or near real-time analysis and processing over the next few years. These platforms will be driven by the needs of large-scale location-aware mobile, social and sensor applications, similar to how Hadoop emerged from large-scale web applications. The document also references forecasts of over 50 billion intelligent devices by 2015 and 275 exabytes of data per day being sent across the internet by 2020, indicating challenges around data of extreme size and the need for rapid processing.
Webinar: How Penton Uses MongoDB As an Analytics Platform within their Drupal...MongoDB
NorthPoint Digital worked with the Penton and MongoDB teams to deliver a MongoDB based solution, Govalytics, to serve city and county governments. We will review the design decisions made and steps taken to implement and integrate into the existing digital platform.
In the session, we will review:
How Govalytics fits into Penton's entire digital platform?
What were the business drivers for choosing MongoDB (with Product Owner testimony) and why it was so successful?
How NorthPoint Digital implemented a complete, highly interactive UX solution powered by MongoDB as part of an integrated solution and not just as a database
Roadmap for the future – how the solution was designed to be independently scalable
Watch here: https://bit.ly/3i2iJbu
You will often hear that "data is the new gold". In this context, data management is one of the areas that has received more attention by the software community in recent years. From Artificial Intelligence and Machine Learning to new ways to store and process data, the landscape for data management is in constant evolution. From the privileged perspective of an enterprise middleware platform, we at Denodo have the advantage of seeing many of these changes happen.
Join us for an exciting session that will cover:
- The most interesting trends in data management.
- Our predictions on how those trends will change the data management world.
- How these trends are shaping the future of data virtualization and our own software.
This document provides an overview of big data and how to manage large amounts of data. It defines big data, discusses the characteristics of big data including volume, variety and velocity. It describes who generates big data and technologies that can be used to analyze big data like Hadoop, data warehousing and stream computing. The challenges of handling big data are also mentioned.
Open source, Agile Digital transformation BCNFrancesca Bria
The document outlines Barcelona's Digital City Roadmap for 2017-2020. The main objectives are to empower citizens through digital democracy and data sovereignty, transform government and public services through agile methods and open source technologies, and foster innovation through strengthening the tech sector and facilitating access to public procurement. Key initiatives include developing an open data portal, digital public services, and pilot projects leveraging data for affordable housing, healthcare, mobility and sustainability. The roadmap aims to advance Barcelona's position as a digital and technologically sovereign city.
GlobalLogic Java Community Webinar #16 “Zaloni’s Architecture for Data-Driven...GlobalLogic Ukraine
20 липня відбувся вебінар від Java Community – “Zaloni’s Architecture for Data-Driven Design” by Максим Дем’яновський — Software Engineer, GlobalLogic.
Доповідь надасть уявлення про Data-Driven Design, основні його переваги і практичну користь, а також покаже як його можна реалізувати на практиці.
The document is a presentation about data monetization and its opportunities for the tourism industry. It discusses what data monetization is, the constraints around it, and actual examples of data monetization. It notes that data monetization is the process of creating economic value from company data by selling it or using it internally. It presents data monetization as a great opportunity for the tourism industry to gain advantages from their data. The presentation concludes by encouraging attendees to start taking action to monetize their own data.
Similar to OTC Start Thinking BIG Data 2018 10-18 (20)
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Data and AI
Discussion on Vector Databases, Unstructured Data and AI
https://www.meetup.com/unstructured-data-meetup-new-york/
This meetup is for people working in unstructured data. Speakers will come present about related topics such as vector databases, LLMs, and managing data at scale. The intended audience of this group includes roles like machine learning engineers, data scientists, data engineers, software engineers, and PMs.This meetup was formerly Milvus Meetup, and is sponsored by Zilliz maintainers of Milvus.
4th Modern Marketing Reckoner by MMA Global India & Group M: 60+ experts on W...Social Samosa
The Modern Marketing Reckoner (MMR) is a comprehensive resource packed with POVs from 60+ industry leaders on how AI is transforming the 4 key pillars of marketing – product, place, price and promotions.
Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...Aggregage
This webinar will explore cutting-edge, less familiar but powerful experimentation methodologies which address well-known limitations of standard A/B Testing. Designed for data and product leaders, this session aims to inspire the embrace of innovative approaches and provide insights into the frontiers of experimentation!
The Building Blocks of QuestDB, a Time Series Databasejavier ramirez
Talk Delivered at Valencia Codes Meetup 2024-06.
Traditionally, databases have treated timestamps just as another data type. However, when performing real-time analytics, timestamps should be first class citizens and we need rich time semantics to get the most out of our data. We also need to deal with ever growing datasets while keeping performant, which is as fun as it sounds.
It is no wonder time-series databases are now more popular than ever before. Join me in this session to learn about the internal architecture and building blocks of QuestDB, an open source time-series database designed for speed. We will also review a history of some of the changes we have gone over the past two years to deal with late and unordered data, non-blocking writes, read-replicas, or faster batch ingestion.
The Ipsos - AI - Monitor 2024 Report.pdfSocial Samosa
According to Ipsos AI Monitor's 2024 report, 65% Indians said that products and services using AI have profoundly changed their daily life in the past 3-5 years.
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data LakeWalaa Eldin Moustafa
Dynamic policy enforcement is becoming an increasingly important topic in today’s world where data privacy and compliance is a top priority for companies, individuals, and regulators alike. In these slides, we discuss how LinkedIn implements a powerful dynamic policy enforcement engine, called ViewShift, and integrates it within its data lake. We show the query engine architecture and how catalog implementations can automatically route table resolutions to compliance-enforcing SQL views. Such views have a set of very interesting properties: (1) They are auto-generated from declarative data annotations. (2) They respect user-level consent and preferences (3) They are context-aware, encoding a different set of transformations for different use cases (4) They are portable; while the SQL logic is only implemented in one SQL dialect, it is accessible in all engines.
#SQL #Views #Privacy #Compliance #DataLake
Learn SQL from basic queries to Advance queriesmanishkhaire30
Dive into the world of data analysis with our comprehensive guide on mastering SQL! This presentation offers a practical approach to learning SQL, focusing on real-world applications and hands-on practice. Whether you're a beginner or looking to sharpen your skills, this guide provides the tools you need to extract, analyze, and interpret data effectively.
Key Highlights:
Foundations of SQL: Understand the basics of SQL, including data retrieval, filtering, and aggregation.
Advanced Queries: Learn to craft complex queries to uncover deep insights from your data.
Data Trends and Patterns: Discover how to identify and interpret trends and patterns in your datasets.
Practical Examples: Follow step-by-step examples to apply SQL techniques in real-world scenarios.
Actionable Insights: Gain the skills to derive actionable insights that drive informed decision-making.
Join us on this journey to enhance your data analysis capabilities and unlock the full potential of SQL. Perfect for data enthusiasts, analysts, and anyone eager to harness the power of data!
#DataAnalysis #SQL #LearningSQL #DataInsights #DataScience #Analytics
End-to-end pipeline agility - Berlin Buzzwords 2024Lars Albertsson
We describe how we achieve high change agility in data engineering by eliminating the fear of breaking downstream data pipelines through end-to-end pipeline testing, and by using schema metaprogramming to safely eliminate boilerplate involved in changes that affect whole pipelines.
A quick poll on agility in changing pipelines from end to end indicated a huge span in capabilities. For the question "How long time does it take for all downstream pipelines to be adapted to an upstream change," the median response was 6 months, but some respondents could do it in less than a day. When quantitative data engineering differences between the best and worst are measured, the span is often 100x-1000x, sometimes even more.
A long time ago, we suffered at Spotify from fear of changing pipelines due to not knowing what the impact might be downstream. We made plans for a technical solution to test pipelines end-to-end to mitigate that fear, but the effort failed for cultural reasons. We eventually solved this challenge, but in a different context. In this presentation we will describe how we test full pipelines effectively by manipulating workflow orchestration, which enables us to make changes in pipelines without fear of breaking downstream.
Making schema changes that affect many jobs also involves a lot of toil and boilerplate. Using schema-on-read mitigates some of it, but has drawbacks since it makes it more difficult to detect errors early. We will describe how we have rejected this tradeoff by applying schema metaprogramming, eliminating boilerplate but keeping the protection of static typing, thereby further improving agility to quickly modify data pipelines without fear.
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...sameer shah
"Join us for STATATHON, a dynamic 2-day event dedicated to exploring statistical knowledge and its real-world applications. From theory to practice, participants engage in intensive learning sessions, workshops, and challenges, fostering a deeper understanding of statistical methodologies and their significance in various fields."
Predictably Improve Your B2B Tech Company's Performance by Leveraging DataKiwi Creative
Harness the power of AI-backed reports, benchmarking and data analysis to predict trends and detect anomalies in your marketing efforts.
Peter Caputa, CEO at Databox, reveals how you can discover the strategies and tools to increase your growth rate (and margins!).
From metrics to track to data habits to pick up, enhance your reporting for powerful insights to improve your B2B tech company's marketing.
- - -
This is the webinar recording from the June 2024 HubSpot User Group (HUG) for B2B Technology USA.
Watch the video recording at https://youtu.be/5vjwGfPN9lw
Sign up for future HUG events at https://events.hubspot.com/b2b-technology-usa/
2. london.ca 2
• Jon Kostyniuk, P.Eng.
• Primarily transportation
modelling and forecasting
background.
• All models are wrong, but
some are useful.
• TripChain.Org project lead,
trip generation data using
blockchain technology.
• Interested in data, its value,
and how we can better apply
its usefulness.
Who am I?
3. london.ca 3
• It’s not just coming, it’s here.
• It will demand more attention with:
oConnected and Autonomous Vehicles (CAVs)
oMobility-as-a-Service (MaaS)
oDistributed Systems
• We’re not all going to be data experts, but we should
have some basic data literacy.
• Help increase data-driven decision making (DDDM).
• Data like a currency, can provide value when timely
transmitted; therefore, communication is key.
Why should I care about Big Data?
4. london.ca 4
• Volume: Vast amounts generated every second.
• Velocity: Not only speed generated, but speed
at which it moves around.
• Variety: Types of data to work with, not just
structured data anymore.
• Veracity: What is the trustworthiness of the
data?
• Value: How does the data bring business value?
Defining “Big Data”
6. london.ca 6
Existing Traffic Signal System
• Existing system from 2005, updated regularly,
becoming dated with limited abilities with RT aspirations.
• 401 existing signal locations connected to a central
system at City Hall.
• Limited real-time awareness at intersections without
context.
7. london.ca 7
• Video data collection – TMCs, ATRs.
• Bluetooth readers for travel time and
OD studies, limited use.
• Radar detection, including many of
over 40 permanent ATRs.
• Traffic data via Google Maps and
APIs, limited use/confidence
• Limitations to “snapshot in time”
data collection, but demand is
dynamic.
Existing Data Services
8. london.ca 8
• Our TIMMS project, may include:
oTSP to support RT,
o“Adaptive” Signals at key corridors,
oA modern TMC,
oCCTV monitoring, and/or
oTravel time monitoring and feedback.
• Need to manage expectations for new
system, not a panacea.
• Goals and objectives to include
development of a Data Strategy.
New System Planned
9. london.ca 9
• Public web app, road closure and
disruption information.
• Lots of detailed information, used
by many business units.
• Need to “get” information from
website, not everyone knows
about.
• Custom-build, used for past
decade, but somewhat dated.
• Currently reviewing its use cases
and future direction.
Renew London Web App
11. london.ca 11
• Better strategy to “push” data to
popular apps and services.
• Enhanced Renew London, added
data feed.
oWaze’s CIFS XML format
• Agnostic on third-party usage of
feed.
• Joined Waze’s CCP in April 2018,
now live.
• Use Waze’s web interfaces, third
party tools, or build your own.
Renew London Integration with Waze
Construction
Road
Closures
Collisions
Road
Hazards
12. london.ca 12
WARP Project
• Open-source traffic data solution by Louisville KY,
supported by OGC – QR to GitHub repo.
• Polls traffic data, dumps in database for historical
use.
• Processor features, in-development:
oHosted cloud service, database
oAPI endpoints, integration
oTraffic study tool, analytics
oInteractive map, visualization
13. london.ca 13
Train-Delay Warning System
• Testing sensors at 3 crossing locations.
• Participating in BCIP pilot program with
TRAINFO in 2019.
• Provide crossing insights along with
Bluetooth sensors.
• Gathers data, predicts when a train may
be present
• Early warning system via VMS to allow
drivers to choose better routes or other
“push” services.
14. london.ca 14
A blockchain is a distributed database
that is updated in near real-time, stored in
decentralized locations, and easy to
monitor. It has a level of security that
insures that no one party can modify a
database entry, because, in a sense,
everyone is watching.
Blockchain Technology
16. london.ca
Process
Characteristics
Blockchain Benefits
Consensus between
Multiple Parties
Enhanced coordination and choreography between
parties through a shared view of the latest data
status.
Reconciliation Master source of data instead of disparate data
sources that require constant validation and
reconciliation.
Data Lineage
(Temporal or 4D Data)
Complete traceability, ensuring integrity of data that is
continuously updated and maintained by multiple
parties.
Auditability Reliable and accurate audit trail with transparency of
the party responsible for each data change.
16
Blockchain Technology
18. london.ca
• CAV Technical
Background report,
May 2018
oQR Code report
link above.
• Council resolution,
develop a CAV
Strategic Plan.
18
Connected and Autonomous Vehicles
19. london.ca 19
• Some municipalities early pilots in SPaT and MAP
V2I connectivity.
oSignal phase and timing
oPhysical geometry of intersection
• Data partnerships with OEMs, other non-traditional
stakeholders.
oE.g., vehicle manufacturers, communications, ride
sharing, etc.
• Each CAV a valuable data source.
oHow can municipalities leverage the data?
oImpacts on liability, responsibility, etc.?
Connected and Autonomous Vehicles
20. london.ca 20
• Key points on data:
oThere are significant
privacy issues.
oTechnology
expertise is urgently
needed.
oFocus on digital
infrastructure.
Connected and Autonomous Vehicles
• PPSC Report, The Future of Automated Vehicles in
Canada, January 2018 – QR Code report link above.
• Governments should build data expertise and capacity.
22. london.ca 22
A plan designed to
improve all of the ways
you acquire, store,
manage, share, and
use data.
What is a Data Strategy?
23. london.ca 23
• Each component
independent and can
evolve as needed.
• Each component has an
individual set of skills and
capabilities.
• “Enterprise-class” strategy
not always needed.
• Complexity increases with
organizational scope.
• Establish goals for each
component.
Essential Data Strategy Components
Source: The 5 essential Components of a Data Strategy, SAS Institute Inc.
24. london.ca 24
Identify data and understand its meaning.
Provision data to be made available while
respecting rules and access guidelines.
Govern through policies and mechanisms to
ensure effective data usage.
Store persistent data in a structure and location
that supports access and processing.
Integrate by moving and combining data to
provide a unified view.
Core Components Defined
25. london.ca 25
• Do we currently partner with the right people?
• What are our data bottlenecks?
• How do we optimize our data collection and usage?
• Are we getting the most out of our equipment / services?
• Which data services are the most unreliable and why?
• Do we have the right IT systems in place?
• What parts of our operations could be more efficient?
• What are our core data competencies?
• What data skills gaps exist in our municipality?
• What are our key skills needed in the next two years?
Example Strategic Questions
26. london.ca 26
Big Data Skills
• Business Skills: Understanding
transportation services, including
communication and interpersonal.
• Analytical Skills: Spot patterns, cause and
effect, build models, etc.
• Computer Science: Hardware, software,
AI, programming, etc.
• Statistics and Mathematics: Determine
relevant data, sample sizes, algorithms, etc.
• Creativity: Ability to convey insights
effectively to an audience.
27. london.ca 27
• Well, maybe not completely… but, hello visualizations!!
• Consider best way to communicate data with audience.
• Insightful to add 4th dimension… time.
Goodbye Spreadsheets
28. london.ca 28
• Challenging to have staff and expertise.
• Organizational challenges – operations, structure, size.
• BDaaS viable alternative to do-it-yourself.
• Collect, store, analyze, and provide access
• Does service provider support your strategy?
• Cost of Big Data vs. traditional approach.
Big Data as a Service (BDaaS)
29. london.ca 29
Key Takeaways
• Like promises of the past, Big Data is
not a panacea, manage expectations.
• Plan ahead, use Big Data effectively.
• Beware of too many purpose-built apps.
• Informed travellers = better decisions
and less frustration.
“Fewer red lights, less stop and start and a
minimum of delays, those are the goals of a
computerized traffic control system in which London
may participate” ~ London Free Press, 1962
Thank You
I appreciate you all attending and listening to this talk.
I’d like to extend a special thank you to the OTC staff and directors, and in particular Scott Godwin and Doug Green, for giving me the opportunity to speak today.
Ask Questions to the Audience
How do you feel when you just get a green light only to be stopped again at a red light at the next intersection?
How do you feel when you turn the corner and see those flashing lights and railroad crossing bells ringing, not knowing how long you’ll be stuck there?
Who am I?
[Cover bullets on slide]
Why am I telling you about Big Data?
Big Data really is this undefined “blob” that means many things to many people – ongoing challenge.
Essentially, I wish to advocate that:
All municipalities are different (e.g. size, staffing, abilities, needs, etc.), so we should consider our Strategic Needs;
When data insights are provided to travellers in near real-time, better decisions can be made to reduce frustration; and
When presented effectively, DDDM can help us make better transportation planning and design decisions.
I will briefly cover:
Some of the conventional approaches we are taking at the City of London (which I’m sure is similar with many of us);
Look at some emerging and even fanciful technologies; and
Most importantly, overview how we can pursue developing a Data Strategy for our own municipalities.
Why should I care about Big Data?
It’s not just coming, it’s here.
It’s only going to demand more attention with the advent of:
Connected and Autonomous Vehicles (CAVs);
Mobility-as-a-Service (MaaS); and
Distributed Systems (i.e. Blockchain and Smart Contracts).
We’re not all going to be data experts (don’t be scared), but we should have some basic data literacy.
If applied effectively, Big Data can help increase data-driven decision making.
Data is like a currency and can provide value when it is timely transmitted to where it’s needed. Therefore, communication of data is key.
Defining “Big Data”
Generally, big data can be defined by one or more of these 5 “Vs”.
Volume
[Read slide…] We’re not just talking about gigabytes – but terabytes, petabytes, and even zettabytes. How do we comprehend let alone work with this?
Velocity
[Read slide…] Data, and especially real time data, is most useful when it gets to where it’s needed in a timely manner.
Variety
[Read slide…] While structured data, like SQL databases, will continue to retain importance, unstructured data such as audio, video, and photos are becoming more important.
Veracity
[Read slide…] Given the other Vs, we need to have a certain level of trust in our data sources to help us drive appropriate decisions.
Value
[Read slide…] If the data we are collecting does not help us gain relevant insights, why are we doing it?
Existing Traffic Signal System
Existing traffic signal system in-place since 2005 and updated regularly.
We have 401 traffic signal locations in current system, including both full signals and IPS.
All signals connected to a central system, can change timings from City Hall.
System becoming dated, coming to end of lifespan, and limited considering RT aspirations.
Limited real-time awareness at each intersection without context. [Talk about February 2018 flood.]
Existing Data Services
Typical practices compared with many medium-sized cities currently.
Video data collection for most common TMCs and ATRs.
Use four (4) “portable” Bluetooth units to perform travel time and OD studies, but this is of limited use.
In recent years, radar detection has become our “go-to” technology, including many of our over 40 permanent ATR locations.
On occasion we have used Google Maps and APIs to obtain traffic data, but still limited use and confidence in this approach.
While these services provide a good, basic data background, there are still limitations to the “snapshot in time” approach whereas travel demands are dynamic.
New System Planned
We do have a new system planned called our Transportation Integrated Mobility Management System (TIMMS) project. Still under development, but may include TSP, “Adaptive” Signals, TMC, CCTVs, and/or Travel Time Sensors.
Despite this we need to manage expectations for the new system.
One of the main components of this new system I wish to discuss today will be the development of a Data Strategy.
Renew London Web App
The City curates a web app that provides up-to-date road closure and disruption information.
Lots of detailed information available within the app, used may many business units within the City.
However, one needs to “get” the information from the website.
What if you don’t know about the website or are a visitor to the City?
This is frustrating to travellers and goes to the point that data is like a currency that needs to provide timely value when it is needed.
While there are similar third-party services available, Renew London is a custom-build, but is becoming somewhat dated.
We are currently assessing the use cases and future direction of Renew London.
Renew London Integration
First step: Instead to going to the Renew London website to “get” data, we’ll create a “push” data feed any third-party can use.
Our feed uses Waze’s Closure and Incident Feed Specification (CIFS) format as de-facto standard.
Generally, we have an agnostic stance on third-party usage:
Working with internal stakeholders such as London Transit Commission, Emergency Services, and Corporate Security.
Open to third-party external apps and services, e.g. TomTom, Garmin, Apple Maps, Navmii,
We joined Waze’s Connected Citizen’s Program (CCP) as of April 2018.
Free, two-way agreement.
Provide CIFS data every 2 minutes and obtain crowdsource and congestion data from Waze.
Push common data such as construction, collisions, road closures, and road hazards.
Ability to push out Emergency Shelter information in event of emergency.
Can access Waze via their web interface, third-party tools, or build/integrate your own.
WARP Project
Unique, less conventional ways are being explored to capture traffic data.
Open-source Waze Analytics Relational-database Platform (WARP) is on project currently under development by Louisville KY, the Open Government Coalition, and various other partners.
Includes four man areas of development:
A database to process and store historical data;
Data hooks via an API to integrate with other third-party products;
A traffic study tool to provide analytics to assist DDDM; and
An interactive map to visualize events and patterns over periods of time.
I’ve heard claims this could effectively provide data for FREE* traffic studies.
Time will tell, but caution that FREE likely means “Some conditions may apply”.
However, crowdsourced data could provide a preliminary snapshot of problematic areas of interest.
Source
https://docs.google.com/presentation/d/1loAV4BDAUyXdrn44QoLmYiwZdLmL59C4jvJGlZ1a-AY/edit
Train-Delay Warning System
Who likes getting stuck at train crossings?
We are currently testing train presence sensors at three (3) London railway crossing locations.
In 2019, we will be participating in a pilot program with TRAINFO through the Build in Canada Innovation Program (BCIP).
Our current project with this system is to gather train disruption data and determining the impacts to traffic patterns.
Over time, the algorithms build a crossing profile to “predict” when likely train disruption events will occur.
The intent is to provide timely, early warning systems to inform drivers and allow them to choose alternate routes.
Use Variable Message Signs (VMS) and/or travel apps, such as Waze.
Still limitations in propagation time from sensor detection to “push” notifications in apps - the train disruption could be over.
Blockchain Technology
Who here has heard of blockchain technology or smart contracts?
Who here has heard about Bitcoin or Ethereum?
Many people seem to know Bitcoin, but don’t (literally) buy into the hype!
Many people are similarly less familiar with Blockchain Technology.
I would argue that this technology is more important and has more of a potential future in Big Data and transportation.
Let’s start with a definition… [Read Definition].
In other words, a blockchain is distributed, immutable, and auditable.
Blockchain Technology
To visualize how blockchains interact, consider a centralized system, such as a central traffic signal system or ATMS.
Blockchain generally acts as a distributed system, but has characteristics of a decentralized system – a hybrid between the two.
Effectively, new data propagates through the network in near real-time as in the distributed model.
However, individuals in isolation can access the network as in the decentralized model.
Redundancy in the network – several, if not most, of the network nodes could fail while continuing to maintain functionality.
Potential Use Cases
Link together Mobility-as-a-Service (MaaS) partners for seamless data exchange.
Transparency and incentive tracking for Public-Private-Partnerships (P3s) – MOBI.
Help the AI in Autonomous Vehicle algorithms “learn” quicker by propagating new data in near real-time.
Break down data silos and give credit and value to data producers – Ocean Protocol.
Shameless plug for my pet project, TripChain.Org – propagate trip generation data to improve transportation planning decisions.
Many other Blockchain projects, constant churn, still emerging.
Prognosis for Blockchain
Overall still an emerging, promising technology, but watch progress in coming 5-10 years.
Technology still not proven in production, no “killer app” yet to emerge.
Although data storage getting cheaper and cheaper, still beware of data bloat.
Connected and Autonomous Vehicles
Briefly touch upon Connected and Autonomous Vehicles (CAVs).
While still several years out, we wish to take a more proactive approach to preparing for CAVs as opposed to reacting when they show up on our roads.
Developed a CAV Technical Background report in May 2018 and obtained a Council resolution to prepare a CAV strategy for the City of London.
Focus on recommendations including Infrastructure, Land Use, Transit, Parking, Accessibility, Safety, Privacy and Security, and Public Awareness and Education.
Connected and Autonomous Vehicles
Getting an early start, some municipalities are already piloting Vehicle-to-Infrastructure (V2I) communication, including SPaT and MAP data.
To pull CAVs off, we will need to explore and leverage non-traditional data partnerships.
Each CAV itself will likely be a valuable data source.
How can municipalities leverage this data for planning and operations?
Will vehicle self-report pot holes, road conditions, etc.?
How will municipalities respond?
What are the impacts on liability.
May questions to explore in the near future.
Connected and Autonomous Vehicles
The report included several key points related to data and municipalities:
There are significant privacy issues: As more and more vehicle data is generated, collected and shared, governments must act to protect the privacy rights and security of individuals.
Technology expertise is urgently needed: It will be crucial for regulators to develop expertise in data science and computer science. CAVs will generate large volumes of data which currently have no clear ownership rights. Governments will need to effectively address these concerns and protect the public interest.
Physical infrastructure modifications can wait, focus on digital infrastructure: CAVs are being designed to work with the physical or “hard” infrastructure that exists today. However, governments will need to be aware of which technologies automakers and suppliers are using. Digital interfaces such as sensor data, maps, etc. warrant more immediate attention to help support CAV systems.
Recommends governments should build data expertise and capacity as part of the interdisciplinary effort to make CAVs a reality in the coming years and decades.
Example Strategic Questions
Thinking at a high level, here are some example strategic questions municipalities could consider.
[Review question list]
Big Data Skills
Five (5) essential data science skills to develop.
[Review slides]
Goodbye Spreadsheets
I have a love/hate relationship with spreadsheets.
Tables and charts will always be valuable to gain insights, but depending on the audience, data visualizations are a great tool.
Observing data over time also gives a sense to guide decision making.
Big Data as a Service (BDaaS)
Of course, it can be challenging for many municipalities to have staff and expertise move from data to insights.
Organizational challenges such as operations, structure, and size may provide hurdles to gaining insights.
BDaaS is a growing and viable alternative to do-it-yourself data analysis. Depending on the service can collect, store, analyze, and provide access to data.
Ultimately need to consider whether a BDaaS provider supports your strategy and look at the cost vs. traditional approaches.
These are not an endorsement nor an extensive list, but here are several companies in our industry which may offer BDaaS to varying degrees.
I’m sure there are many others that I have not yet considered.
Key Takeaways
I like this quote… [Read Quote]. Does this ring true today, nearly 60 years later?
Like promises of the past, Big Data is not a panacea, so manage expectations with key stakeholders and the public.
By creating a Data Strategy and planning ahead, your municipality can use Big Data effectively.
Beware of too many purpose-built apps.
Look integration opportunities with prominent or popular apps or services to maximize your impact.
Not just serving your municipality, but also visitors and travellers passing through.
And lastly, informed travellers have the ability to make better travel decisions, hopefully experiencing less frustration.
Help to use the transportation network more effectively.
Knowledge is power and can help minimize the frustration the comes from the unexpected.