Last year Basis Technology introduced Odyssey – an analytics solution, which provides an open, scalable platform for search, navigation and discovery. Its purpose is to streamline the development of highly customizable solutions for efficiently discovering relevant information from vast volumes of structured and unstructured content. Basis Technology has recently teamed with Kapow to incorporate their industry leading Big Data integration platform into the Odyssey solution to enhanced both the range of data now available to Odyssey as well as the ease of deployment. During this session, Stefan Andreasen (Kapow) and Jeff Godbold (Basis Technology) will provide an overview of this joint solution, highlighting the many benefits it offers to the world of multilingual, information discovery.
Keynote Slides: Profiting From Technology TrendsRoss Dawson
Slides for Ross Dawson's keynote at National Association of Federal Credit Unions Board of Directors Conference in Maui. Slides were designed to support the keynote, not to be viewed alone. For more see www.rossdawson.com
Agile:MK November 2017 - Why is change so hard?Milan Juza
We all know that the world changes rapidly, with new things becoming completely obsolete in a few short years, new methods being discovered, new competitors entering many markets. To succeed, teams and organisations need to be able to respond to and benefit from all these changes. Agility is all about change: from the fourth point of the Agile manifesto “responding to change over following a plan” to the final principle requiring regular retrospectives where the team changes their process. And we meet at Agile:MK out of a desire to change and improve ourselves and our working lives.
Change is both constant and necessary for survival and growth, both personally and for our companies. And yet we also all know that change can be agonisingly slow when we’re trying to introduce it. Change can be scary and unwelcome when it’s inflicted on us, even if we agree with the need.
At the November Agile:MK meetup we discussed this paradox, try see if we can understand why change is hard, and what we might do to make it easier. We hope to share practical experiences, approaches and practices and explore how to lead change successfully.
The Future of Global Financial Services - Vision 2020 Mumbai September 2008Ross Dawson
Presentation by Ross Dawson at the Vision 2020 - Financial Services Sector conference, run by NDTV Convergence and Wipro, held in Mumbai on 12 September 2020
5 minute Pitch deck for www.crowd-solve.it from the Hunter Pitchfest 2016 finals held at Dantia Smart Hub on 20/10/16.
CrowdSolve is a mobile platform that bridges the gap between the community and government. It's our mission to empower you to make an impact by sharing suggestions for community, building an engaged audience and connecting you with decision makers tp make your idea a reality.
For government, we provide the means to radically increase community engagement on local projects, as well as make sense of the ever increasing volume of unstructured data available, ultimately leading to a more productive and informed government.
"Merging of Digital & Physical Tech" - insights by Veemal Gungadin for #TSEA ...GEVME
Check out the main insights from the speech of Veemal Gungadin, CEO of GlobalSign.in and VP of Digital & Innovations, SACEOS during "The Special Event Asia".
Why Rural Areas Need Excellent Economic Development WebsitesBen Wright
Rural communities may be small, but their importance is not. Small communities make up a vast majority of cities and counties in the United States, which means having a great economic development website to carve out their space in the digital marketplace is essential.
Learn how to research and utilize Big Data to tell the story of your community and ultimately attract companies, talent, and capital to your front door.
Keynote Slides: Profiting From Technology TrendsRoss Dawson
Slides for Ross Dawson's keynote at National Association of Federal Credit Unions Board of Directors Conference in Maui. Slides were designed to support the keynote, not to be viewed alone. For more see www.rossdawson.com
Agile:MK November 2017 - Why is change so hard?Milan Juza
We all know that the world changes rapidly, with new things becoming completely obsolete in a few short years, new methods being discovered, new competitors entering many markets. To succeed, teams and organisations need to be able to respond to and benefit from all these changes. Agility is all about change: from the fourth point of the Agile manifesto “responding to change over following a plan” to the final principle requiring regular retrospectives where the team changes their process. And we meet at Agile:MK out of a desire to change and improve ourselves and our working lives.
Change is both constant and necessary for survival and growth, both personally and for our companies. And yet we also all know that change can be agonisingly slow when we’re trying to introduce it. Change can be scary and unwelcome when it’s inflicted on us, even if we agree with the need.
At the November Agile:MK meetup we discussed this paradox, try see if we can understand why change is hard, and what we might do to make it easier. We hope to share practical experiences, approaches and practices and explore how to lead change successfully.
The Future of Global Financial Services - Vision 2020 Mumbai September 2008Ross Dawson
Presentation by Ross Dawson at the Vision 2020 - Financial Services Sector conference, run by NDTV Convergence and Wipro, held in Mumbai on 12 September 2020
5 minute Pitch deck for www.crowd-solve.it from the Hunter Pitchfest 2016 finals held at Dantia Smart Hub on 20/10/16.
CrowdSolve is a mobile platform that bridges the gap between the community and government. It's our mission to empower you to make an impact by sharing suggestions for community, building an engaged audience and connecting you with decision makers tp make your idea a reality.
For government, we provide the means to radically increase community engagement on local projects, as well as make sense of the ever increasing volume of unstructured data available, ultimately leading to a more productive and informed government.
"Merging of Digital & Physical Tech" - insights by Veemal Gungadin for #TSEA ...GEVME
Check out the main insights from the speech of Veemal Gungadin, CEO of GlobalSign.in and VP of Digital & Innovations, SACEOS during "The Special Event Asia".
Why Rural Areas Need Excellent Economic Development WebsitesBen Wright
Rural communities may be small, but their importance is not. Small communities make up a vast majority of cities and counties in the United States, which means having a great economic development website to carve out their space in the digital marketplace is essential.
Learn how to research and utilize Big Data to tell the story of your community and ultimately attract companies, talent, and capital to your front door.
Keynote Slides: Creating the Future of NewsRoss Dawson
Slides for Ross Dawson's opening keynote at INMA World Congress 2015 in New York. Note that the slides are not designed to be viewed without attending the keynote, and there are numerous videos in the presentation not visible in these slides. For more see www.rossdawson.com
How to Accelerate Decision Making, Info Flows, & Open Mindsets
Organizations are under tremendous pressure from increased competition, volatile political shifts, and digital transformation. To survive, they need to be able to quickly understand what is going on and make faster, and better, decisions. Knowledge repositories just aren’t going to cut it any more. A seasoned Innovation and KM practitioner offers a new way to look at KM and innovation based on interesting research and with practical ways you can deliver even more value to your organization.
Curious about the US National Strategy for Trusted Identities in Cyberspace (NSTIC) and its private sector-lead partner the Identity Ecosystem Steering Group (IDESG)? Look no further. Here is the deck I used to give an update at the Kantara workshop at the Identity Relationship Management Summit.
DATAHOLICS is the only platform that captures and structure millions of data about people who are on social networks as Facebook, Linkedin, Google, Twitter and many others. It also captures information from users who are in different public sources such as Google search results, blogs, web portals and online services. Our algorithm creates an unified profile of the people with behavioral, professional and demographic indicators based on their email, celphone, name or ID.
Deals for Data - alliances in the digital economySachin Kapoor
Lecture presentation: data drives the alliances narrative in the digital economy. What to keep in mind when trying to stitch an alliance for accessing data or sharing it.
Data has become a true enterprise asset, but the gap between the enormous amount of data we have and achieving a firm foundation of master data which we can turn into business intelligence is vast. Join other councils to hear how they are meeting this challenge.
RYAN SAXBY HILL: Do your technology choices support your organization’s values?NetSquared Vancouver
Do your technology choices support your organization’s values? Working to build a better online Canada with informed technology choice
What does your choice of technology say about your values? How can choices about technology help build community? Sometimes decisions about Internet technologies seem small, but these small choices add up. Building a strong, resilient, and accessible Internet in Canada requires us all to understand how our decisions affect the larger Internet ecosystem and what we can do to ensure that our decisions are in line with our values.
Using data from the .CA Internet Factbook gathered from audits of 400 Canadian non-profit organizations conducted by Framework, and examples from the Internet and domain name industry, this session will address some key decisions about Internet technology and how non-profit executives can make choices that respect their mission, values and the needs of their clients.
@saxby
Ryan Saxby Hill is an expert in communications and digital marketing. He is currently the communications manager with the Canadian Internet Registration Authority, operators of the .CA domain name. Previously, Ryan led media relations and online engagement efforts at the Canada Foundation for Innovation and has held positions handling global communications and PR programs for Ciena Corporation and Nortel Networks.
Ryan is a founder of Apartment613, an award winning Ottawa-based digital community media organization and serves on the board of directors for the Centretown Citizens Ottawa Corporation, one of Canada’s most innovative non-profit housing providers.
====
Now in its 3rd year, The Digital Nonprofit Conference is ready to take you to the next level of tech success in your organization. This year's line up of presenters includes experts in the tech, nonprofit and private sectors, delivering deep dive discussions on topics ranging from:
Capacity planning in the digital world
Choosing the right tech tools to suit your organization's values
Cultivating digital talent
Digital fundraising & donor engagement
Building community engagement strategies with corporate partners
Rosette Search Essentials is a multilingual text analytics plugin that offers a wealth of powerful text analytics functionality, including tokenization, lemmatization, decompounding and POS tagging, along with entity extraction and entity resolution in Asian, European and Middle Eastern languages.
Keynote Slides: Creating the Future of NewsRoss Dawson
Slides for Ross Dawson's opening keynote at INMA World Congress 2015 in New York. Note that the slides are not designed to be viewed without attending the keynote, and there are numerous videos in the presentation not visible in these slides. For more see www.rossdawson.com
How to Accelerate Decision Making, Info Flows, & Open Mindsets
Organizations are under tremendous pressure from increased competition, volatile political shifts, and digital transformation. To survive, they need to be able to quickly understand what is going on and make faster, and better, decisions. Knowledge repositories just aren’t going to cut it any more. A seasoned Innovation and KM practitioner offers a new way to look at KM and innovation based on interesting research and with practical ways you can deliver even more value to your organization.
Curious about the US National Strategy for Trusted Identities in Cyberspace (NSTIC) and its private sector-lead partner the Identity Ecosystem Steering Group (IDESG)? Look no further. Here is the deck I used to give an update at the Kantara workshop at the Identity Relationship Management Summit.
DATAHOLICS is the only platform that captures and structure millions of data about people who are on social networks as Facebook, Linkedin, Google, Twitter and many others. It also captures information from users who are in different public sources such as Google search results, blogs, web portals and online services. Our algorithm creates an unified profile of the people with behavioral, professional and demographic indicators based on their email, celphone, name or ID.
Deals for Data - alliances in the digital economySachin Kapoor
Lecture presentation: data drives the alliances narrative in the digital economy. What to keep in mind when trying to stitch an alliance for accessing data or sharing it.
Data has become a true enterprise asset, but the gap between the enormous amount of data we have and achieving a firm foundation of master data which we can turn into business intelligence is vast. Join other councils to hear how they are meeting this challenge.
RYAN SAXBY HILL: Do your technology choices support your organization’s values?NetSquared Vancouver
Do your technology choices support your organization’s values? Working to build a better online Canada with informed technology choice
What does your choice of technology say about your values? How can choices about technology help build community? Sometimes decisions about Internet technologies seem small, but these small choices add up. Building a strong, resilient, and accessible Internet in Canada requires us all to understand how our decisions affect the larger Internet ecosystem and what we can do to ensure that our decisions are in line with our values.
Using data from the .CA Internet Factbook gathered from audits of 400 Canadian non-profit organizations conducted by Framework, and examples from the Internet and domain name industry, this session will address some key decisions about Internet technology and how non-profit executives can make choices that respect their mission, values and the needs of their clients.
@saxby
Ryan Saxby Hill is an expert in communications and digital marketing. He is currently the communications manager with the Canadian Internet Registration Authority, operators of the .CA domain name. Previously, Ryan led media relations and online engagement efforts at the Canada Foundation for Innovation and has held positions handling global communications and PR programs for Ciena Corporation and Nortel Networks.
Ryan is a founder of Apartment613, an award winning Ottawa-based digital community media organization and serves on the board of directors for the Centretown Citizens Ottawa Corporation, one of Canada’s most innovative non-profit housing providers.
====
Now in its 3rd year, The Digital Nonprofit Conference is ready to take you to the next level of tech success in your organization. This year's line up of presenters includes experts in the tech, nonprofit and private sectors, delivering deep dive discussions on topics ranging from:
Capacity planning in the digital world
Choosing the right tech tools to suit your organization's values
Cultivating digital talent
Digital fundraising & donor engagement
Building community engagement strategies with corporate partners
Rosette Search Essentials is a multilingual text analytics plugin that offers a wealth of powerful text analytics functionality, including tokenization, lemmatization, decompounding and POS tagging, along with entity extraction and entity resolution in Asian, European and Middle Eastern languages.
OSS 2013 - Real World Facets with Entity Resolution by Benson MarguliesBasis Technology
Solr’s ability to facet search results gives end-users a valuable way to drill down to what they want. But for unstructured documents, deriving facets such as the persons mentioned requires advanced analytics. Even if names can be extracted from documents, the user doesn’t want a “George Bush” facet that intermingles documents mentioning either the 41st and 43rd U.S. Presidents, nor does she want separate facets for “George W. Bush” or even “乔治·沃克·布什” (a Chinese translation) that are limited to just one string. We’ll explore the benefits and challenges of empowering Solr users with real-world facets.
View the slides of a tribute to the late Alan Magill by ASTMH Past President Christopher V. Plowe, MD, MPH, FASTMH, during WRAIR's inaugural Magill Symposium on June 23 in Silver Spring, MD.
Multilingual Search and Text Analytics with Solr - Open Source Search ConferenceBasis Technology
This talk will explore the challenges of Multilingual search, including language-specific issues — like N-gram segmentation vs. morphological analysis, stemming vs. lemmatization, and language identification — and the various approaches to configuring your Solr schema. We will also discuss the integration strategies for common text analytics capabilities and the impact of multilingual content on application design.
Solr is a powerful search engine which rapidly gained acceptance as an alternative to commercial search solutions for many applications. There are many features required by organizations to serve their diverse communities, among these is the ability to deliver search excellence in foreign languages. Delivering quality multilingual search involves careful design of schemas and selection of the best linguistic approach for each supported language.
Simple fuzzy name matching in elasticsearch paris meetupBasis Technology
Those are the slides that were presented during the Elasticsearch meetup in Paris on July 29th.
Normalization is crucial to high quality search results -- who wants irrelevant variations between queries and documents leading to missed hits (e.g., “celebrity” v. “celebrities”)? Normalizing dictionary words works, but what if your application focuses on names? Whether you’re tackling log analysis, e-commerce, watch list screening or other applications, names are often the key. Can you find “Abdul Jabbar, Karim” if you search for “Kareem AbdalJabar” or “كريم عبد الجبار”?
Applications using Elasticsearch provide some fuzziness by mixing its built-in edit-distance matching and phonetic analysis with more generic analyzers and filters. We’ve tried to go beyond that to provide both better matching and a simpler integration. We use a custom Mapper and Score Function so that linguistic nuances can be handled behind-the-scenes. We’ll talk about how we built this sort of plug-in for Rosette, its customization, and its connection to broader trend of entity-centric search.
Basis Technology's OSINT is a web-based application that enables analysts to extract and resolve named entities, including people, places, and organizations, from large quantities of multilingual text. OSINT is deployable in the cloud or on premises, and can be customized to address different data sources, knowledge bases, and workflow models.
Multilingual search requires the developer to address challenges that don’t exist in the monolingual case. In Solr, a robust multilingual search engine requires different analysis chains for each language because each language has its own logic for tokenization, lemmatization, stemming, synonyms, and stop words. To make multilingual search even harder, query strings are typically no longer than a handful of words, making language identification of query strings more difficult, or at worst ambiguous even to a human (“pie” could be an English or Spanish query). We’ll explore the breadth of Solr schema and configuration options available to a multilingual search application developer to balance functionality, performance, and complexity. We’ll dive deep into specific experiments with a practical application.
Speaker Bio: David Troiano
David Troiano is a Principal Software Engineer at Basis Technology who develops the services and applications that consume the core natural language processing products that Basis delivers. Over the past five years, he has worked on content search, discovery, and recommendation systems built on Lucene / Solr, with an eye toward scalability and performance. He also has professional experience with machine learning and predictive analytics tools in the quantitative finance industry. David holds a bachelor’s degree in Computer Science from Harvard College.
Presentation delivered by ASTMH Executive Director Karen A. Goraleski for the National Center for Emerging and Zoonotic Infectious Diseases (NCEZID) Lecture Series at the Centers for Disease Control and Prevention
Slides from President-Elect Patricia F. Walker, MD DTM&H, FASTMH, keynote adress to the 6th Annual North American Refugee Health Conference , June 12, 2016 in Niagara Falls, NY: “Refugee Healthcare: Imagining our Future.”
Big Data Triage with Rosette Human Language Technology ConferenceBasis Technology
This talk will discuss how Rosette — entity extraction, entity searching, document clustering, near duplicate detection, and fact-relationship-event extraction — can be combined with a powerful search engine to facilitate information discovery and thematic analysis across a variety of sources and languages.
The term “Big Data” has many possible meanings — large volume, fast-moving, many sources — but the issues it creates are clear. Analysts have significantly more data available, but the tools to exploit this data haven’t kept pace.
Many legacy approaches to analytic systems — databases and custom applications around them — are not flexible enough to pull in data from new sources at a moment’s notice, are not able to import and share the new data quickly enough to provide actionable intelligence, and cannot scale up to hold the massive amounts of data being produced.
But even if today’s systems could handle all of the available data — when presented with massive volumes of semi-structured, multilingual data from many sources, how effectively could an analyst discover the relevant data and efficiently move it into the analytical process?
View more slides from the Human Language Technology Conference 2012 here: http://info.basistech.com/hlt-2012-slides
The REAL Impact of Big Data on PrivacyClaudiu Popa
The awesome promise of Big Data is tempered by the need to protect personal information. Data scientists must expertly navigate the legislative waters and acquire the skills to protect privacy and security. This talk provides enterprise leaders with answers and suggests questions to ask when the time comes to consider the vast opportunities offered by big data.
Integra: Summiting the Mountain of Big Data (Infographic)Jessica Legg
Concepted, copywrote and creative directed the development of a new infographic for Integra around the theme of Big Data.
Summary: The mountain of Big Data is growing, presenting immense opportunities for businesses ready to summit its peak, but the journey requires preparation.
Our infographic will help you understand how big "Big Data" is; the business advantages you can capture by tapping into its power; and how you can prepare to meet its demands—resulting in Big Gains from Big Data.
Business data has changed radically. Enterprises today use thousands of SaaS applications and business systems that create more data than ever imagined, resulting in a struggle for users to gain holistic and actionable insights. Organizations need a solution to simplify the end to end workflow-- from data prep and governance to visualization, delivery, and action. This webinar will reveal a proven solution with real world examples and how it creates future opportunities for your organization.
Notes from the Observation Deck // A Data Revolution gngeorge
Notes from the Observation Deck will provide you with an examined look at the interesting phenomena and trends taking place around us today. We present them to you with the hope of sparking broader conversations, debates and ideas. Please use this as a resource for knowledge, inspiration and enjoyment.
The mountain of Big Data is growing, presenting immense opportunities for businesses ready to summit its peak, but the journey requires careful preparation. Integra helps businesses equip their network infrastructure to handle big requirements for Big Data—with fully-symmetrical Ethernet solutions designed to deliver low-latency, high-bandwidth connectivity between organizational peers, the cloud, and the servers where your data is stored. Our infographic, "Summiting the Mountain of Big Data" will help you understand how big "Big Data" really is; who's producing, consuming, managing and storing all that data; the business advantages you can capture by tapping into its power; and how you can prepare your organization to meet its demands—resulting in Big Gains from Big Data.
Maximize the Value of Your Data: Neo4j Graph Data PlatformNeo4j
In this 60-minute conversation with IDC, we will highlight the momentum and reasons why a graph data platform is a breakthrough solution for businesses in need of a flexible data model.
Please join Mohit Sagar, Group Managing Director of CIO Network, as he hosts the conversation with Dr. Christopher Lee Marshall, Associate VP at IDC, and Nik Vora, Vice President of APAC at Neo4. During this very exciting discussion, you'll discover the insights and knowledge unlocked with the graph data platform.
consists of two parts a) CROWDSOURCING b) BIG DATA. You will.docxclayrhr
consists of two parts a)
CROWDSOURCING
b)
BIG DATA
. You will upload two separate documents.
CROWDSOURCING:
Today it is not unusual to see entrepreneurs rely on the crowd to seek financial assistance to support their business idea instead of going to a traditional financial investor, bank or seek venture capital. The entrepreneur uses his or her social networks and established platforms on the Internet to directly interact with the crowd. The term
crowdsourcing
was first introduced in 2005 by Howe and Robinson, editors at Wired, however, it is in recent years this form of funding has taken off. In 2013, crowd funding became a $5.1 billion industry and in 2014 the market grew to $16.2 billion and it is expected to exceed $34.4 billion by the end of 2015. In China alone, the crowd funding industry is expected to reach $50 billion by 2025. We are looking at a potential global crowd funding market of $90–96 billion by 2025.
As the Internet becomes more integrated into all of our lives, driven in large part through mobile and social, we will all become increasingly interconnected. Our new interconnected world will contain multiple networks that represent different dimensions of people’s lives - social (
Facebook
), business (
LinkedIn
), food (
Allrecipes
), wine (
Vivino
), travel (
TripAdvisor
), movies (
Netflix
), fitness (
Runtastic
) - and cross the digital-physical divide (The Internet of Things).
Crowdsourcing
sites like G2 Crowd will represent just one more type of network that will connect people with products and technology, telling you what products they used, what they thought of them, and what reviews they read, liked or shared. If you then link the
crowdsourcing
network to a business network like
LinkedIn
, you can connect companies to reviewers and bring with it lots of context that when comprehensively analyzed can transform your understanding of the reviews.
Keywords:
crowdsourcing, crowdfunding, crowd wisdom, crowd creation, crowd voting, crowdfunding models, SWOT.
BIG DATA:
is an evolving term that describes any voluminous amount of structured, semi-structured and unstructured data that has the potential to be mined for information. Although big data doesn't refer to any specific quantity, the term is often used when speaking about
petabytes
and
exabytes
of data.
Measured in terms of volume, velocity, and variety, Big Data represents a major disruption in the business intelligence and data management landscape, upending fundamental notions about governance and IT delivery. With traditional solutions becoming too expensive to scale or adapt to rapidly evolving conditions, companies are scrambling to find affordable technologies that will help them store, process, and query all of their data. Innovative solutions will enable companies to extract maximum value from Big Data and create differentiated, more personal customer experiences.
Keywords:
What is Big Data,
History and Future of Big Data, Fundamentals o.
As 2017 begins, we are seeing big data and data science communities engage with new tools that specifically cater to data scientists and data engineers who aren’t necessarily experts in these techniques. Given rapid technological advances, the question for companies now is how to integrate new data science capabilities into their operations and strategies—and position themselves in a world where analytics can upend entire industries. Leading companies are using their data science capabilities not only to improve their core operations but also to launch entirely new business models.
Index:
1) The Importance of Data
2) What is Big Data Concept
3) Big Data vs. Cloud Computing
4) The basic idea behind Big Data
5) Why do we use Big Data
6) Top 10 companies using Big Data
7) What kind of data is Big Data
8) Is Privacy a value
9) Future of Big Data by 2020
Learn to Find Your Dream Job with Your Dream Employer with DreamPath pdx MindShare
DreamPath founders presented at pdxMindShare for our June event and talked about the software they developed to help people identify their dream employers. They were both unhappy with their jobs at one point, and their answer was to help people learn how to find companies they would enjoy working for. The event was held at Trader Vic's in downtown Portland and the workshop was followed by networking where attendees got to interact with dozens of Portland professionals.
Similar to HLT 2013 - Big Data Navigation and Discovery by Stefan Andreasen & Jeff Godbold (20)
When applied to novel domains such as legal, medical, and hacker chatter, the out-of-box accuracy of NLP systems trained on news and other general-purpose datasets leaves much to be desired. What matters is how well a system performs on your</em data, and how easy it is to extract the information you need with minimal developer effort. In this webinar, we’ll introduce three new customization techniques for achieving your specific text processing goals with Rosette:
• Rapid development of custom entity & event extraction models with active learning, which reduces the number of annotated samples needed by about 75%.
• Resolving entity mentions to your knowledge base. With our custom database connector, leverage the power of contextual disambiguation for domain-specific entities of any type.
• Building custom text processing workflows to weave together multiple NLP functions with custom logic. For example, run entity extraction on an Arabic document to pull out key people, places, and organizations, then subsequently translate these entity names into English, all via a single API call.
Heather Phipps, VP of Product Management
Hannah MacKenzie-Margulies, Senior Product Manager
Basis Technology
HOW TO USE AI TO TACKLE CRISIS KYC
Capably matching names and other personally identifiable information (PII) is critical to any effective compliance screening system: failure puts reputation, finances, and ethics on the line. Unfortunately, globalization coupled with the economic impact of the pandemic is testing screening systems like never before. As applications pour in, these systems are being asked to process key identity data in a huge variety of languages at unprecedented volumes. If these critical systems can’t keep up, everyone loses. But no one has to.
In Smart Matching for Screening, AI vet Steve Cohen will provide you with a clear roadmap for enhancing your screening systems with AI and NLP so you can cut false positives, reduce risk, and find bad actors during this crisis.
STEVE COHEN, DECLAN TREZISE
Basis Technology
Understanding Names with Neural Networks - May 2020Basis Technology
Matching names across languages and writing systems is a critical issue in a variety of consumer and governmental domains. Historically, computers have attempted to solve this problem with ad-hoc methods such as edit distance, sound indexing, and Hidden Markov Models, but these have a variety of practical limitations in this problem space, which we will explore. To address these issues, we present our research and development team’s work on doing English/Japanese name matching using deep neural networks, which provides a substantial boost in accuracy.
KFIR BAR, PHILIP BLAIR, CARMEL ELIAV
Basis Technology
Natural language processing (NLP) is advancing at breakneck speeds. This one-hour webinar will get you up to speed with the latest enhancements to the Rosette Text Analytics platform and overarching trends in NLP.
Chris Mack, VP of Text Analytics, covers how Rosette uses semantic signals to extract and link entities to open or proprietary knowledge bases. He also demonstrates a new tool for visualizing machine learning-powered cross-lingual fuzzy name matching.
Kfir Bar, Chief Scientist, discusses how active learning is enabling the next wave of human language technology, such as event and semantic relationship extraction.
The webinar consists of a 45-minute presentation and 15 minutes of Q&A. To watch the webinar in its entirety go to: https://basistech.wistia.com/medias/uje50rxucg
Simple fuzzy Name Matching in Elasticsearch - Graham MoreheadBasis Technology
Slides Washington DC Elasticsearch Meetup
June 25th, 2015
Normalization is crucial to high quality search results -- who wants irrelevant variations between queries and documents leading to missed hits (e.g., “celebrity” v. “celebrities”)? Normalizing dictionary words works, but what if your application focuses on names? Whether you’re tackling log analysis, e-commerce, watch list screening or other applications, names are often the key. Can you find “Abdul Jabbar, Karim” if you search for “Kareem AbdalJabar” or “كريم عبد الجبار”?
Applications using Elasticsearch provide some fuzziness by mixing its built-in edit-distance matching and phonetic analysis with more generic analyzers and filters. We’ve tried to go beyond that to provide both better matching and a simpler integration. We use a custom Mapper and Score Function so that linguistic nuances can be handled behind-the-scenes. We’ll talk about how we built this sort of plug-in for Rosette, its customization, and its connection to broader trend of entity-centric search.
OSDF 2013 - Autopsy 3: Extensible Desktop Forensics by Brian CarrierBasis Technology
Autopsy 3 is an easy to use digital forensics tool. Its development started after discussions at the first OSDF conference, with the goal of being a platform for which other developers will write modules. Autopsy allows you to perform a digital forensics exam on Windows using a free tool. This talk will cover the basic features of Autopsy, including timeline analysis, registry analysis, web artifact analysis, keyword search, and hash sets. There will also be discussion about future modules, and how to get involved as a user or developer.
HLT 2013 - Triaging Foreign Language Documents for MEDEX by Brian CarrierBasis Technology
When digital forensics investigators come across multilingual documents during an examination, how do they quickly check the content without a translator in the room? Basis Technology has built a document triage solution that integrates entity finding and translation capabilities with navigation to quickly help the examiner identify the priority of the document. This solution is a module for Autopsy, which is an open source digital forensics platform that has thousands of users and contributors.
HLT 2013 - Adapting News-Trained Entity Extraction to New Domains and Emergin...Basis Technology
Many of the most robust Human Language Technologies, including statistical part of speech taggers and entity extractors, are developed primarily using high quality newswire datasources. The performance of these technologies on texts in other genres, including short texts like tweets and even sub-genres of news like market summaries, is typically poor. Adapting such technologies to these increasingly important genres is still very difficult and an active area of commercial and academic research. In this presentation, Mr. Stewart will highlight the ways in which newswire trained modules typically fail on the most important emerging text genres, outline the most effective and lowest cost methods to adapt these resources that researchers and practitioners have discovered, and offer guidance on what degree of improvement users can expect to see in the short to medium term.
HLT 2013 - From Research to Reality: Advances in HLT by David MurgatroydBasis Technology
There's never been a more exciting time to be involved in Human Language Technology (HLT). Advances in algorithms, architectures, and applications are making real differences in fulfilling missions around the world. We'll use the perspective of one specific, end-to-end use case starting from primary source collection going all the way through finished intelligence to show the value and importance of moving your HLT thinking from strings to things, from configuration to adaption, from isolation to collaboration, and from small scale to Big Text. This perspective will serve as a guide to the other talks of the day which together will give you greater insight in applying HLT to your mission.
Autopsy 3: Free Open Source End-to-End Windows-based Digital Forensics PlatformBasis Technology
Autopsy™ is the premier free and open source end-to-end digital forensics platform built by Basis Technology and the digital forensics open source community. The platform has been in development since OSDF Con 2010, based on intense interest and collaboration from the digital forensics community, which determined the need for an open source end-to-end forensics platform that runs on Windows systems.
Autopsy version 3 is a complete rewrite from version 2 and is built to enable the creation of fast, thorough, and efficient hard drive investigation tools that can evolve with digital investigators’ needs. The standard installation includes features that rival commercial closed source offerings, without the associated costs.
FEATURES
Triage capability and real-time alerting
Automated workflow based on The Sleuth Kit™
Windows installation
Case management and report generation
Recent user activity extraction including: web history, recent documents, bookmarks, downloads, and registry analysis
Keyword and pattern search including: phone numbers, email addresses, URLs, and IP addresses
Hash lookup
Interesting files detection and timeline viewing
...and much more
For digital forensics investigators and analysts, there are numerous advantages to using open source software and software built on open source platforms like Autopsy and The Sleuth Kit:
• Transparent evidence extraction: Open source platforms allow you to look at the source code and to verify that the software is performing its functions in a forensically sound way. This can prove to be critical when testifying or preparing for litigation.
• Easily extensible: Open source platforms grow organically and as the needs of their consituents and users change, so does their functionality.
• Active community of users and developers: In addition to commercial support offered by Basis Technology,
there is a wealth of information that is available in a community that has evolved over the last 11 years where both users and developers are actively working to improve the software platform. This free knowledge base is an extremely powerful value add to your purchased enterprise support.
A Lightning Introduction To Clouds & HLT - Human Language Technology ConferenceBasis Technology
What’s all this cloud stuff, anyway? What kinds of problems do organizations set out to solve with ‘a cloud,’ or even ‘the cloud’? What are a few of the major government initiatives involving this technology? How does HLT in general, and Search in particular, fit?
This talk will take a tour of the technology behind clouds and the sometimes-foggy ambitions of the projects that use them, and look in particular detail at the challenges of applying cloud technologies to Text Analytics.
View more slides from the Human Language Technology Conference 2012 here: http://info.basistech.com/hlt-2012-slides
Autopsy 3.0 - Open Source Digital Forensics ConferenceBasis Technology
Autopsy 3.0 is a complete rewrite from Autopsy 2.0, and this talk will cover all of the things that are new about it. Multi-threaded ingest, triage, embedded databases, web artifact analysis, and indexed keyword search are just some of the new and exciting features.
This talk is targeted towards both users and developers. Users will learn about the tool, and how they can use it. Developers will learn the basics of where they can incorporate their tools into the Autopsy workflow as modules.
View more slides from the Open Source Digital Forensics Conference 2012 here: http://info.basistech.com/osdf-2012-slides
Moving Beyond Entity Extraction to Entity Resolution - Human Language Technol...Basis Technology
Entity extraction finds names in documents, providing important raw material for big decisions. But finding all mentions of the name “George Bush” is very different than finding all mentions of the 43rd US President.
Making big decisions from big data is hopeless unless analytics advance from providing snippets of text to providing statements of truth. Such advances present challenges both of accuracy and of usability. We’ll explore these challenges and demonstrate ways of addressing them.
View more slides from the Human Language Technology Conference 2012 here: http://info.basistech.com/hlt-2012-slides
Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...Jeffrey Haguewood
Sidekick Solutions uses Bonterra Impact Management (fka Social Solutions Apricot) and automation solutions to integrate data for business workflows.
We believe integration and automation are essential to user experience and the promise of efficient work through technology. Automation is the critical ingredient to realizing that full vision. We develop integration products and services for Bonterra Case Management software to support the deployment of automations for a variety of use cases.
This video focuses on the notifications, alerts, and approval requests using Slack for Bonterra Impact Management. The solutions covered in this webinar can also be deployed for Microsoft Teams.
Interested in deploying notification automations for Bonterra Impact Management? Contact us at sales@sidekicksolutionsllc.com to discuss next steps.
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...Ramesh Iyer
In today's fast-changing business world, Companies that adapt and embrace new ideas often need help to keep up with the competition. However, fostering a culture of innovation takes much work. It takes vision, leadership and willingness to take risks in the right proportion. Sachin Dev Duggal, co-founder of Builder.ai, has perfected the art of this balance, creating a company culture where creativity and growth are nurtured at each stage.
The Art of the Pitch: WordPress Relationships and SalesLaura Byrne
Clients don’t know what they don’t know. What web solutions are right for them? How does WordPress come into the picture? How do you make sure you understand scope and timeline? What do you do if sometime changes?
All these questions and more will be explored as we talk about matching clients’ needs with what your agency offers without pulling teeth or pulling your hair out. Practical tips, and strategies for successful relationship building that leads to closing the deal.
Neuro-symbolic is not enough, we need neuro-*semantic*Frank van Harmelen
Neuro-symbolic (NeSy) AI is on the rise. However, simply machine learning on just any symbolic structure is not sufficient to really harvest the gains of NeSy. These will only be gained when the symbolic structures have an actual semantics. I give an operational definition of semantics as “predictable inference”.
All of this illustrated with link prediction over knowledge graphs, but the argument is general.
Connector Corner: Automate dynamic content and events by pushing a buttonDianaGray10
Here is something new! In our next Connector Corner webinar, we will demonstrate how you can use a single workflow to:
Create a campaign using Mailchimp with merge tags/fields
Send an interactive Slack channel message (using buttons)
Have the message received by managers and peers along with a test email for review
But there’s more:
In a second workflow supporting the same use case, you’ll see:
Your campaign sent to target colleagues for approval
If the “Approve” button is clicked, a Jira/Zendesk ticket is created for the marketing design team
But—if the “Reject” button is pushed, colleagues will be alerted via Slack message
Join us to learn more about this new, human-in-the-loop capability, brought to you by Integration Service connectors.
And...
Speakers:
Akshay Agnihotri, Product Manager
Charlie Greenberg, Host
Epistemic Interaction - tuning interfaces to provide information for AI supportAlan Dix
Paper presented at SYNERGY workshop at AVI 2024, Genoa, Italy. 3rd June 2024
https://alandix.com/academic/papers/synergy2024-epistemic/
As machine learning integrates deeper into human-computer interactions, the concept of epistemic interaction emerges, aiming to refine these interactions to enhance system adaptability. This approach encourages minor, intentional adjustments in user behaviour to enrich the data available for system learning. This paper introduces epistemic interaction within the context of human-system communication, illustrating how deliberate interaction design can improve system understanding and adaptation. Through concrete examples, we demonstrate the potential of epistemic interaction to significantly advance human-computer interaction by leveraging intuitive human communication strategies to inform system design and functionality, offering a novel pathway for enriching user-system engagements.
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024Tobias Schneck
As AI technology is pushing into IT I was wondering myself, as an “infrastructure container kubernetes guy”, how get this fancy AI technology get managed from an infrastructure operational view? Is it possible to apply our lovely cloud native principals as well? What benefit’s both technologies could bring to each other?
Let me take this questions and provide you a short journey through existing deployment models and use cases for AI software. On practical examples, we discuss what cloud/on-premise strategy we may need for applying it to our own infrastructure to get it to work from an enterprise perspective. I want to give an overview about infrastructure requirements and technologies, what could be beneficial or limiting your AI use cases in an enterprise environment. An interactive Demo will give you some insides, what approaches I got already working for real.
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered QualityInflectra
In this insightful webinar, Inflectra explores how artificial intelligence (AI) is transforming software development and testing. Discover how AI-powered tools are revolutionizing every stage of the software development lifecycle (SDLC), from design and prototyping to testing, deployment, and monitoring.
Learn about:
• The Future of Testing: How AI is shifting testing towards verification, analysis, and higher-level skills, while reducing repetitive tasks.
• Test Automation: How AI-powered test case generation, optimization, and self-healing tests are making testing more efficient and effective.
• Visual Testing: Explore the emerging capabilities of AI in visual testing and how it's set to revolutionize UI verification.
• Inflectra's AI Solutions: See demonstrations of Inflectra's cutting-edge AI tools like the ChatGPT plugin and Azure Open AI platform, designed to streamline your testing process.
Whether you're a developer, tester, or QA professional, this webinar will give you valuable insights into how AI is shaping the future of software delivery.
Accelerate your Kubernetes clusters with Varnish CachingThijs Feryn
A presentation about the usage and availability of Varnish on Kubernetes. This talk explores the capabilities of Varnish caching and shows how to use the Varnish Helm chart to deploy it to Kubernetes.
This presentation was delivered at K8SUG Singapore. See https://feryn.eu/presentations/accelerate-your-kubernetes-clusters-with-varnish-caching-k8sug-singapore-28-2024 for more details.
UiPath Test Automation using UiPath Test Suite series, part 3DianaGray10
Welcome to UiPath Test Automation using UiPath Test Suite series part 3. In this session, we will cover desktop automation along with UI automation.
Topics covered:
UI automation Introduction,
UI automation Sample
Desktop automation flow
Pradeep Chinnala, Senior Consultant Automation Developer @WonderBotz and UiPath MVP
Deepak Rai, Automation Practice Lead, Boundaryless Group and UiPath MVP
Essentials of Automations: Optimizing FME Workflows with ParametersSafe Software
Are you looking to streamline your workflows and boost your projects’ efficiency? Do you find yourself searching for ways to add flexibility and control over your FME workflows? If so, you’re in the right place.
Join us for an insightful dive into the world of FME parameters, a critical element in optimizing workflow efficiency. This webinar marks the beginning of our three-part “Essentials of Automation” series. This first webinar is designed to equip you with the knowledge and skills to utilize parameters effectively: enhancing the flexibility, maintainability, and user control of your FME projects.
Here’s what you’ll gain:
- Essentials of FME Parameters: Understand the pivotal role of parameters, including Reader/Writer, Transformer, User, and FME Flow categories. Discover how they are the key to unlocking automation and optimization within your workflows.
- Practical Applications in FME Form: Delve into key user parameter types including choice, connections, and file URLs. Allow users to control how a workflow runs, making your workflows more reusable. Learn to import values and deliver the best user experience for your workflows while enhancing accuracy.
- Optimization Strategies in FME Flow: Explore the creation and strategic deployment of parameters in FME Flow, including the use of deployment and geometry parameters, to maximize workflow efficiency.
- Pro Tips for Success: Gain insights on parameterizing connections and leveraging new features like Conditional Visibility for clarity and simplicity.
We’ll wrap up with a glimpse into future webinars, followed by a Q&A session to address your specific questions surrounding this topic.
Don’t miss this opportunity to elevate your FME expertise and drive your projects to new heights of efficiency.
DevOps and Testing slides at DASA ConnectKari Kakkonen
My and Rik Marselis slides at 30.5.2024 DASA Connect conference. We discuss about what is testing, then what is agile testing and finally what is Testing in DevOps. Finally we had lovely workshop with the participants trying to find out different ways to think about quality and testing in different parts of the DevOps infinity loop.
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf91mobiles
91mobiles recently conducted a Smart TV Buyer Insights Survey in which we asked over 3,000 respondents about the TV they own, aspects they look at on a new TV, and their TV buying preferences.
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf
HLT 2013 - Big Data Navigation and Discovery by Stefan Andreasen & Jeff Godbold
1. Big Data Navigation and Discovery
Stefan Andreasen
Jeff Godbold
Andy Lasko
CTO & Founder
Director – Fed Solutions
Partner Manager
Kapow Software
Basis Technology
Kapow Software
2. “
The most meaningful way to
differentiate your company
from your competition, to
put distance between you
and the crowd, is to do an
outstanding job with
information.
HOW YOU GATHER,
MANAGE AND USE
INFORMATION WILL
DETERMINE WHETHER YOU
WIN OR LOSE.
”
Bill Gates
6. There will always be unknowns
The risky ones are the UNKNOWN UNKNOWNS
7. Coming Soon:
Toilet Paper Priced Like Airline
Tickets
Source: Wall Street Journal, September5, 2012, Coming Soon: Toilet Paper Priced Like Airline Tickets
[talk about video – how revolutionary this was at the time but how real it is in today’s world, not just for governments but for business….]The Known Known’s are the comfort zone, the missing piece in a world that we know everything about. Inventory levels, ordering history, employees turnover – all information I know I need and I know where and how to get it…. Nothing too unfamiliarBorders & Barnes&Noble – know competitors, successfully rolled-up the smaller bookstores (or put them out of business), they could monitor in the symmetrical wayData feed providers; API connectorsEnterprises focus on pre-selected segments ‘known’ data and generalize & analyze from it to the 'unseen' (unknown).
The known unknowns are the things you know that you don’t know and proactively trying to get. For example Borders & Barnes&Noble - knew about the internet, knew people were transacting on-line, knew books were being bought but it was an unknown – were they listening? They didn’t know how profound this will be to their business going forwards. As a large media and entertainment company I know my IP is being illegally used online, I don’t know where and how…In eCommerce, I know who my competitors are but I don’t know when are they going to change their prices, of product products, at what frequency, in what regions and SKUsAnd as consumers are becoming so much more social and connected, the decision buying journey involves so many more touch points that are influenced by social commentary. So I know my customers are active on social channels but what are they saying about my brand, what are the positives and negatives etc.This is where you are at as Kapow customers
There will always be unknowns and the risky ones are the unknown unknows, these are the things under the surface that can have a significant impact… the black swans…]Barnesvs Borders – Amazon is the BS – then eBooks It can be very difficult and can be expensive to collect these unknown (asymmetrical) data sets vs the easy data. But this has the potential to be Black Swan impact full. Very small changes to small pieces of this once unknown - now known data, can have massive impact to a Business. You have to be monitoring. Those of you using Kapow for Competitive pricing understand this.
Many people are under the impression that Big Data is not real. It’s a great concept that doesn’t apply to me. Let me show you how people are already using it today to gain significant competitive advantage. We all know airline ticket pricing is dynamic but did you know that how much you pay for toilet paper also depends on when you buy it…A recent article from the Wall Street Journal discussed the model of the modern retailer. Leading retailers are dynamically adjusting product prices in the course of a day, monitoring competitors prices and other market conditions and responding in real-time. It’s elasticity of pricing and it’s used with the most commodity products like toilet paper. You can see that if you are not in the game you are in the sideline… What does Amazon know that nobody else know?
Talk about the growth of the data and how it relates to your original vision when you started the company.
Big Interaction data is growing rapidly. There are 2.5 quintillion bytes of it created every day. 40k new websites are registered each day with a total of 140 mill.The information is out there for you to leverage
When it comes to Big Data there is no greatest data source than the Web. This includes blogs, forums, customer review sites, social media, variety of cloud applications and even your competitors, partners and channel portals. There were 170 million websites as of end of last year and 30 million new ones added every month (http://news.netcraft.com/archives/2011/12/09/december-2011-web-server-survey.html). These data sources are a gold mine of very valuable information for any analytical effort allowing companies to monitor customer behavior and buying patterns to improve sales campaigns or predicting market trends allowing financial services companies to trade ahead of their competition and significantly increase revenues. But as I will discuss later in my presentation the data spread across these growing number of sources also creates a challenge beyond volumes.
Thank you Jeff[Show Muslim Brotherhood site]To get this content, someone needs to search the archives [show], copy all of the documents and metadata [show] into an analytical environment. When you get a few hundred of these sources, the problem gets worse.[Build Robot]So let’s see how we’d build a connector to this data source. Build, debug, deploy, schedule[Show REST/JSON ]Analysts use Kapow to build connectors.IT uses Kapow to build synthetic API’s for application integration, data warehousing and content acquisition.Now that we have our data in Odyssey, I’ll pass it back to Jeff.[Show slide again and build #1 then #2]Speak to bullets 1….The Muslim Brotherhood site is a web application with documents and unstructured metadata. This is a common data source.We see these inside and outside of the Enterprise. Internally these are our CMS, Wiki’s, Intel-link, project repositories and so on.Often there no API’s, the wrong API’s, or means for an analyst to copy the content an analytical application.IT does not have the tools, time or budget to connect to every data source. Manual migration – copy and paste – is too time consuming. Kapow’s workflow development environment we used today provides us rapid access to virtually any content from any application.[Build 2]Let’s look at another example.Let’s say you want to capture a web page or web sites content without writing a connector. Kapow’s ‘Capture’ is a workflow that uses Kapow’s browser connector to request all a web servers browser accessible content.This includes HTML, Images, PDF’s and other file formats, CSS, JS, and even some links and images in Flash.Kapow’s web capture is like Google on Steroids and turns your file system and analytical environment into a mirror of the target web server.The Capture workflow is built in Kapow so it is easy to reconfigure for your exact needs.[Show Google Blog results] http://www.google.com/search?q=muslim+brotherhood&tbm=blgThese are the results of a Google blog search for the term ‘Muslim Brotherhood’. We can call the ‘Web Capture’ from a Browser Plugin like we see here, from a lightweight Kapow Kapplet, from the Kapow Scheduler, or from within Odyssey.[SHOW BUILD SLIDE #3]What if you have something harder? Something that requires logins, multiple coordinated workflows, content in multiple languages and you want to hide your content acquisition activity. [Show Site]I’ll use the Arab Chat room for Egypt as an example.I need to load the site, create a unique name, repeatedly search for conversations, capture the conversations and send them to Odyssey.[Show Robots]Talk through 3 robotsArabChat_Messages.robotArabChat_ChatsToBasis.robotArabChat_People_IP.robotAs Stefan mentioned, Kapow Software can connect to the content of any application – whether it’s a website, an Enterprise Application or something in between like hosted applications, SaaS, and partner portals – in minutes. Kapow can connect to databases, API’s, XML and other file formats.With Kapow Software, Basis users are able to get their content into Odyssey faster, cheaper and they can get content that is difficult to access. Now I’m going to pass it back over to Jeff to review some of this data.