The document discusses developing line-of-business applications with Microsoft Silverlight, outlining an agenda that includes recommended patterns like MVVM, communications using WCF RIA Services, extending applications beyond the browser to be installed locally, and enabling extensibility through frameworks like Prism and MEF. The presenter Nuno Godinho is then introduced along with his background and areas of expertise in Silverlight development.
The Complete IT Consulting & IT Solutions Provider
Hi-Tech ITO is an India-based IT company led and managed by corporate veterans. An ISO 9001:2008 certified company, it is one of the best IT consulting and IT solutions providers, delivering software development, website development, design, and programming services.
The India-based IT consulting firm has a successful track record of providing intelligent and innovative IT solutions to hundreds of clients for more than 10 years.
Specializations:
The IT outsourcing services and solutions provider specializes in custom software development, professional website design and development, ecommerce and shopping carts, mobile application development, HTML coding and web programming, enterprise and business intelligence solutions, etc.
Hemant Kothari is a software engineer with 2 years of experience in development, testing, and requirements gathering in the telecom domain. He has expertise in Tibco BW, Tibco EMS, Tibco Administrator, and IBM DataPower. He currently works as a software engineer at Tech Mahindra on integration projects for Orange and KPN, focusing on security, development, testing, and optimization. He holds a Bachelor's degree in Computer Science and Engineering from MCKVIE, Kolkata, with an 8.41 CGPA.
Automatic multi-modal metadata annotation based on trained cognitive solution...FIAT/IFTA
Jakob Rosinski presented on automatic multi-modal metadata annotation based on trained cognitive solutions. He discussed using various cognitive services like IBM Watson, Microsoft Cognitive Services, Google Vision, and OpenCV to analyze video and audio content. This includes scene detection, people and object detection, emotion detection, speech to text, and more. The extracted metadata can then be used to enrich content and power advanced search and discovery tools.
Digital transformation with AI and process automation.
Prior consulting use cases in the domain of talent acquisition, e-commerce, e-Publishing and HR analytics.
How to prepare a perfect video abstract for your research paper – Pubrica.pdfPubrica
This document provides guidance on creating a video abstract for a research paper. It defines a video abstract as a shorter video that retains the essential meaning of the original through a series of moving pictures. The document discusses techniques for video abstraction, including keyframes, animations, and PowerPoint presentations. It provides technical specifications for video quality and formatting and guidelines for video submission.
How to prepare a perfect video abstract for your research paper – Pubrica.pptxPubrica
A video abstract is a series of moving pictures taken from a lengthier movie that is significantly shorter than the original yet retains the original's essential meaning.
Learn More : https://bit.ly/3JVyrCW
Reference: https://pubrica.com/services/publication-support/Video-Abstract/
Why Pubrica:
When you order our services, we promise you the following – Plagiarism free | always on Time | 24*7 customer support | Written to international Standard | Unlimited Revisions support | Medical writing Expert | Publication Support | Bio statistical experts | High-quality Subject Matter Experts.
Contact us:
Web: https://pubrica.com/
Blog: https://pubrica.com/academy/
Email: sales@pubrica.com
WhatsApp : +91 9884350006
United Kingdom: +44-1618186353
1) The document discusses various Watson capabilities including microservices for language, speech, vision, and data, as well as embodied cognition. It provides examples of use cases demonstrating speech to text with multiple speakers, a school navigator chatbot, expertise finder, and multimedia enrichment.
2) Live demos are shown for speech to text with diarization, speech to speech translation, a school finder application, and multimedia processing of video and audio.
3) The multimedia enrichment pipeline is described in detail, outlining how video and audio inputs are processed using various Watson APIs to extract metadata like transcripts, entities, keywords, and visual recognition results.
Evolve your app’s video experience with Azure: Processing and Video AI at scaleMicrosoft Tech Community
This document discusses Microsoft's announcements at Build regarding improvements to Azure Media Services. The updates include a simplified development model with single API for media services using ARM, HTTP ingest support on jobs, and new transform templates. New functionality includes new presets for video and audio analysis, live resources to simplify broadcast workflows, and role-based access control. New SDKs for .NET Core, Python, Java, Node.js and Go are also announced.
The document discusses using computer vision and media analytics technologies to create new opportunities for value-added connected TV services. It describes using these technologies to capitalize on existing content rights, unlock new niche markets, create a better viewing experience, and grow the business. Specific technologies mentioned include speech-to-text, facial detection, emotion recognition, video summarization, object recognition, and face redaction.
The document provides an overview of various Watson services that are available on IBM's Bluemix platform. It describes services such as Personality Insights, Text to Speech, Language Translation, Relationship Extraction, Question and Answer, Tone Analyzer, and Concept Expansion. For each service, it provides a brief description of what the service is, how it works, and potential use cases. The document is intended to educate readers on Watson services that can be accessed through Bluemix and their capabilities.
An Stepped Forward Security System for Multimedia Content Material for Cloud ...IRJET Journal
The document discusses a proposed system for securing multimedia content on cloud infrastructures. The system uses a two-level approach: 1) generating signatures for 3D videos to robustly represent them with little storage, and 2) a distributed matching engine for scalably storing and matching signatures of original and query objects. The system was tested on over 11,000 3D videos and 1 million images, achieving high accuracy and scalability when deployed on Amazon cloud resources.
This document provides information on building intelligent bots using Azure Bot Service and Cognitive Services. It discusses using pre-built AI like Cognitive Services APIs or custom AI with Azure Machine Learning. It also summarizes capabilities of Azure Bot Service like integrating various channels, using the Bot Builder SDK, and deploying bots to Azure. Examples of building speech, vision, and question-answering bots are also provided.
Create professional e-learning content and publish with the click of a button.
Discover the possibilities of an easy, collaborative authoring tool in the Cloud.
Deliver high-quality messaging, screen sharing, audio, and video capabilities...Jorge Fonseca
Your customers are asking to collaborate more effectively within your telemedicine apps, retail apps, gaming apps, and more. By attending this session, you will learn how to leverage a set of SDKs and APIs to embed video and audio capabilities in your apps. Eliminate the cost, complexity, and friction of creating and maintaining real-time communication infrastructure and services.
[DSC Europe 22] On building a video recommendation system and other use-cases...DataScienceConferenc1
I will talk about building a recommendation system for our internal video-hosting platform, used by thousands of employees. I will describe how our project is organized, the ML approach we use and the issues we are facing, what our business goal is, what kinds of features we use, how quality is measured, and how the model is monitored. In addition, I will cover some of the other recommender system use cases we have in the company.
The document discusses patents and inventions. It notes that patents are valuable intellectual property assets and that the process of obtaining a patent can be long but protecting ideas and inventions is important. IBM leverages patents significantly through differentiation of products, maintaining a technology lead, and licensing patents to others. The rest of the document provides details on 13 of Mariana's inventions and patents.
A confluence of events is accelerating the growth of AI in the Enterprise - (i) The COVID pandemic is accelerating the digital transformation of enterprises, (ii) increased digital sales & digital interaction is fueling interest in operationalizing AI to drive revenue and cost efficiencies and (iii) Enterprise databases and enterprise apps are infusing AI to transparently augment predictive capabilities for clients. Enterprise Power Systems are pillars of the global economy hosting our trinity of operating systems
In this session, you'll get all the answers about how ChatGPT and other GPT-X models can be applied to your current or future project. First, we'll put all the terms in order (OpenAI, GPT-3, ChatGPT, Codex, Dall-E, etc.) and explain why Microsoft and Azure are often mentioned in this context. Then, we'll go through the main capabilities of Azure OpenAI and the respective use cases that might inspire you to either optimize your product or build a completely new one.
Artificial Intelligence on the AWS PlatformAdrian Hornsby
This document discusses artificial intelligence capabilities on the AWS platform. It introduces Amazon Polly for text-to-speech, Rekognition for image analysis, and Lex for speech recognition and natural language understanding. It also discusses the Deep Learning AMI for popular deep learning frameworks and GPU-accelerated EC2 instances for AI workloads. Case studies are provided of companies using these AWS AI services.
How Amazon AI Can Help You Transform Your Education Business | AWS WebinarAmazon Web Services
Machine Learning (ML) is a hot topic in the education industry. In this webinar, you will learn how AWS customers are using ML on AWS to transform their companies and their products. We'll go through many use cases across different industries (education, media, finance, retail, etc.), both in the enterprise and the startup worlds. In the process, we'll introduce you to the growing family of API-driven ML services that provide EdTechs with everything they need to innovate and build transformative learning solutions for the education marketplace.
Machine Learning on AWS for Developers, Data Scientists, and ExpertsAWS Germany
In this talk, we give an overview, with examples, of current tools for machine learning (ML) on AWS. The overview covers the full range of options, from easy-to-use, fully managed ML services for developers, through ML platforms for data scientists, to ML-optimized infrastructure and software components. Examples and online demos show how easily ML methods can be used on AWS.
Moderator: Christian Petters, Solutions Architect, AWS
Google Cloud Platform provides tools for storage, computing, networking, machine learning and analytics that are powered by Google's technology and can be used to organize information and make it universally accessible. It includes services like Compute Engine, Kubernetes Engine, BigQuery, Cloud SQL, Cloud Storage and machine learning APIs for vision, translation and speech. Customers can build and manage applications on Google's scalable and fully-managed infrastructure at low cost.
This document summarizes Netex learningMaker, an e-learning authoring tool that allows users to create professional HTML5 content for publishing. It offers templates, collaborative features, device responsiveness, and publishing capabilities. Key features include content creation tools, project management features, user and permissions management, and error reporting. Technical requirements and support plans are also outlined. Additional integrated services like content creation assistance and system integrations are mentioned.
IWE 2480 - An Ecosystem of Innovation: Creating Cognitive Apps Powered by IB...Carmine DiMascio
The document is a keynote presentation about IBM Watson and the IBM Watson ecosystem. Some key points:
1. IBM Watson is a question answering computer system that understands natural language, generates hypotheses, and learns from interactions. It can provide responses with confidence levels and evidence.
2. The IBM Watson ecosystem brings Watson's cognitive capabilities to the cloud through tools, APIs, content stores, and talent hubs. It allows third parties to build cognitive apps powered by Watson.
3. An example app called Watson Films demonstrates how to integrate with Watson using APIs to build an application for getting information about movies from Watson. The app was developed using tools from the IBM Watson ecosystem.
This document discusses research on detecting deception in real-time audio and video streams. It outlines challenges in synchronizing, capturing, indexing and analyzing multiple streams. It proposes using MPEG-7 semantic annotations to generate knowledge bases for analysis. The research tests infrastructure for capturing, storing and retrieving segmented streams in SQL Server 2008. It also demonstrates prototype avatar animation controlled by Python scripts. Further studies are needed on the visual concept models and detection analysis engine.
The document describes IBM's Budapest Lab, formerly known as Ustream, which was acquired by IBM in 2016. It provides full-scale product development capabilities for video products, including engineering, DevOps, product management, UX design, data science, sales, and customer relations. The lab is using cognitive technologies like IBM Watson to help media clients leverage, monetize and understand video to drive their business through products like video enrichment, captioning, and recommendation engines. Case studies are presented showing how these products have helped clients like Sinclair Broadcast Group, Fox Sports, Workday, and the Grammy Awards enhance fan and user experiences.
An overview of the results of the 2021 FIAT/IFTA Timeline Survey, as presented by Adrienne Warburton during the 2021 FIAT/IFTA World Conference (online).
The FIAT/IFTA Most Wanted List is a new initiative of FIAT/IFTA. The aim is to create a central hub of Most Wanted Lists, provided by broadcast and audiovisual archives worldwide.
On these lists we would put those programmes, media fragments, excerpts or even complete series that archives are desperately looking for. Via a contact button, other archives could get in touch with the archive that has published its list, in order to signal a possible trouvaille.
All further explanations and a link to a survey to measure the interest are in this presentation.
Similar to Rosinski ibm ai overview with several examples of projects in the media and lessons learned
This document presents the results of a survey conducted by the FIAT/IFTA Media Management Commission to understand where member organizations are on the digital archiving timeline. The survey asked about preservation formats, content management systems, access methods, metadata creation, and public connections. 79 organizations responded, mostly broadcasters and audiovisual archives. The results show most have transferred files to mass storage, use digital asset management systems, provide online access, and feature content on websites and social media. The document compares 2020 responses to previous years.
As presented by Johan Oomen (Sound and Vision) and Vasilis Mezaris (Information Technologies Institute Thessaloniki) at the 2019 FIAT/IFTA World Conference in Dubrovnik, Croatia.
BUCHMAN Digitisation of quarter inch audio tapes at DR (FRAME Expert)FIAT/IFTA
The document discusses the benefits of exercise for mental health. Regular physical activity can help reduce anxiety and depression and improve mood and cognitive functioning. Exercise causes chemical changes in the brain that may help protect against mental illness and improve symptoms.
CULJAT (FRAME Expert) Public procurement in audiovisual digitisation at RTÉFIAT/IFTA
This document discusses procurement rules and approaches for converting physical archive media like tapes, films, and collections into a digital archive. It considers two approaches: conducting digitization in-house versus outsourcing. It also covers national and EU tendering processes, different types of procurement competitions, and factors to consider like internal resources and timelines, technical specifications, and supplier capabilities and costs. The goal is to establish the most efficient and compliant approach to digitizing a large physical archive collection.
HULSENBECK Value Use and Copyright Commission initiativesFIAT/IFTA
The document discusses concepts related to copyright including value, use, creative reuse, intellectual property rights, open access, and the Value, Use & Copyright Commission (VUC). It provides links to external resources on copyright including flowcharts and case studies. The document poses questions about these copyright-related topics.
This document discusses the challenges and benefits of digitizing a film archive. It notes that the challenges of digitizing include a lack of funding, film handling skills, and local telecine facilities, as well as unreliable metadata and unsuitable equipment. However, the benefits would include increased access and reuse of the archive, improved search capabilities, preservation of fragile originals, improved collection building, and development of film skills.
LORENZ Building an integrated digital media archive and legal depositFIAT/IFTA
The document discusses building an integrated digital media archive and legal deposit system for preserving film and video assets in Slovenia. It summarizes the requirements presented by Vladimir Torov from the Ministry of Justice in Slovenia, which include analyzing the current analog system, choosing standards, identifying hardware and software solutions, and creating workflows. The key requirements are for hardware like high-end servers and film scanners, and customized software for a media asset management system, flexible workflow system, quality control checks, and user rights management. The presentation then discusses how Cube-Tec can provide solutions to meet these requirements, including verification of assets and metadata, a web-based media player, database import/export, and a flexible workflow system.
CANTU VT is TV The History of Argentinian Video Art and Television Archives P...FIAT/IFTA
This document discusses the history of video art and television in Argentina, mentioning several important figures and events. It references the dictatorship of Augusto Pinochet in Chile and the military dictatorship in Argentina led by General Leopoldo Galtieri. It also lists several Argentine video artists and their works that helped establish video art as a new form of creative expression during this political period.
Over 8 million people in Western Europe are living with dementia. Music has been described as "the profoundest nonchemical medication" by Oliver Sacks. The document discusses using music and reminiscence work to help those with dementia, as well as a goal for everyone with dementia in the UK to have access to music by 2020. It also considers how television and radio archives can support this by improving lives through their archived content and partnerships.
PERVIZ Automated evolvable media console systems in digital archivesFIAT/IFTA
Al Jazeera Media Network has a large daily influx of content that needs to be ingested and stored in their central archive. They are implementing automated and AI-based media console systems to help manage this content more efficiently. The systems will use machine learning to analyze videos, enrich metadata, and translate scripts and text. This will help improve content discovery, retrieval and accessibility across Al Jazeera's many language channels and platforms.
LYCKE Artificial intelligence, hype or hope?FIAT/IFTA
This document discusses the potential for using artificial intelligence to help manage an archive of media content at VRT, a Belgian public broadcaster. It notes that the archive department is facing budget cuts and growing amounts of content to archive. It considers using AI for tasks like speech recognition to generate metadata for older content lacking it. While AI could create metadata quickly without human effort, challenges include the quality of AI-generated metadata and needing to train models on archive-specific data. The conclusion is that AI shows both hype and hope for archives - it may help with searchability over time with focused use cases, but will require experimentation and expertise to apply effectively for the archive's needs.
AZIZ BABBUCCI Let's play with the archiveFIAT/IFTA
RSI is developing an augmented reality mobile application called RSI ARchive to provide access to archival audiovisual content from RSI in a novel way. Using AR and GPS technologies, the app will allow users to discover contextually relevant archival content linked to their real-world location. Metadata including titles, descriptions and coordinates will offer users an enriched experience of exploring the archival materials and learning about the history of different regions. The goals are to strengthen the connection between archival content and locations, make quality historical content accessible, and enable new collaborations with partners such as in tourism and museums.
Global Situational Awareness of A.I. and where its headedvikram sood
You can see the future first in San Francisco.
Over the past year, the talk of the town has shifted from $10 billion compute clusters to $100 billion clusters to trillion-dollar clusters. Every six months another zero is added to the boardroom plans. Behind the scenes, there’s a fierce scramble to secure every power contract still available for the rest of the decade, every voltage transformer that can possibly be procured. American big business is gearing up to pour trillions of dollars into a long-unseen mobilization of American industrial might. By the end of the decade, American electricity production will have grown tens of percent; from the shale fields of Pennsylvania to the solar farms of Nevada, hundreds of millions of GPUs will hum.
The AGI race has begun. We are building machines that can think and reason. By 2025/26, these machines will outpace college graduates. By the end of the decade, they will be smarter than you or I; we will have superintelligence, in the true sense of the word. Along the way, national security forces not seen in half a century will be unleashed, and before long, The Project will be on. If we’re lucky, we’ll be in an all-out race with the CCP; if we’re unlucky, an all-out war.
Everyone is now talking about AI, but few have the faintest glimmer of what is about to hit them. Nvidia analysts still think 2024 might be close to the peak. Mainstream pundits are stuck on the wilful blindness of “it’s just predicting the next word”. They see only hype and business-as-usual; at most they entertain another internet-scale technological change.
Before long, the world will wake up. But right now, there are perhaps a few hundred people, most of them in San Francisco and the AI labs, that have situational awareness. Through whatever peculiar forces of fate, I have found myself amongst them. A few years ago, these people were derided as crazy—but they trusted the trendlines, which allowed them to correctly predict the AI advances of the past few years. Whether these people are also right about the next few years remains to be seen. But these are very smart people—the smartest people I have ever met—and they are the ones building this technology. Perhaps they will be an odd footnote in history, or perhaps they will go down in history like Szilard and Oppenheimer and Teller. If they are seeing the future even close to correctly, we are in for a wild ride.
Let me tell you what we see.
Codeless Generative AI Pipelines
(GenAI with Milvus)
https://ml.dssconf.pl/user.html#!/lecture/DSSML24-041a/rate
Discover the potential of real-time streaming in the context of GenAI as we delve into the intricacies of Apache NiFi and its capabilities. Learn how this tool can significantly simplify the data engineering workflow for GenAI applications, allowing you to focus on the creative aspects rather than the technical complexities. I will guide you through practical examples and use cases, showing the impact of automation on prompt building. From data ingestion to transformation and delivery, witness how Apache NiFi streamlines the entire pipeline, ensuring a smooth and hassle-free experience.
Timothy Spann
https://www.youtube.com/@FLaNK-Stack
https://medium.com/@tspann
https://www.datainmotion.dev/
milvus, unstructured data, vector database, zilliz, cloud, vectors, python, deep learning, generative ai, genai, nifi, kafka, flink, streaming, iot, edge
4th Modern Marketing Reckoner by MMA Global India & Group M: 60+ experts on W...Social Samosa
The Modern Marketing Reckoner (MMR) is a comprehensive resource packed with POVs from 60+ industry leaders on how AI is transforming the 4 key pillars of marketing – product, place, price and promotions.
The Ipsos - AI - Monitor 2024 Report.pdfSocial Samosa
According to Ipsos AI Monitor's 2024 report, 65% Indians said that products and services using AI have profoundly changed their daily life in the past 3-5 years.
Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...Aggregage
This webinar will explore cutting-edge, less familiar but powerful experimentation methodologies which address well-known limitations of standard A/B Testing. Designed for data and product leaders, this session aims to inspire the embrace of innovative approaches and provide insights into the frontiers of experimentation!
Learn SQL from basic queries to Advance queriesmanishkhaire30
Dive into the world of data analysis with our comprehensive guide on mastering SQL! This presentation offers a practical approach to learning SQL, focusing on real-world applications and hands-on practice. Whether you're a beginner or looking to sharpen your skills, this guide provides the tools you need to extract, analyze, and interpret data effectively.
Key Highlights:
Foundations of SQL: Understand the basics of SQL, including data retrieval, filtering, and aggregation.
Advanced Queries: Learn to craft complex queries to uncover deep insights from your data.
Data Trends and Patterns: Discover how to identify and interpret trends and patterns in your datasets.
Practical Examples: Follow step-by-step examples to apply SQL techniques in real-world scenarios.
Actionable Insights: Gain the skills to derive actionable insights that drive informed decision-making.
Join us on this journey to enhance your data analysis capabilities and unlock the full potential of SQL. Perfect for data enthusiasts, analysts, and anyone eager to harness the power of data!
#DataAnalysis #SQL #LearningSQL #DataInsights #DataScience #Analytics
State of Artificial intelligence Report 2023kuntobimo2016
Artificial intelligence (AI) is a multidisciplinary field of science and engineering whose goal is to create intelligent machines.
We believe that AI will be a force multiplier on technological progress in our increasingly digital, data-driven world. This is because everything around us today, ranging from culture to consumer products, is a product of intelligence.
The State of AI Report is now in its sixth year. Consider this report as a compilation of the most interesting things we’ve seen with a goal of triggering an informed conversation about the state of AI and its implication for the future.
We consider the following key dimensions in our report:
Research: Technology breakthroughs and their capabilities.
Industry: Areas of commercial application for AI and its business impact.
Politics: Regulation of AI, its economic implications and the evolving geopolitics of AI.
Safety: Identifying and mitigating catastrophic risks that highly-capable future AI systems could pose to us.
Predictions: What we believe will happen in the next 12 months and a 2022 performance review to keep us honest.
End-to-end pipeline agility - Berlin Buzzwords 2024Lars Albertsson
We describe how we achieve high change agility in data engineering by eliminating the fear of breaking downstream data pipelines through end-to-end pipeline testing, and by using schema metaprogramming to safely eliminate boilerplate involved in changes that affect whole pipelines.
A quick poll on agility in changing pipelines from end to end indicated a huge span in capabilities. For the question "How long time does it take for all downstream pipelines to be adapted to an upstream change," the median response was 6 months, but some respondents could do it in less than a day. When quantitative data engineering differences between the best and worst are measured, the span is often 100x-1000x, sometimes even more.
A long time ago, we suffered at Spotify from fear of changing pipelines due to not knowing what the impact might be downstream. We made plans for a technical solution to test pipelines end-to-end to mitigate that fear, but the effort failed for cultural reasons. We eventually solved this challenge, but in a different context. In this presentation we will describe how we test full pipelines effectively by manipulating workflow orchestration, which enables us to make changes in pipelines without fear of breaking downstream.
Making schema changes that affect many jobs also involves a lot of toil and boilerplate. Using schema-on-read mitigates some of it, but has drawbacks since it makes it more difficult to detect errors early. We will describe how we have rejected this tradeoff by applying schema metaprogramming, eliminating boilerplate but keeping the protection of static typing, thereby further improving agility to quickly modify data pipelines without fear.
Population Growth in Bataan: The effects of population growth around rural pl...
Rosinski ibm ai overview with several examples of projects in the media and lessons learned
1. FIAT/IFTA Media Management Seminar
“Game Changers? From Automation to Curation: Futureproofing AV Content”
IBM AI Overview with several Examples of
Projects in the Media and Lessons Learned
Jakob Rosinski | Lead Architect Video Solutions & Broadcast Industry Europe
Stockholm | 13.06.2018
2. This speech gives an overview of client projects in the media archive space worldwide to which IBM has
contributed with its own AI, named Watson, and also with its knowledge and integration capabilities. Major topics are
scope definition and use case identification, and the use of cognitive services of different kinds and vendors, with
successes and open problems. In such a multi-modal approach, training of services is also key, and the speech should
show how this can be managed from both a human and a machine perspective.
Abstract
Jakob is the Lead Architect for Video Solutions & Broadcast Industry for IBM Services
in Europe. He is also the product owner of IBM AREMA, a workflow and essence
management solution which is widely used at different broadcasters for essence
archives and workflow automation.
Over the last decade Jakob was responsible for various projects in the media industry
at HBO, France24, ORF, SRF, RTL Mediengruppe or Deutsche Bundesliga/Sportcast.
He is an expert for multi-site & multi-tier essence management and workflow
automation for ingest, archive, production & distribution.
Further he is known and valued as a subject matter expert for the topics above in the
WW IBM M&E community. He is skilled at translating business needs into systems
solutions.
3. Video Enrichment uses industry leading AI capabilities to analyze textual, audio, and visual data
within multi-media content, and to build easily searchable metadata packages for every asset.
By understanding content in new ways, media companies can improve content discovery,
increase operational efficiency, deliver higher ad revenues, drive viewer engagement and offer
entirely new ways to meet the demands of their businesses.
Enriched content is inherently more searchable. Improved content discovery in your consumer
service leads to increased usage.
4. Cognitive base services used for content enrichment
Enhanced and automated understanding of personalities present in the frame, and objects
Activate decade-old material by running it through the STT API and then performing deeper analytics
Deeper understanding of concepts, recognized entities, keywords, and relationships
Target: deeply enriched content, second-to-second
Search image and video data for objects or contexts that have not been trained
Visual Recognition
Audiomining & Speech to Text
NLU & Translation
Videodetection / Speed / Movement
Pattern Detection & Similarity Search
5. A lot of vendors are providing base cognitive
services...
Visual Recognition
Audiomining & Speech to Text
NLU & Translation
Videodetection / Speed /
Movement
Pattern Detection &
Similarity Search
11.
Customer
MAM or DAM
Enriched metadata is delivered as an open JSON bundle to be
stored and used for search, compliance, recommendation and
other vital use cases.
Assets are acquired, ingested, processed and enriched
using the Watson Media platform.
SEMANTIC SCENE CHAPTERING
Divides the Media into meaningful chunks or chapters that can be more
easily managed by people responsible for editing or producing.
SPEECH TO TEXT
Converts audio into text, by leveraging machine intelligence to combine
information about grammar and language structure with knowledge of
the composition of the audio signal. Trainable.
NATURAL LANGUAGE UNDERSTANDING
Using the textual output of S2T or a closed caption file, NLU derives:
Concepts, Document-Level Emotions, Sentiment, Entities, Keywords,
Language, & Taxonomy. Trainable.
VISUAL RECOGNITION
Detects the contents of an image or video frame, answering the
question: “What is in this image?” Returns class, class description, face
detection, and text recognition. Trainable.
TONE ANALYZER & PERSONALITY INSIGHTS
Provide additional features that document the Emotional Tone, Writing
Tone, Social Tone of dialogue, as well as the overall personalities of
characters based on their words.
Watson Video Enrichment Workflow
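The workflow above ends with enriched metadata delivered as an open JSON bundle. A minimal sketch of what such a bundle could look like follows. The field names (`asset_id`, `annotations`, `start_s`, `end_s`, `label`, `confidence`) and the sample values are assumptions made for illustration; the slides do not show the actual Watson Media schema.

```python
import json
from dataclasses import asdict, dataclass


@dataclass
class Annotation:
    """One time-coded result from one enrichment service (hypothetical shape)."""
    service: str       # e.g. "speech_to_text" or "visual_recognition"
    start_s: float     # start of the annotated span, in seconds
    end_s: float       # end of the annotated span, in seconds
    label: str         # transcript text, detected class, keyword, ...
    confidence: float  # score reported by the service, 0.0 to 1.0


def build_bundle(asset_id, annotations):
    """Group annotations by service, ordered by start time, into one dict."""
    by_service = {}
    for ann in sorted(annotations, key=lambda a: a.start_s):
        entry = {k: v for k, v in asdict(ann).items() if k != "service"}
        by_service.setdefault(ann.service, []).append(entry)
    return {"asset_id": asset_id, "annotations": by_service}


# Invented sample data, only to show the shape of the result.
annotations = [
    Annotation("speech_to_text", 12.0, 15.5, "welcome to the evening news", 0.91),
    Annotation("visual_recognition", 30.0, 31.0, "stadium", 0.77),
    Annotation("visual_recognition", 12.0, 13.0, "news studio", 0.84),
]

bundle = build_bundle("asset-0001", annotations)
print(json.dumps(bundle, indent=2))
```

Keeping every entry time-coded is what allows a MAM or DAM system to search inside an asset rather than only across assets.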
15. Scene Detection
Deep Video-Analysis
People-, Object and Context-Detection
Classification of actors based on 24 emotions
Classification of scenes based on 22,000 categories
Deep Audio-Analysis
Background
Actor sentiment and tone
Analysis of scene composition
Classification of light and color
Analysis of successful trailers to automatically create a new one
https://www.youtube.com/watch?v=gJEzuYynaiw
16. Concept and proof of an automatic content
enrichment system for 40+ years of soccer history
Annotation using a portfolio of cognitive solutions
Audio: Speech-to-text / Transcript
Audio: Speaker-Detection
Audio: Atmosphere (cheers, whistles, ..)
Video: Angle/Camera & Context Detection
Video: Face- & Object Detection
Domain-trained services including a training portal
Sharpening of results through domain knowledge,
creation of timelines, and identification of concepts
Link with game and player data
Optimize content analysis and search based on game
and player statistics
Guided search.
Persona-based User Experience
Personalized Discovery, Suggestions, Design &
Projects
Content enrichment for
Bundesliga archive
17.
Target: Automatic content enrichment
of 30+ years of show content
Annotation by usage of a portfolio of
cognitive solutions (IBM, OpenCV)
Audio: Speech-to-text / Transcript /
Phrase detection
Video: Angle/Camera & Context
Detection
Video: Face- & Object Detection
Domain-trained services including a
training portal
Sharpening of results through domain
knowledge, creation of timelines, and
identification of concepts
Content enrichment for
Brazil's most famous TV show
18. Architecture of “Captain Caption” Demo
AREMA
Speech
to Text
Deep Learning –
Sound
Recognition
Natural
Language
Understanding
Conform results into one closed caption file
Translation into target language
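The step of conforming speech-to-text output and sound-recognition output into one closed caption file can be sketched as follows. This is an illustrative sketch, not the implementation behind the demo: it assumes both services return time-coded cues, it writes the widely used SubRip (SRT) layout, and the `Cue` type and sample cues are hypothetical. The translation step is left out.

```python
from dataclasses import dataclass


@dataclass
class Cue:
    start_s: float  # cue start in seconds
    end_s: float    # cue end in seconds
    text: str       # spoken words or a sound description


def srt_timestamp(seconds: float) -> str:
    """Format seconds as the HH:MM:SS,mmm timestamp used by SRT files."""
    total_ms = round(seconds * 1000)
    hours, rem = divmod(total_ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, ms = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def conform_to_srt(speech, sounds):
    """Merge speech cues and bracketed sound cues into one SRT document."""
    sound_cues = [Cue(c.start_s, c.end_s, f"[{c.text}]") for c in sounds]
    merged = sorted(speech + sound_cues, key=lambda c: (c.start_s, c.end_s))
    blocks = [
        f"{i}\n{srt_timestamp(c.start_s)} --> {srt_timestamp(c.end_s)}\n{c.text}"
        for i, c in enumerate(merged, start=1)
    ]
    return "\n\n".join(blocks) + "\n"


# Invented sample cues standing in for the two service outputs.
speech = [Cue(0.0, 2.0, "Welcome back."), Cue(5.0, 7.5, "What a goal!")]
sounds = [Cue(2.5, 4.0, "crowd cheering")]
srt = conform_to_srt(speech, sounds)
print(srt)
```

Sound events are wrapped in square brackets, a common captioning convention for non-speech audio.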
19.
Context / Solution
Frame-accurate detection of trained lead-in and lead-out scenes, in order to mark those
scenes in the content and exchange them automatically in the master format without
transcoding (unwrap, cut, wrap) and with appropriate audio track handling, to
enable fast channel switching of content.
• Use of an in-house detection component built on OpenCV and Watson VR for
frame-accurate detection of scenes.
• Use of AREMA's Dalet Galaxy integration to pull and push content directly from and to the
MAM system, with no need to extend Galaxy for this purpose
• Automatically scalable by using the AREMA autoscaler in combination with
Kubernetes & Docker
• Use of the AREMA MXF Package for
• metadata extraction from the source file
• rewrapping / preparation of the audio track schema of the new scene
• partial cut of the source file
• conforming of all parts to the target file
=> very fast, no transcoding or change of audio and video streams
Use Case: “Implement a fully integrated, trained
cognitive service to exchange ident in and out
scenes”
Result:
• Fully automated exchange of scenes, deeply integrated with the existing environment
• Nearly endlessly scalable, as all components can run in a Kubernetes/Docker environment. This leads to a significant reduction in time and staff effort and a faster
change of content between programs => from 3 months (2 full-time persons) to days
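Once the ident scenes have been detected frame-accurately, the exchange reduces to building a cut list: keep the untouched parts of the source, and substitute a new ident wherever an old one was found. The sketch below shows only that bookkeeping, under the assumption that detection yields sorted, non-overlapping, inclusive frame ranges. The `Segment` type and `build_edit_list` function are hypothetical names, and the actual partial cut and rewrap of the MXF file (done by the AREMA MXF Package according to the slide) is not modelled.

```python
from typing import List, NamedTuple, Tuple


class Segment(NamedTuple):
    source: str       # "original" or "replacement"
    first_frame: int  # first frame to copy from that source (inclusive)
    last_frame: int   # last frame to copy from that source (inclusive)


def build_edit_list(
    total_frames: int,
    ident_ranges: List[Tuple[int, int]],
    replacement_lengths: List[int],
) -> List[Segment]:
    """Return the ordered segments that make up the target file."""
    if len(ident_ranges) != len(replacement_lengths):
        raise ValueError("one replacement length is needed per ident range")
    edits: List[Segment] = []
    cursor = 0  # next source frame that has not been handled yet
    for (first, last), new_len in zip(ident_ranges, replacement_lengths):
        if first < cursor or last < first or last >= total_frames or new_len < 1:
            raise ValueError(f"invalid ident range or length: {(first, last)}, {new_len}")
        if first > cursor:
            # Untouched programme content before this ident.
            edits.append(Segment("original", cursor, first - 1))
        # The new ident takes the place of the detected one.
        edits.append(Segment("replacement", 0, new_len - 1))
        cursor = last + 1
    if cursor < total_frames:
        edits.append(Segment("original", cursor, total_frames - 1))
    return edits


# Invented example: a 1000-frame clip with an opening and a closing ident.
edit_list = build_edit_list(1000, [(0, 74), (925, 999)], [50, 100])
for segment in edit_list:
    print(segment)
```

Because every segment is copied as-is, a list like this is compatible with the "no transcoding" goal stated above.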
20. Each use case of multimodal analysis has different requirements, so the workflows and the
combination of AI services have to be adapted to these requirements
This is where the following model provides flexibility to adapt to each unique use case of
multimodal analytics
Vendor-independent usage of cognitive services
The whole is greater than the sum of its parts (Aristotle), but sometimes particular
"tiny" use cases are also worth evaluating
Flexible MULTIMODALITY is a must
There is no One Size Fits All
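Vendor independence of this kind is often achieved by hiding each service behind a common interface, so that a workflow names a modality and a vendor instead of calling a vendor SDK directly. The following is a minimal sketch under that assumption. The registry, the vendor names and the stub analyzers are hypothetical placeholders that return canned data; no real vendor API is called.

```python
from typing import Callable, Dict, List

# An analyzer takes an asset path and returns a list of annotation dicts.
Analyzer = Callable[[str], List[dict]]

# modality -> vendor -> analyzer function
REGISTRY: Dict[str, Dict[str, Analyzer]] = {}


def register(modality: str, vendor: str):
    """Decorator that files an analyzer under its modality and vendor."""
    def decorator(fn: Analyzer) -> Analyzer:
        REGISTRY.setdefault(modality, {})[vendor] = fn
        return fn
    return decorator


@register("speech_to_text", "vendor_a")
def vendor_a_stt(asset_path: str) -> List[dict]:
    # Placeholder: a real adapter would call the vendor's service here.
    return [{"label": "transcript placeholder", "confidence": 0.9}]


@register("visual_recognition", "vendor_b")
def vendor_b_visual(asset_path: str) -> List[dict]:
    # Placeholder: a real adapter would call the vendor's service here.
    return [{"label": "object placeholder", "confidence": 0.7}]


def run_workflow(asset_path: str, plan: Dict[str, str]) -> Dict[str, List[dict]]:
    """Run the analyzer chosen for each modality; plan maps modality to vendor."""
    results = {}
    for modality, vendor in plan.items():
        try:
            analyzer = REGISTRY[modality][vendor]
        except KeyError:
            raise LookupError(f"no analyzer for {modality!r} from {vendor!r}") from None
        results[modality] = analyzer(asset_path)
    return results


results = run_workflow(
    "clip.mxf",
    {"speech_to_text": "vendor_a", "visual_recognition": "vendor_b"},
)
print(results)
```

With this layout, changing vendors for one use case means editing the plan, not the workflow code.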
21.
Elemental parts of a content
enrichment platform
Multi-Modality &
Training &
Vendor independence
Data-Consolidation &
Monitoring
Integration
& Workflow
23. Why is training necessary?
- How do we tell Will Ferrell (famous actor) apart from
Chad Smith (famous rock musician)?
- Challenges include:
• Out-of-Plane Rotation: frontal, 45 degree, profile,
upside down
• Presence of beard, mustache, glasses.
• Facial Expressions
• Occlusions by long hair, hand
• In-Plane Rotation
• Image conditions: size, lighting condition, distortion,
noise, compression
Trust me, these are two different, unrelated people!
https://medium.com/@ageitgey/machine-learning-is-fun-part-4-modern-face-recognition-with-deep-learning-c3cffc121d78
24. A lot of vendors are providing base cognitive services...but without
individual training they do not provide sufficient benefit
[Figure: accuracy (0 to 90%) plotted against training data size, with three learning curves: base AI model, industry/domain AI model, and customized user (customer-adapted) AI model.]
As the domain specializes, learning accelerates
• Public models
• Pre-trained
• Limited accuracy for typical real-life use cases
• Trained with proprietary data
• Data ownership critical for differentiation
Automated TRAINING is a must
27. Parts of a successful content enrichment
1. By combining trained cognitive services, new valuable metadata can be retrieved from content
2. Automatic creation and use of that metadata must be included in existing processes
3. Quality of cognitive services and processes must be supervised
Information Corpora
- Rule-based configuration
- Batch learning
- Manual labeling
- Cognitive workflow builder
- E2E Broadcast Integration (MAM, etc.)
- Full integration into AREMA Operations Dashboards
…
Training
Cognitive Workflow Orchestration
Cognitive Workflow Operations
Elementary AI Services
Cognitive Content Media Services
IBM Watson APIs, 3rd Party APIs
Speech-to-Text, NLC/NLU*, Visual Recogn., …
General Domain Content Tagging
Domain-specific Content Tagging (3rd party)
Domain-specific Content Tagging (proprietary)
Domain-specific Content Tagging (shared)
Speech, Language, Visual, …
Watson Media
Knowledge Studio
Essence Files, Meta Data, Public Data, Other Data sources, …
28. • Comparing single cognitive services against each other is not adequate; what matters is a reasonable combination of
services
• The solution approach must start with the given use case, for which the solution will be defined and
customized
• AI will not take over all human work, but will provide support in the areas where automation is meaningful
• The process will be a mix of human and AI-based tasks and steps
• Sufficient solutions will be created by trial and optimization, not by waiting for the perfect
technology.
Summary
While AI can’t fully match the human touch creatively, it can optimize workflows and
media processes to gain more value from content.
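The mix of human and AI-based steps described above is commonly implemented by routing low-confidence results to a person instead of writing them straight into the archive. A minimal sketch follows, assuming each annotation carries a confidence score between 0 and 1. The 0.8 threshold, the `triage` function and the sample annotations are arbitrary illustrative choices, not values from the presentation.

```python
from typing import Dict, List, Tuple


def triage(
    annotations: List[Dict], threshold: float = 0.8
) -> Tuple[List[Dict], List[Dict]]:
    """Split annotations into auto-accepted ones and ones needing human review."""
    accepted, needs_review = [], []
    for ann in annotations:
        if ann["confidence"] >= threshold:
            accepted.append(ann)
        else:
            needs_review.append(ann)
    return accepted, needs_review


# Invented sample annotations.
sample = [
    {"label": "goal celebration", "confidence": 0.95},
    {"label": "referee", "confidence": 0.55},
    {"label": "stadium", "confidence": 0.80},
]

accepted, needs_review = triage(sample)
print("accepted:", [a["label"] for a in accepted])
print("needs review:", [a["label"] for a in needs_review])
```

Corrections made during review can also serve as new training data, which ties this step back to the training loop discussed earlier in the deck.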
31.
Notes and Sources
McCaskill, Steve. “Wimbledon 2018: AI Marries Tennis Tradition With Digital Innovation.” Forbes. July 2018.
https://www.forbes.com/sites/stevemccaskill/2018/07/06/wimbledon-marries-innovation-with-tradition-in-use-of-ai/#7686e2d92198
Moore, Mike. “Wimbledon 2018: How IBM Watson is serving up the best viewer experience.” Tech Radar. July 2018.
https://www.techradar.com/news/wimbledon-2018-how-ibm-watson-is-serving-up-the-best-viewer-experience
McCarthy, John. “IBM and Fox Sports lean on AI so fans can generate World Cup highlights packages.” The Drum. June 2018.
https://www.thedrum.com/news/2018/06/06/ibm-and-fox-sports-lean-ai-so-fans-can-generate-world-cup-highlights-packages
Alvarez, Edgar. “Fox Sports’ World Cup Highlight Machine is powered by IBM’s Watson.” Engadget. June 2018.
https://www.engadget.com/2018/06/04/fox-sports-world-cup-highlight-machine-ibm-watson
Chang, Lulu. “IBM’s Watson will make headlines at the Masters tournament.” Digital Trends. April 2018.
https://www.digitaltrends.com/outdoors/ibm-watson-masters
Alexander, Julia. “Watch the first ever movie trailer made by artificial intelligence.” Polygon. September 2016.
https://www.polygon.com/2016/9/1/12753298/morgan-trailer-artificial-intelligence
Smith, John R. “IBM Research takes Watson to Hollywood with the first ‘cognitive movie trailer.’” IBM. August 2016.
https://www.ibm.com/blogs/think/2016/08/cognitive-movie-trailer
“Uncovering Dark Video Data with AI: How Watson Video Enrichment can provide better decision-making data and unlock new business possibilities in the media industry.” IBM. August 2017.
https://public.dhe.ibm.com/common/ssi/ecm/me/en/mew03018usen/uncovering-dark-data_MEW03018USEN.pdf