The document discusses multimedia annotation. It begins with the motivation for annotation, which is the exponential growth of multimedia content generated from ubiquitous mobile access and wearable devices. This content needs to be indexed and retrieved. The document then covers the basic architecture of annotation systems, including content capture, storage, and retrieval. It also discusses different types of metadata used for annotation at various semantic levels. Finally, it compares manual annotation versus automatic annotation, noting the semantic gap challenge and covering approaches like crowdsourcing annotation as micro-tasks or games.
The document discusses multimedia retrieval techniques for improving the user experience of consuming multimedia content. It describes how traditional broadcasting is limited because it does not allow users to choose what, when, how and where they access content. The techniques discussed include search, browsing, and recommendation. Search involves context-based search using metadata and content-based search using visual/audio content. Browsing involves navigating a structure built from content metadata. Recommendation involves using social networks and machine learning to suggest additional content to users.
This is a snapshot of how social media is being used at NC State University as of March 2010. It was presented in ENG 583a Social Media and Technical Communication, a course in NC State's Master's of Science in Technical Communication curriculum.
Building a global teaching profile: Showcasing Open Educational Resources a...Michael Paskevicius
Â
Building a global teaching profile: Â Showcasing Open Educational Resources at the University of Cape Town (UCT).
Delivered November 18, 2009 at the Teaching with Technology Miniconference hosted by the Centre for Educational Technology at UCT.
The document summarizes the Euroversity network, which aims to collect examples of good practices in teaching and learning in virtual worlds. It discusses the network's goals of facilitating knowledge transfer across contexts and providing a framework for a pan-European virtual university. The summary also outlines some of the network's opportunities in reducing costs and barriers to education, as well as some of its challenges around acceptance, access issues, and quality concerns.
Seminar presentation about :
Automatic Image Annotation structure: shallow and deep,
cons and pros of different features and classification methods in AIA and
useful information about databases,toolboxes, authors
Digital Commons Seminar Series: Information and Digital Literacy at the Unive...missvagrant
Â
The document discusses information and digital literacy at the University of Sheffield. It covers conceptualizing information and digital literacy, models for digital literacy from JISC, what information and digital literacy are, the library's vision statement, examples of digital tools used to support literacy like referencing tutorials and databases, and concludes with discussing literacy initiatives in the university's Information Commons.
ct: On 10 March 2010, cIRcle and the University Librarian, Ingrid Parent, hosted a special event at which UBC scholars highlighted their experience using cIRcle to disseminate research. This event, entitled "Up close with cIRcle: Revealing your research to the world" took place in the Irving K. Barber Learning Centre during Celebrate Research Week (March 8 - 12, 2010) at the University of British Columbia. The attached slide show was created for this event.
The document discusses multimedia retrieval techniques for improving the user experience of consuming multimedia content. It describes how traditional broadcasting is limited because it does not allow users to choose what, when, how and where they access content. The techniques discussed include search, browsing, and recommendation. Search involves context-based search using metadata and content-based search using visual/audio content. Browsing involves navigating a structure built from content metadata. Recommendation involves using social networks and machine learning to suggest additional content to users.
This is a snapshot of how social media is being used at NC State University as of March 2010. It was presented in ENG 583a Social Media and Technical Communication, a course in NC State's Master's of Science in Technical Communication curriculum.
Building a global teaching profile: Showcasing Open Educational Resources a...Michael Paskevicius
Â
Building a global teaching profile: Â Showcasing Open Educational Resources at the University of Cape Town (UCT).
Delivered November 18, 2009 at the Teaching with Technology Miniconference hosted by the Centre for Educational Technology at UCT.
The document summarizes the Euroversity network, which aims to collect examples of good practices in teaching and learning in virtual worlds. It discusses the network's goals of facilitating knowledge transfer across contexts and providing a framework for a pan-European virtual university. The summary also outlines some of the network's opportunities in reducing costs and barriers to education, as well as some of its challenges around acceptance, access issues, and quality concerns.
Seminar presentation about :
Automatic Image Annotation structure: shallow and deep,
cons and pros of different features and classification methods in AIA and
useful information about databases,toolboxes, authors
Digital Commons Seminar Series: Information and Digital Literacy at the Unive...missvagrant
Â
The document discusses information and digital literacy at the University of Sheffield. It covers conceptualizing information and digital literacy, models for digital literacy from JISC, what information and digital literacy are, the library's vision statement, examples of digital tools used to support literacy like referencing tutorials and databases, and concludes with discussing literacy initiatives in the university's Information Commons.
ct: On 10 March 2010, cIRcle and the University Librarian, Ingrid Parent, hosted a special event at which UBC scholars highlighted their experience using cIRcle to disseminate research. This event, entitled "Up close with cIRcle: Revealing your research to the world" took place in the Irving K. Barber Learning Centre during Celebrate Research Week (March 8 - 12, 2010) at the University of British Columbia. The attached slide show was created for this event.
OPEN BADGES – THE MISSING LINK IN OPEN EDUCATIONIlona Buchem
Â
A KEYNOTE ON OPEN BADGES PRESENTED AT THE UNIVERSITY OF LONDON, CENTER FOR DISTANCE EDUCATION, RESEARCH AND INNOVATION IN DISTANCE EDUCATION AND E-LEARNING – The 2016 research conference, 11 MARCH 2016 https://cdelondon.wordpress.com/
Presentation in panel "Opening up the world" at the Conference of OASP-Open Access Scholarly Publishers Association. Paris, UNESCO,Headquarters, 17-19 September 2014. http://oaspa.org/coasp-2014-preliminary-program/
Presentation in panel "Opening up the world" at the Conference of OASP-Open Access Scholarly Publishers Association. Paris, UNESCO,Headquarters, 17-19 September 2014. http://oaspa.org/coasp-2014-preliminary-program/
http://imatge-upc.github.io/telecombcn-2016-dlcv/
Deep learning technologies are at the core of the current revolution in artificial intelligence for multimedia data analysis. The convergence of big annotated data and affordable GPU hardware has allowed the training of neural networks for data analysis tasks which had been addressed until now with hand-crafted features. Architectures such as convolutional neural networks, recurrent neural networks and Q-nets for reinforcement learning have shaped a brand new scenario in signal processing. This course will cover the basic principles and applications of deep learning to computer vision problems, such as image classification, object detection or text captioning.
The Olympic and Sport Studies Centre at the Universidad AutĂłnoma de Barcelona is a multidisciplinary institution focused on research, education, dissemination, and information management related to sports and the Olympics. It was created for the 1992 Barcelona Olympics and conducts research projects on topics like mega-events, multiculturalism in sports, and social media and the Olympic Movement. The Centre offers education programs and promotes the International Chair in Olympism. It manages an online portal and library resources to disseminate knowledge about Olympic studies.
Using Web Archives for Studying Cultural Heritage Collaborative PlatformsMarta Severo
Â
In the last few years, cultural institutions have launched several experiments in order to transform their registers into transparent, open and participative documents available on the web. All these platforms introduce new ways of collaborative management of cultural heritage through the creation of participative pages corresponding to the inventory records directly on Wikipedia or on ad hoc platforms. This communication aims at studying these new forms of collaborative management of cultural heritage based on the use of wiki platforms. Past studies on this topic are organized mainly around two poles: analyses of computer and technical solutions, on the one hand, and researches on changes in the relationship between institutions and publics, on the other hand. Differently, this study is meant to focus on cultural heritage and notably on the collaborative digital writing around heritage objects that take shape on the web. Our ideal goal would be to study, through a historical perspective, how cultural heritage objects included in these inventories have evolved in the last few years as an effect of their opening on the web through wiki platforms. The objects will not be considered in relation to the inventory record, but as digital objects resulting from the editorialization processes involving heritage professionals, but also other users of the web.
Space, The Final Frontier: Next Generation Special CollectionsElaine Harrington
Â
This document summarizes Elaine Harrington's presentation on next generation special collections at UCC Library. It discusses the library's efforts to upgrade learning spaces and showcase innovative teaching practices. It also describes findings from the NMC Horizon Report on emerging technologies in higher education. Additionally, it provides several case studies of how special collections have been used in innovative ways with technologies like iPads, GIS mapping, 3D printing, microscopes, and document cameras to enhance learning. The presentation emphasizes how special collections can shape research outcomes and the benefits of reflecting on technology use.
- Angelina Russo leads research projects exploring the impact of social media on museum learning and communication between museums.
- Te Papa Tongarewa museum in New Zealand has the only complete Colossal Squid specimen on display. It was defrosted and examined in 2008 with a webcast and blog documenting the process.
- Russo discusses innovations at Te Papa including using multiple platforms to engage audiences through the scientific examination of the squid and its exhibition. She advocates designing experiences that capture imagination and illustrate relevant issues.
Convenient isn't always simple: Digital Visitors and Residents.Lynn Connaway
Â
Connaway, L. S. (2019). Convenient isn't always simple: Digital Visitors and Residents. Presented at the University of Adelaide, February 18, 2019, Adelaide, Australia.
CILIP Conference 2020: The 'Digital Pivot' - the role of librarnas and knowle...CILIP
Â
The document discusses the role of librarians and knowledge specialists in supporting the transition to online teaching and learning during the COVID-19 pandemic. It notes that information literacy is important for developing critical thinking skills and engaging as informed citizens. A survey found that librarians helped with skills like evaluating information, using databases, referencing, and dealing with misinformation. Common technologies used included web conferencing, learning tools, and video creation tools. Challenges included student engagement and digital literacy, while opportunities included developing online teaching skills and reaching more students. The document provides case studies of librarians adapting information literacy sessions online.
IEEE and IEEE Education Society - IEEE and LACCEI Manuel Castro
Â
Meetings in the IEEE Florida Council (North and Central Florida / South Florida) in August 2015, for the formation of the IEEE Florida Council Education Society Chapter
How researchers use of Social Media & Scholarly Collaboration NetworksKeita Bando
Â
Researchers have long shared scholarly articles with colleagues, but the internet has changed scholarly communication. There are now many scholarly collaboration networks (SCNs) that allow researchers to communicate, exchange information, and build online profiles. SCNs such as Academia.edu, ResearchGate, and Mendeley are increasingly popular among researchers for discovering and sharing articles. Sharing articles on SCNs can help advance research and be tracked through altmetrics.
Here are the key points about bibliometrics and information
evaluation:
- Bibliometrics is the use of citations to evaluate the impact and influence of
research publications. It is commonly used in research evaluation exercises.
- Databases like WoS, Scopus and Google Scholar have citation searching
capabilities to identify highly cited papers in a field and track citations to a
researcher's work.
- Altmetrics is an emerging field that looks at the broader impact of research
through social media mentions, bookmarks, reviews etc. It provides a more
timely measure of impact.
- Each database has strengths and limitations in coverage. WoS indexes fewer
journals but is more selective. Scopus has
Using Open Educational Resources in the Basic Composition ClassroomAnne Arendt
Â
Using Open Educational Resources (OER) and OpenCourseWare (OCW) and other Web 2.0 Technologies in the Basic Composition Classroom
Note: Go to http://works.bepress.com/anne_arendt/4/ to get the detailed report with all the proper citations and additional information.
This document provides an overview of deep generative learning and summarizes several key generative models including GANs, VAEs, diffusion models, and autoregressive models. It discusses the motivation for generative models and their applications such as image generation, text-to-image synthesis, and enhancing other media like video and speech. Example state-of-the-art models are provided for each application. The document also covers important concepts like the difference between discriminative and generative modeling, sampling techniques, and the training procedures for GANs and VAEs.
This document provides an overview of deep generative learning and summarizes several key generative models including GANs, VAEs, diffusion models, and autoregressive models. It discusses the motivation for generative models and their applications such as image generation, text-to-image synthesis, and enhancing other media like video and speech. Example state-of-the-art models are provided for each application. The document also covers important concepts like the difference between discriminative and generative modeling, sampling techniques, and the training procedures for GANs and VAEs.
OPEN BADGES – THE MISSING LINK IN OPEN EDUCATIONIlona Buchem
Â
A KEYNOTE ON OPEN BADGES PRESENTED AT THE UNIVERSITY OF LONDON, CENTER FOR DISTANCE EDUCATION, RESEARCH AND INNOVATION IN DISTANCE EDUCATION AND E-LEARNING – The 2016 research conference, 11 MARCH 2016 https://cdelondon.wordpress.com/
Presentation in panel "Opening up the world" at the Conference of OASP-Open Access Scholarly Publishers Association. Paris, UNESCO,Headquarters, 17-19 September 2014. http://oaspa.org/coasp-2014-preliminary-program/
Presentation in panel "Opening up the world" at the Conference of OASP-Open Access Scholarly Publishers Association. Paris, UNESCO,Headquarters, 17-19 September 2014. http://oaspa.org/coasp-2014-preliminary-program/
http://imatge-upc.github.io/telecombcn-2016-dlcv/
Deep learning technologies are at the core of the current revolution in artificial intelligence for multimedia data analysis. The convergence of big annotated data and affordable GPU hardware has allowed the training of neural networks for data analysis tasks which had been addressed until now with hand-crafted features. Architectures such as convolutional neural networks, recurrent neural networks and Q-nets for reinforcement learning have shaped a brand new scenario in signal processing. This course will cover the basic principles and applications of deep learning to computer vision problems, such as image classification, object detection or text captioning.
The Olympic and Sport Studies Centre at the Universidad AutĂłnoma de Barcelona is a multidisciplinary institution focused on research, education, dissemination, and information management related to sports and the Olympics. It was created for the 1992 Barcelona Olympics and conducts research projects on topics like mega-events, multiculturalism in sports, and social media and the Olympic Movement. The Centre offers education programs and promotes the International Chair in Olympism. It manages an online portal and library resources to disseminate knowledge about Olympic studies.
Using Web Archives for Studying Cultural Heritage Collaborative PlatformsMarta Severo
Â
In the last few years, cultural institutions have launched several experiments in order to transform their registers into transparent, open and participative documents available on the web. All these platforms introduce new ways of collaborative management of cultural heritage through the creation of participative pages corresponding to the inventory records directly on Wikipedia or on ad hoc platforms. This communication aims at studying these new forms of collaborative management of cultural heritage based on the use of wiki platforms. Past studies on this topic are organized mainly around two poles: analyses of computer and technical solutions, on the one hand, and researches on changes in the relationship between institutions and publics, on the other hand. Differently, this study is meant to focus on cultural heritage and notably on the collaborative digital writing around heritage objects that take shape on the web. Our ideal goal would be to study, through a historical perspective, how cultural heritage objects included in these inventories have evolved in the last few years as an effect of their opening on the web through wiki platforms. The objects will not be considered in relation to the inventory record, but as digital objects resulting from the editorialization processes involving heritage professionals, but also other users of the web.
Space, The Final Frontier: Next Generation Special CollectionsElaine Harrington
Â
This document summarizes Elaine Harrington's presentation on next generation special collections at UCC Library. It discusses the library's efforts to upgrade learning spaces and showcase innovative teaching practices. It also describes findings from the NMC Horizon Report on emerging technologies in higher education. Additionally, it provides several case studies of how special collections have been used in innovative ways with technologies like iPads, GIS mapping, 3D printing, microscopes, and document cameras to enhance learning. The presentation emphasizes how special collections can shape research outcomes and the benefits of reflecting on technology use.
- Angelina Russo leads research projects exploring the impact of social media on museum learning and communication between museums.
- Te Papa Tongarewa museum in New Zealand has the only complete Colossal Squid specimen on display. It was defrosted and examined in 2008 with a webcast and blog documenting the process.
- Russo discusses innovations at Te Papa including using multiple platforms to engage audiences through the scientific examination of the squid and its exhibition. She advocates designing experiences that capture imagination and illustrate relevant issues.
Convenient isn't always simple: Digital Visitors and Residents.Lynn Connaway
Â
Connaway, L. S. (2019). Convenient isn't always simple: Digital Visitors and Residents. Presented at the University of Adelaide, February 18, 2019, Adelaide, Australia.
CILIP Conference 2020: The 'Digital Pivot' - the role of librarnas and knowle...CILIP
Â
The document discusses the role of librarians and knowledge specialists in supporting the transition to online teaching and learning during the COVID-19 pandemic. It notes that information literacy is important for developing critical thinking skills and engaging as informed citizens. A survey found that librarians helped with skills like evaluating information, using databases, referencing, and dealing with misinformation. Common technologies used included web conferencing, learning tools, and video creation tools. Challenges included student engagement and digital literacy, while opportunities included developing online teaching skills and reaching more students. The document provides case studies of librarians adapting information literacy sessions online.
IEEE and IEEE Education Society - IEEE and LACCEI Manuel Castro
Â
Meetings in the IEEE Florida Council (North and Central Florida / South Florida) in August 2015, for the formation of the IEEE Florida Council Education Society Chapter
How researchers use of Social Media & Scholarly Collaboration NetworksKeita Bando
Â
Researchers have long shared scholarly articles with colleagues, but the internet has changed scholarly communication. There are now many scholarly collaboration networks (SCNs) that allow researchers to communicate, exchange information, and build online profiles. SCNs such as Academia.edu, ResearchGate, and Mendeley are increasingly popular among researchers for discovering and sharing articles. Sharing articles on SCNs can help advance research and be tracked through altmetrics.
Here are the key points about bibliometrics and information
evaluation:
- Bibliometrics is the use of citations to evaluate the impact and influence of
research publications. It is commonly used in research evaluation exercises.
- Databases like WoS, Scopus and Google Scholar have citation searching
capabilities to identify highly cited papers in a field and track citations to a
researcher's work.
- Altmetrics is an emerging field that looks at the broader impact of research
through social media mentions, bookmarks, reviews etc. It provides a more
timely measure of impact.
- Each database has strengths and limitations in coverage. WoS indexes fewer
journals but is more selective. Scopus has
Using Open Educational Resources in the Basic Composition ClassroomAnne Arendt
Â
Using Open Educational Resources (OER) and OpenCourseWare (OCW) and other Web 2.0 Technologies in the Basic Composition Classroom
Note: Go to http://works.bepress.com/anne_arendt/4/ to get the detailed report with all the proper citations and additional information.
This document provides an overview of deep generative learning and summarizes several key generative models including GANs, VAEs, diffusion models, and autoregressive models. It discusses the motivation for generative models and their applications such as image generation, text-to-image synthesis, and enhancing other media like video and speech. Example state-of-the-art models are provided for each application. The document also covers important concepts like the difference between discriminative and generative modeling, sampling techniques, and the training procedures for GANs and VAEs.
This document provides an overview of deep generative learning and summarizes several key generative models including GANs, VAEs, diffusion models, and autoregressive models. It discusses the motivation for generative models and their applications such as image generation, text-to-image synthesis, and enhancing other media like video and speech. Example state-of-the-art models are provided for each application. The document also covers important concepts like the difference between discriminative and generative modeling, sampling techniques, and the training procedures for GANs and VAEs.
The document discusses the Vision Transformer (ViT) model for computer vision tasks. It covers:
1. How ViT tokenizes images into patches and uses position embeddings to encode spatial relationships.
2. ViT uses a class embedding to trigger class predictions, unlike CNNs which have decoders.
3. The receptive field of ViT grows as the attention mechanism allows elements to attend to other distant elements in later layers.
4. Initial results showed ViT performance was comparable to CNNs when trained on large datasets but lagged CNNs trained on smaller datasets like ImageNet.
Machine translation and computer vision have greatly benefited from the advances in deep learning. A large and diverse amount of textual and visual data have been used to train neural networks whether in a supervised or self-supervised manner. Nevertheless, the convergence of the two fields in sign language translation and production still poses multiple open challenges, like the low video resources, limitations in hand pose estimation, or 3D spatial grounding from poses.
The transformer is the neural architecture that has received most attention in the early 2020's. It removed the recurrency in RNNs, replacing it with and attention mechanism across the input and output tokens of a sequence (cross-attenntion) and between the tokens composing the input (and output) sequences, named self-attention.
These slides review the research of our lab since 2016 on applied deep learning, starting from our participation in the TRECVID Instance Search 2014, moving into video analysis with CNN+RNN architectures, and our current efforts in sign language translation and production.
Machine translation and computer vision have greatly benefited of the advances in deep learning. The large and diverse amount of textual and visual data have been used to train neural networks whether in a supervised or self-supervised manner. Nevertheless, the convergence of the two field in sign language translation and production is still poses multiple open challenges, like the low video resources, limitations in hand pose estimation, or 3D spatial grounding from poses. This talk will present these challenges and the How2✌️Sign dataset (https://how2sign.github.io) recorded at CMU in collaboration with UPC, BSC, Gallaudet University and Facebook.
https://imatge.upc.edu/web/publications/sign-language-translation-and-production-multimedia-and-multimodal-challenges-all
https://imatge-upc.github.io/synthref/
Integrating computer vision with natural language processing has achieved significant progress
over the last years owing to the continuous evolution of deep learning. A novel vision and language
task, which is tackled in the present Master thesis is referring video object segmentation, in which a
language query defines which instance to segment from a video sequence. One of the biggest chal-
lenges for this task is the lack of relatively large annotated datasets since a tremendous amount of
time and human effort is required for annotation. Moreover, existing datasets suffer from poor qual-
ity annotations in the sense that approximately one out of ten language expressions fails to uniquely
describe the target object.
The purpose of the present Master thesis is to address these challenges by proposing a novel
method for generating synthetic referring expressions for an image (video frame). This method pro-
duces synthetic referring expressions by using only the ground-truth annotations of the objects as well
as their attributes, which are detected by a state-of-the-art object detection deep neural network. One
of the advantages of the proposed method is that its formulation allows its application to any object
detection or segmentation dataset.
By using the proposed method, the first large-scale dataset with synthetic referring expressions for
video object segmentation is created, based on an existing large benchmark dataset for video instance
segmentation. A statistical analysis and comparison of the created synthetic dataset with existing ones
is also provided in the present Master thesis.
The conducted experiments on three different datasets used for referring video object segmen-
tation prove the efficiency of the generated synthetic data. More specifically, the obtained results
demonstrate that by pre-training a deep neural network with the proposed synthetic dataset one can
improve the ability of the network to generalize across different datasets, without any additional annotation cost. This outcome is even more important taking into account that no additional annotation cost is involved.
Peter Muschick MSc thesis
Universitat Pollitecnica de Catalunya, 2020
Sign language recognition and translation has been an active research field in the recent years with most approaches using deep neural networks to extract information from sign language data. This work investigates the mostly disregarded approach of using human keypoint estimation from image and video data with OpenPose in combination with transformer network architecture. Firstly, it was shown that it is possible to recognize individual signs (4.5% word error rate (WER)). Continuous sign language recognition though was more error prone (77.3% WER) and sign language translation was not possible using the proposed methods, which might be due to low accuracy scores of human keypoint estimation by OpenPose and accompanying loss of information or insufficient capacities of the used transformer model. Results may improve with the use of datasets containing higher repetition rates of individual signs or focusing more precisely on keypoint extraction of hands.
This document discusses interpretability and explainable AI (XAI) in neural networks. It begins by providing motivation for why explanations of neural network predictions are often required. It then provides an overview of different interpretability techniques, including visualizing learned weights and feature maps, attribution methods like class activation maps and guided backpropagation, and feature visualization. Specific examples and applications of each technique are described. The document serves as a guide to interpretability and explainability in deep learning models.
Deep learning technologies are at the core of the current revolution in artificial intelligence for multimedia data analysis. The convergence of large-scale annotated datasets and affordable GPU hardware has allowed the training of neural networks for data analysis tasks which were previously addressed with hand-crafted features. Architectures such as convolutional neural networks, recurrent neural networks or Q-nets for reinforcement learning have shaped a brand new scenario in signal processing. This course will cover the basic principles of deep learning from both an algorithmic and computational perspectives.
Deep learning technologies are at the core of the current revolution in artificial intelligence for multimedia data analysis. The convergence of large-scale annotated datasets and affordable GPU hardware has allowed the training of neural networks for data analysis tasks which were previously addressed with hand-crafted features. Architectures such as convolutional neural networks, recurrent neural networks or Q-nets for reinforcement learning have shaped a brand new scenario in signal processing. This course will cover the basic principles of deep learning from both an algorithmic and computational perspectives.
Deep learning technologies are at the core of the current revolution in artificial intelligence for multimedia data analysis. The convergence of large-scale annotated datasets and affordable GPU hardware has allowed the training of neural networks for data analysis tasks which were previously addressed with hand-crafted features. Architectures such as convolutional neural networks, recurrent neural networks or Q-nets for reinforcement learning have shaped a brand new scenario in signal processing. This course will cover the basic principles of deep learning from both an algorithmic and computational perspectives.
https://telecombcn-dl.github.io/dlai-2020/
Deep learning technologies are at the core of the current revolution in artificial intelligence for multimedia data analysis. The convergence of large-scale annotated datasets and affordable GPU hardware has allowed the training of neural networks for data analysis tasks which were previously addressed with hand-crafted features. Architectures such as convolutional neural networks, recurrent neural networks or Q-nets for reinforcement learning have shaped a brand new scenario in signal processing. This course will cover the basic principles of deep learning from both an algorithmic and computational perspectives.
https://telecombcn-dl.github.io/drl-2020/
This course presents the principles of reinforcement learning as an artificial intelligence tool based on the interaction of the machine with its environment, with applications to control tasks (eg. robotics, autonomous driving) o decision making (eg. resource optimization in wireless communication networks). It also advances in the development of deep neural networks trained with little or no supervision, both for discriminative and generative tasks, with special attention on multimedia applications (vision, language and speech).
Giro-i-Nieto, X. One Perceptron to Rule Them All: Language, Vision, Audio and Speech. In Proceedings of the 2020 International Conference on Multimedia Retrieval (pp. 7-8).
Tutorial page:
https://imatge.upc.edu/web/publications/one-perceptron-rule-them-all-language-vision-audio-and-speech-tutorial
Deep neural networks have boosted the convergence of multimedia data analytics in a unified framework shared by practitioners in natural language, vision and speech. Image captioning, lip reading or video sonorization are some of the first applications of a new and exciting field of research exploiting the generalization properties of deep neural representation. This tutorial will firstly review the basic neural architectures to encode and decode vision, text and audio, to later review the those models that have successfully translated information across modalities.
This document summarizes image segmentation techniques using deep learning. It begins with an overview of semantic segmentation and instance segmentation. It then discusses several techniques for semantic segmentation, including deconvolution/transposed convolution for learnable upsampling, skip connections to combine predictions from different CNN depths, and dilated convolutions to increase the receptive field without losing resolution. For instance segmentation, it covers proposal-based methods like Mask R-CNN, and single-shot and recurrent approaches as alternatives to proposal-based models.
https://imatge-upc.github.io/rvos-mots/
Video object segmentation can be understood as a sequence-to-sequence task that can benefit from the curriculum learning strategies for better and faster training of deep neural networks. This work explores different schedule sampling and frame skipping variations to significantly improve the performance of a recurrent architecture. Our results on the car class of the KITTI-MOTS challenge indicate that, surprisingly, an inverse schedule sampling is a better option than a classic forward one. Also, that a progressive skipping of frames during training is beneficial, but only when training with the ground truth masks instead of the predicted ones.
Deep neural networks have achieved outstanding results in various applications such as vision, language, audio, speech, or reinforcement learning. These powerful function approximators typically require large amounts of data to be trained, which poses a challenge in the usual case where little labeled data is available. During the last year, multiple solutions have been proposed to leverage this problem, based on the concept of self-supervised learning, which can be understood as a specific case of unsupervised learning. This talk will cover its basic principles and provide examples in the field of multimedia.
More from Universitat Politècnica de Catalunya (20)
HCL Notes and Domino License Cost Reduction in the World of DLAUpanagenda
Â
Webinar Recording: https://www.panagenda.com/webinars/hcl-notes-and-domino-license-cost-reduction-in-the-world-of-dlau/
The introduction of DLAU and the CCB & CCX licensing model caused quite a stir in the HCL community. As a Notes and Domino customer, you may have faced challenges with unexpected user counts and license costs. You probably have questions on how this new licensing approach works and how to benefit from it. Most importantly, you likely have budget constraints and want to save money where possible. Don’t worry, we can help with all of this!
We’ll show you how to fix common misconfigurations that cause higher-than-expected user counts, and how to identify accounts which you can deactivate to save money. There are also frequent patterns that can cause unnecessary cost, like using a person document instead of a mail-in for shared mailboxes. We’ll provide examples and solutions for those as well. And naturally we’ll explain the new licensing model.
Join HCL Ambassador Marc Thomas in this webinar with a special guest appearance from Franz Walder. It will give you the tools and know-how to stay on top of what is going on with Domino licensing. You will be able lower your cost through an optimized configuration and keep it low going forward.
These topics will be covered
- Reducing license cost by finding and fixing misconfigurations and superfluous accounts
- How do CCB and CCX licenses really work?
- Understanding the DLAU tool and how to best utilize it
- Tips for common problem areas, like team mailboxes, functional/test users, etc
- Practical examples and best practices to implement right away
leewayhertz.com-AI in predictive maintenance Use cases technologies benefits ...alexjohnson7307
Â
Predictive maintenance is a proactive approach that anticipates equipment failures before they happen. At the forefront of this innovative strategy is Artificial Intelligence (AI), which brings unprecedented precision and efficiency. AI in predictive maintenance is transforming industries by reducing downtime, minimizing costs, and enhancing productivity.
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAUpanagenda
Â
Webinar Recording: https://www.panagenda.com/webinars/hcl-notes-und-domino-lizenzkostenreduzierung-in-der-welt-von-dlau/
DLAU und die Lizenzen nach dem CCB- und CCX-Modell sind für viele in der HCL-Community seit letztem Jahr ein heißes Thema. Als Notes- oder Domino-Kunde haben Sie vielleicht mit unerwartet hohen Benutzerzahlen und Lizenzgebühren zu kämpfen. Sie fragen sich vielleicht, wie diese neue Art der Lizenzierung funktioniert und welchen Nutzen sie Ihnen bringt. Vor allem wollen Sie sicherlich Ihr Budget einhalten und Kosten sparen, wo immer möglich. Das verstehen wir und wir möchten Ihnen dabei helfen!
Wir erklären Ihnen, wie Sie häufige Konfigurationsprobleme lösen können, die dazu führen können, dass mehr Benutzer gezählt werden als nötig, und wie Sie überflüssige oder ungenutzte Konten identifizieren und entfernen können, um Geld zu sparen. Es gibt auch einige Ansätze, die zu unnötigen Ausgaben führen können, z. B. wenn ein Personendokument anstelle eines Mail-Ins für geteilte Mailboxen verwendet wird. Wir zeigen Ihnen solche Fälle und deren Lösungen. Und natürlich erklären wir Ihnen das neue Lizenzmodell.
Nehmen Sie an diesem Webinar teil, bei dem HCL-Ambassador Marc Thomas und Gastredner Franz Walder Ihnen diese neue Welt näherbringen. Es vermittelt Ihnen die Tools und das Know-how, um den Überblick zu bewahren. Sie werden in der Lage sein, Ihre Kosten durch eine optimierte Domino-Konfiguration zu reduzieren und auch in Zukunft gering zu halten.
Diese Themen werden behandelt
- Reduzierung der Lizenzkosten durch Auffinden und Beheben von Fehlkonfigurationen und ĂĽberflĂĽssigen Konten
- Wie funktionieren CCB- und CCX-Lizenzen wirklich?
- Verstehen des DLAU-Tools und wie man es am besten nutzt
- Tipps für häufige Problembereiche, wie z. B. Team-Postfächer, Funktions-/Testbenutzer usw.
- Praxisbeispiele und Best Practices zum sofortigen Umsetzen
Skybuffer AI: Advanced Conversational and Generative AI Solution on SAP Busin...Tatiana Kojar
Â
Skybuffer AI, built on the robust SAP Business Technology Platform (SAP BTP), is the latest and most advanced version of our AI development, reaffirming our commitment to delivering top-tier AI solutions. Skybuffer AI harnesses all the innovative capabilities of the SAP BTP in the AI domain, from Conversational AI to cutting-edge Generative AI and Retrieval-Augmented Generation (RAG). It also helps SAP customers safeguard their investments into SAP Conversational AI and ensure a seamless, one-click transition to SAP Business AI.
With Skybuffer AI, various AI models can be integrated into a single communication channel such as Microsoft Teams. This integration empowers business users with insights drawn from SAP backend systems, enterprise documents, and the expansive knowledge of Generative AI. And the best part of it is that it is all managed through our intuitive no-code Action Server interface, requiring no extensive coding knowledge and making the advanced AI accessible to more users.
FREE A4 Cyber Security Awareness Posters-Social Engineering part 3Data Hops
Â
Free A4 downloadable and printable Cyber Security, Social Engineering Safety and security Training Posters . Promote security awareness in the home or workplace. Lock them Out From training providers datahops.com
This presentation provides valuable insights into effective cost-saving techniques on AWS. Learn how to optimize your AWS resources by rightsizing, increasing elasticity, picking the right storage class, and choosing the best pricing model. Additionally, discover essential governance mechanisms to ensure continuous cost efficiency. Whether you are new to AWS or an experienced user, this presentation provides clear and practical tips to help you reduce your cloud costs and get the most out of your budget.
Best 20 SEO Techniques To Improve Website Visibility In SERPPixlogix Infotech
Â
Boost your website's visibility with proven SEO techniques! Our latest blog dives into essential strategies to enhance your online presence, increase traffic, and rank higher on search engines. From keyword optimization to quality content creation, learn how to make your site stand out in the crowded digital landscape. Discover actionable tips and expert insights to elevate your SEO game.
Introduction of Cybersecurity with OSS at Code Europe 2024Hiroshi SHIBATA
Â
I develop the Ruby programming language, RubyGems, and Bundler, which are package managers for Ruby. Today, I will introduce how to enhance the security of your application using open-source software (OSS) examples from Ruby and RubyGems.
The first topic is CVE (Common Vulnerabilities and Exposures). I have published CVEs many times. But what exactly is a CVE? I'll provide a basic understanding of CVEs and explain how to detect and handle vulnerabilities in OSS.
Next, let's discuss package managers. Package managers play a critical role in the OSS ecosystem. I'll explain how to manage library dependencies in your application.
I'll share insights into how the Ruby and RubyGems core team works to keep our ecosystem safe. By the end of this talk, you'll have a better understanding of how to safeguard your code.
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slackshyamraj55
Â
Discover the seamless integration of RPA (Robotic Process Automation), COMPOSER, and APM with AWS IDP enhanced with Slack notifications. Explore how these technologies converge to streamline workflows, optimize performance, and ensure secure access, all while leveraging the power of AWS IDP and real-time communication via Slack notifications.
zkStudyClub - LatticeFold: A Lattice-based Folding Scheme and its Application...Alex Pruden
Â
Folding is a recent technique for building efficient recursive SNARKs. Several elegant folding protocols have been proposed, such as Nova, Supernova, Hypernova, Protostar, and others. However, all of them rely on an additively homomorphic commitment scheme based on discrete log, and are therefore not post-quantum secure. In this work we present LatticeFold, the first lattice-based folding protocol based on the Module SIS problem. This folding protocol naturally leads to an efficient recursive lattice-based SNARK and an efficient PCD scheme. LatticeFold supports folding low-degree relations, such as R1CS, as well as high-degree relations, such as CCS. The key challenge is to construct a secure folding protocol that works with the Ajtai commitment scheme. The difficulty, is ensuring that extracted witnesses are low norm through many rounds of folding. We present a novel technique using the sumcheck protocol to ensure that extracted witnesses are always low norm no matter how many rounds of folding are used. Our evaluation of the final proof system suggests that it is as performant as Hypernova, while providing post-quantum security.
Paper Link: https://eprint.iacr.org/2024/257
Freshworks Rethinks NoSQL for Rapid Scaling & Cost-EfficiencyScyllaDB
Â
Freshworks creates AI-boosted business software that helps employees work more efficiently and effectively. Managing data across multiple RDBMS and NoSQL databases was already a challenge at their current scale. To prepare for 10X growth, they knew it was time to rethink their database strategy. Learn how they architected a solution that would simplify scaling while keeping costs under control.
Main news related to the CCS TSI 2023 (2023/1695)Jakub Marek
Â
An English 🇬🇧 translation of a presentation to the speech I gave about the main changes brought by CCS TSI 2023 at the biggest Czech conference on Communications and signalling systems on Railways, which was held in Clarion Hotel Olomouc from 7th to 9th November 2023 (konferenceszt.cz). Attended by around 500 participants and 200 on-line followers.
The original Czech 🇨🇿 version of the presentation can be found here: https://www.slideshare.net/slideshow/hlavni-novinky-souvisejici-s-ccs-tsi-2023-2023-1695/269688092 .
The videorecording (in Czech) from the presentation is available here: https://youtu.be/WzjJWm4IyPk?si=SImb06tuXGb30BEH .
Monitoring and Managing Anomaly Detection on OpenShift.pdfTosin Akinosho
Â
Monitoring and Managing Anomaly Detection on OpenShift
Overview
Dive into the world of anomaly detection on edge devices with our comprehensive hands-on tutorial. This SlideShare presentation will guide you through the entire process, from data collection and model training to edge deployment and real-time monitoring. Perfect for those looking to implement robust anomaly detection systems on resource-constrained IoT/edge devices.
Key Topics Covered
1. Introduction to Anomaly Detection
- Understand the fundamentals of anomaly detection and its importance in identifying unusual behavior or failures in systems.
2. Understanding Edge (IoT)
- Learn about edge computing and IoT, and how they enable real-time data processing and decision-making at the source.
3. What is ArgoCD?
- Discover ArgoCD, a declarative, GitOps continuous delivery tool for Kubernetes, and its role in deploying applications on edge devices.
4. Deployment Using ArgoCD for Edge Devices
- Step-by-step guide on deploying anomaly detection models on edge devices using ArgoCD.
5. Introduction to Apache Kafka and S3
- Explore Apache Kafka for real-time data streaming and Amazon S3 for scalable storage solutions.
6. Viewing Kafka Messages in the Data Lake
- Learn how to view and analyze Kafka messages stored in a data lake for better insights.
7. What is Prometheus?
- Get to know Prometheus, an open-source monitoring and alerting toolkit, and its application in monitoring edge devices.
8. Monitoring Application Metrics with Prometheus
- Detailed instructions on setting up Prometheus to monitor the performance and health of your anomaly detection system.
9. What is Camel K?
- Introduction to Camel K, a lightweight integration framework built on Apache Camel, designed for Kubernetes.
10. Configuring Camel K Integrations for Data Pipelines
- Learn how to configure Camel K for seamless data pipeline integrations in your anomaly detection workflow.
11. What is a Jupyter Notebook?
- Overview of Jupyter Notebooks, an open-source web application for creating and sharing documents with live code, equations, visualizations, and narrative text.
12. Jupyter Notebooks with Code Examples
- Hands-on examples and code snippets in Jupyter Notebooks to help you implement and test anomaly detection models.
Salesforce Integration for Bonterra Impact Management (fka Social Solutions A...Jeffrey Haguewood
Â
Sidekick Solutions uses Bonterra Impact Management (fka Social Solutions Apricot) and automation solutions to integrate data for business workflows.
We believe integration and automation are essential to user experience and the promise of efficient work through technology. Automation is the critical ingredient to realizing that full vision. We develop integration products and services for Bonterra Case Management software to support the deployment of automations for a variety of use cases.
This video focuses on integration of Salesforce with Bonterra Impact Management.
Interested in deploying an integration with Salesforce for Bonterra Impact Management? Contact us at sales@sidekicksolutionsllc.com to discuss next steps.
Digital Banking in the Cloud: How Citizens Bank Unlocked Their MainframePrecisely
Â
Inconsistent user experience and siloed data, high costs, and changing customer expectations – Citizens Bank was experiencing these challenges while it was attempting to deliver a superior digital banking experience for its clients. Its core banking applications run on the mainframe and Citizens was using legacy utilities to get the critical mainframe data to feed customer-facing channels, like call centers, web, and mobile. Ultimately, this led to higher operating costs (MIPS), delayed response times, and longer time to market.
Ever-changing customer expectations demand more modern digital experiences, and the bank needed to find a solution that could provide real-time data to its customer channels with low latency and operating costs. Join this session to learn how Citizens is leveraging Precisely to replicate mainframe data to its customer channels and deliver on their “modern digital bank” experiences.
Have you ever been confused by the myriad of choices offered by AWS for hosting a website or an API?
Lambda, Elastic Beanstalk, Lightsail, Amplify, S3 (and more!) can each host websites + APIs. But which one should we choose?
Which one is cheapest? Which one is fastest? Which one will scale to meet our needs?
Join me in this session as we dive into each AWS hosting service to determine which one is best for your scenario and explain why!
Ivanti’s Patch Tuesday breakdown goes beyond patching your applications and brings you the intelligence and guidance needed to prioritize where to focus your attention first. Catch early analysis on our Ivanti blog, then join industry expert Chris Goettl for the Patch Tuesday Webinar Event. There we’ll do a deep dive into each of the bulletins and give guidance on the risks associated with the newly-identified vulnerabilities.
36. ImageProcessingGroups
UniversitatPolitècnicadeCatalunya(UPC)
Xavier Giro-i-Nieto, “Multimedia Annotation”. Dublin City University (04/04/2016)
Metadata
• Applications: Grouping photos in events with metadata.
36
D. Manchon-Vizuete, Gris-Sarabia, I., and Giró-i-Nieto, X., “Photo Clustering of Social Events by
Extending PhotoTOC to a Rich Context”, in ICMR 2014 Workshop on Social Events in Web Multimedia
(SEWM), Glasgow, Scotland, 2014.
40. ImageProcessingGroups
UniversitatPolitècnicadeCatalunya(UPC)
Xavier Giro-i-Nieto, “Multimedia Annotation”. Dublin City University (04/04/2016)
Manual vs Automatic Annotation
40
The semantic gap is the difference between a high level and a
low level description of a document:
Human are very good at abstraction
using natural language (words)...
...while computers are really good at
analysing perceptual features.
Semantic
gap
53. ImageProcessingGroups
UniversitatPolitècnicadeCatalunya(UPC)
Xavier Giro-i-Nieto, “Multimedia Annotation”. Dublin City University (04/04/2016)
Manual Annotation
53
Ref: Luis von Ahn and Laura Dabbish,
“Labeling images with a computer
game”. (SIGCHI 2004)
Ref: Amaia Salvador et al, “Crowdsourced
Object Segmentation with a Game” (CrowdMM
2013)
Games With A Purpose (GWAP)
Annotation can be splitted and assigned to the crowd as…
….micro-games for online players.
61. ImageProcessingGroups
UniversitatPolitècnicadeCatalunya(UPC)
Xavier Giro-i-Nieto, “Multimedia Annotation”. Dublin City University (04/04/2016)
Manual vs Automatic Annotation
61
The semantic gap is the difference between a high level and a
low level description of a document:
Human are very good at abstraction
using natural language...
...while computers are really good at
analysing perceptual features.
Challenge
89. ImageProcessingGroups
UniversitatPolitècnicadeCatalunya(UPC)
Xavier Giro-i-Nieto, “Multimedia Annotation”. Dublin City University (04/04/2016) 89
Automatic Annotation
Activity Recognition
Tran, Du, Lubomir Bourdev, Rob Fergus, Lorenzo Torresani, and Manohar Paluri. "Learning
spatiotemporal features with 3D convolutional networks." In Proceedings of the IEEE International
Conference on Computer Vision, pp. 4489-4497. 2015
95. ImageProcessingGroups
UniversitatPolitècnicadeCatalunya(UPC)
Xavier Giro-i-Nieto, “Multimedia Annotation”. Dublin City University (04/04/2016) 95
Visual saliency prediction
J. Pan, McGuinness, K., Sayrol, E., O'Connor, N., and Giró-i-Nieto, X., “Shallow and
Deep Convolutional Networks for Saliency Prediction”, in IEEE Conference on
Computer Vision and Pattern Recognition, CVPR, In Press.
LSUN Challenge
Only learn to see ?
104. ImageProcessingGroups
UniversitatPolitècnicadeCatalunya(UPC)
Xavier Giro-i-Nieto, “Multimedia Annotation”. Dublin City University (04/04/2016) 104
Only learn to see ?
“Scripts” (!?)
Darknet
JON
He leaned close and onions, barefoot from his shoulder. "I am not a purple
girl," he said as he stood over him. "The sight of you sell your father with you a
little choice."
"I say to swear up his sea or a boy of stone and heart, down," Lord Tywin
said. "I love your word or her to me."
109. ImageProcessingGroups
UniversitatPolitècnicadeCatalunya(UPC)
Xavier Giro-i-Nieto, “Multimedia Annotation”. Dublin City University (04/04/2016) 109
Only learn to see ?
Affective computing
V. Campos, Salvador, A., Jou, B., and Giró-i-Nieto, X., “Diving Deep into
Sentiment: Understanding Fine-tuned CNNs for Visual Sentiment
Prediction”, in 1st International Workshop on Affect and Sentiment in
Multimedia, Brisbane, Australia, 2015.
Visual maps of positive (green) or negative (red) sentiments:
112. ImageProcessingGroups
UniversitatPolitècnicadeCatalunya(UPC)
Xavier Giro-i-Nieto, “Multimedia Annotation”. Dublin City University (04/04/2016) 112
Only learn to see ?
“Google’s chairman (Eric Schmidth) thinks artificial intelligence
will let scientists solve some of the world’s "hard problems," like
population growth, climate change, human development,
and education.” (Bloomberg Business, 11/01/2016)
[+info @ MIT Technology Review]