This document provides an overview of ontologies and the semantic web. It defines ontologies as formal specifications of conceptualizations that are shared between people and computers. Ontologies provide a common vocabulary and conceptual structure to facilitate understanding between humans and machines. They allow different systems and communities to work together by providing shared definitions of concepts and relationships. The development of ontologies and the semantic web aims to make web resources more computer-readable and enable machines to better understand and process online information.
FITC 2014 Amsterdam - Adobe Apps for Web Designers in 2014, by Michael Chaize
This document discusses Adobe apps that are useful for web designers in 2014, including a history of Adobe ImageReady and Fireworks from 1998. It covers topics like flat design, responsive web design, using Illustrator for vectors and SVG, extracting CSS, and Adobe add-ons. The document provides an overview of design trends and techniques as well as features of Adobe products that help with web design.
Mergers and acquisitions involve combining two companies to capitalize on their mutual strengths and synergies. A merger unifies two similar companies, while an acquisition involves a larger company purchasing a smaller one. These strategies allow companies to scale up operations, gain market share, and leverage each other's expertise and resources. Key considerations for mergers and acquisitions include ensuring strategic fit, capitalizing on synergies, evaluating market opportunities, and creating long-term value for stakeholders. Comprehensive planning is required for integration activities like financial modeling, governance, HR strategy, and marketing.
The document discusses creating and implementing an effective drug-free workplace program. It covers assessing alcohol and drug problems in the workplace, developing policies, educating employees on substance abuse, establishing an Employee Assistance Program, implementing drug testing, and celebrating safely. The goal is to increase productivity and safety while reducing costs, accidents, absenteeism and liability through a successful drug-free workplace initiative.
The presentation systematically introduces a new vision and a new approach for enterprise innovation. The topics addressed draw largely on work carried out in the European project BIVEE (www.bivee.eu). The conclusion offers a first proposal to start thinking of innovation as a discipline in its own right, fertilised by several existing scientific areas, from business to engineering and from creative thinking to technology.
The document discusses Adobe Flex 4.6 and its capabilities for mobile application development. It outlines how Flex allows developing once for multiple mobile platforms like Android, iOS, and others. It highlights features like automatic scaling of user interface elements for different device densities. The document also discusses Adobe's continued support for Flex through contributions to the Apache Flex project.
The document discusses building successful enterprise mobile applications. It advocates for (1) user-experience driven development, (2) adopting a user-experience oriented architecture, and (3) cross-platform development to allow applications to work across multiple devices and operating systems. The document provides examples of how various organizations have benefited from focusing on user experience and building applications that are cross-platform.
This document summarizes a presentation about avoiding app store rejections. It discusses that the app review process is partly subjective and depends on factors like the developer's brand. While hybrid apps are allowed, the app needs to provide a high-quality native experience. Common reasons for rejection include not following Apple's guidelines, inappropriate content, or performance issues. Developers are encouraged to address reviewer feedback, maintain communication, and focus on meeting Apple's standards for user experience.
The document discusses using Java and Flex to create enterprise-class rich internet applications (RIAs). It promotes Flex and the Flash platform as a complete system for building innovative web applications, including details on Flex's capabilities, integration with Java, and productive development workflows. The author, Michael Chaize, is identified as a Flash platform evangelist.
This document provides information about the Software Engineering Group at the Norwegian University of Science and Technology (NTNU). It includes summaries of the research areas, courses taught, and potential master's thesis topics for 13 professors and associate professors in the group. The research areas include software engineering, human-computer interaction, games and simulations, cooperation technologies, and more. The
The document summarizes a research roadmap for future enterprise information systems (FInES) through 2025. It outlines four knowledge spaces: (1) socio-economic, (2) enterprises, (3) enterprise systems, and (4) enabling technologies. For each space, it identifies research challenges. The roadmap provides a vision of future internet-based enterprises and enterprise systems that are humanistic, agile, cognitive, sensing, community-oriented, and sustainable. It also discusses new approaches to enterprise governance, knowledge management, and flexible engineering of proactive and autonomous enterprise systems.
The paper addresses the problem of extending a computer semantic dictionary with new words encountered in a text during its analysis. The proposed system works in a semi-automatic interactive mode. In the first stage, the morphological characteristics of the new word are determined; in the second, its syntactic-semantic parameters are derived from analogous entries in the existing dictionary. The proposed approaches provide a high level of accuracy. For the first time, it is possible to specify the exact semantics of new words, taking into account not only semantic classes but also the arguments that link them to attached words.
BUILDING RELATIONS IN A MIXED ONTOLOGICAL NETWORK FOR SOLVING TEST..., by Sergey Ponomarev
This article describes a method for building "synonym", "hypernym" and "hyponym" relations in a mixed ontological network. The resulting relations were used to determine semantic similarity and associations between words as part of the testing held on the sidelines of the Dialogue-2015 forum.
"Wave Genetic Code" was written three years after the publication of the monograph "Wave Genome" and, despite the similar title, does not repeat it but develops it, mainly in theoretical terms.
In biology, and especially in its key branch, genetics, the time has come for a reassessment of values. It will probably be explosive in character. The words and concepts "gene" and "triplet genetic code", so disliked by many in the days of Lysenkoism, have finally been accepted everywhere, and everything here seems to be clear. But here is the paradox: this appearance of clarity has become an intellectual brake. Meanwhile, an avalanche of new experimental facts no longer fits into the accepted ideas that only yesterday were revolutionary. These new facts are astonishing; they stir the imagination and beckon onward. Take, for instance, the problem of "selfish" DNA, or the strange wastefulness of the synthesis of pre-messenger RNAs, or the seemingly unnecessary intron-exon partitioning of genes. Or take the problem of long-range contextual orientation in the choice of amino acids during protein synthesis, or the phenomenon of laser and soliton emissions of DNA. And this is only a small part of the "anomalous" phenomena in biology. Especially interesting is the situation with F. Crick's famous wobble hypothesis, which hides, behind the resonant idea of a seemingly random "wobbling" (substitution) of the third nucleotide in a codon, the main theoretical difficulty of the triplet-code paradigm: its homonymy. Identical coding doublet-homonyms must precisely denote (encode) one or another of several different amino acids, and the ribosome makes this precise, unambiguous choice of amino acid. But how? Genetics and molecular biology cannot answer this question today; they are frozen in a phase of not understanding the spelling rules for writing protein "texts" out of amino acid "letters".
In other words, it is time to revisit the problem of the genetic code. And why is it called genetic at all? It is a protein code. As for the genetic code as the program for building the whole biosystem, it is substantially different: heteromultiplet, multidimensional, pluralistic and, finally, image-and-wave based.
This work does not claim to be the final truth. Its task is more modest: to pose new questions correctly. The answers to them may be found in the 21st century.
P. P. Garyaev, "Wave Genetic Code", Izdattsentr publishing house, 1997. ISBN 5-7816-0022-1
Classification and clustering in media monitoring: from knowledge engineering..., by Lidia Pivovarova
This PhD thesis examines classification and clustering techniques for media monitoring, including news grouping, multi-label text classification, and business polarity detection. It focuses on applying these methods to the PULS media monitoring system, which collects over 10,000 news articles daily. The thesis contributes novel algorithms and datasets for grouping news into stories based on named entity salience, large-scale multi-label text classification balancing training sets, and the first dataset and methods for entity-level business polarity detection.
The document describes a Russian paraphrase corpus created by the authors. It contains over 8000 sentence pairs annotated as precise, loose, or non-paraphrases using crowdsourcing. The corpus was collected from news headlines and aims to capture the most important events. The authors evaluate different models for classifying sentence pairs and find that combining linguistic features improves performance over individual feature types. Graphs built from the corpus can reveal connected events more completely than human annotations alone.
This document discusses the work of Antiplagiat Research, which tackles challenging natural language processing and plagiarism detection problems. It outlines their focus on cross-language plagiarism detection, machine-generated text detection, and intrinsic plagiarism detection. It also describes Antiplagiat Research's collaboration opportunities and their participation in evaluating plagiarism detection algorithms through workshops like Dialogue Evaluation.
This document summarizes a study that analyzed 47,410 Instagram images from Saint Petersburg over one year to understand human experience in different urban areas. The images were clustered using Google tags and user hashtags into topics like portraits, cars, flowers. The clusters were mapped geographically to see their spatial distribution. Clusters like hairstyle and animals were evenly distributed, while clothing, fitness and architecture were more detached, indicating urban segregation. The combination of semantic and geospatial analysis of social media images provided new insights into urban life not previously available from traditional data sources.
The document discusses the Pullenti NER Engine and its use in semantic similarity tasks. It presents the Semantics-Oriented Linguistic Processor (SOLP) which establishes text segments containing similar semantic units. It then describes the hybrid linguistic and machine learning approach used by the Pullenti-based engine, including the two-step Semantic Expansion Algorithm. Performance figures and evaluation metrics for Pullenti's named entity recognition are also provided.
The document discusses the reliability of results from corpus research and introduces a solution called GICR that provides automatic result analysis. GICR allows users to see statistics on search areas to check for bias or lack of homogeneity compared to the entire corpus by displaying metadata attributes like URLs, document IDs, author information, region, gender, and genre. It aims to address the problem that simply getting IPM and KWIC search results does not indicate if the results are biased by providing analysis directly in the interface.
This document discusses methods for estimating a user's actual age and gender when those values are not directly provided. It outlines using social graph analysis, natural language processing, analyzing user interests, and statistical methods. For social graph analysis, it examines using connections like classmates to infer age and analyzing local graph properties. NLP looks at gender-specific language in user profiles while interest analysis matches users to gender-biased communities. Statistics applies overall patterns in the data to make estimations.
This document presents mathematical models of information dissemination and warfare. It discusses:
1) Models of information spreading through both vertical (centralized) and horizontal (interpersonal) flows, and how the combination of these determines information dynamics in society.
2) Models of information adoption and forgetting over time, and the effects of incomplete media coverage and two-step perception.
3) Models of information warfare between two information sources, examining the necessary conditions for one to win over the other.
4) Extensions of these models including periodic destabilization, additional factors like forgetting, and a model of individual choice-making during information warfare.
This document discusses the analysis and modeling of complex systems. It describes analyzing the problem, modeling the system, and determining both quantitative and qualitative parameters. An example is given of assigning weights to different quantitative parameters. The document recommends creating a coordinate system and basis to define qualitative parameters. It formulates the final task as creating a concept for a basis of a quality parameter system. It seeks colleagues to partner with on further developing these analysis methods.
This document discusses trend detection at OK. It describes the multi-step process used: text extraction from logs, language detection, tokenization, dictionary extraction, vectorization, deduplication, statistics calculation, trend identification, clustering of trending terms, extraction of relevant documents, and visualization of trends. Both batch and streaming approaches are discussed to address the need for timely trend detection. Technologies used include Apache Kafka, YARN, Spark, Samza, Lucene and ELKI.
1. The researcher analyzed quantitative characteristics such as entropy, readability, lexical diversity, frequencies of words, and parts of speech for different text genres including scientific texts, news articles, and student writings.
2. The analysis found that student writings had higher entropy and readability than news articles or scientific texts. News articles had higher lexical diversity and frequencies of common words.
3. To evaluate the accuracy of a developed Old Irish lemmatizer, the researcher applied it to a test corpus of 840 tokens, of which 186 were unknown words. The lemmatizer correctly predicted lemmas for 84 of the unknown words, achieving an accuracy of around 60% for unknown words.
This document discusses methods for evaluating clustering validity indices (CVIs) that measure the quality of clustering results. It proposes using human assessments of clustered data as ground truth to evaluate how well different CVIs match human judgments. An experimental evaluation of 19 CVIs on 41 datasets clustered using 6 algorithms showed that none of the CVIs perfectly matched human assessments. The document concludes that while no universal CVI exists, meta-learning from past human assessments could help select the most appropriate CVI for a new clustering problem.
The document provides information on various artificial intelligence and voice assistant technologies including:
1) JUST AI and Eugene Goostman chatbot, a winner of the 2014 Turing 100 Chatbots competition.
2) Everyday Assistant, a voice assistant available on mobile devices.
3) Dusi Voice Assistant with over 1 million downloads on Google Play.
4) Era of messengers for chatting with personal assistants without voice.
5) ElSmart, the first Android phone for blind users.
6) Zenbot, an open source framework for developing voice assistants across platforms.
This document proposes a data augmentation method for image sentiment analysis using hashtags. It involves collecting a small set of manually labeled images and their hashtags, learning to predict sentiment labels from the hashtags using machine learning, and using this model to automatically label more images. Preliminary results show the hashtag-predicted labels match human labels with 83-95% accuracy. However, more testing is needed on a general set of images to fully evaluate the method's effectiveness.
This document proposes a method for continuous time series alignment in human action recognition. It defines continuous versions of time series, warping paths, and the dynamic time warping (DTW) distance. The method finds the optimal continuous warping path by approximating solutions to a cost minimization problem. An experiment applies the continuous DTW to classify human activities from accelerometer data, achieving classification accuracy close to the discrete DTW method. The continuous approach solves issues with resampling data and has potential for improved approximations and optimization methods.
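For orientation, the discrete DTW distance that the summary uses as its baseline can be sketched as follows. This is the textbook dynamic-programming formulation with an absolute-difference local cost, not the continuous method the document proposes; the function name `dtw_distance` and the choice of local cost are assumptions made for illustration.

```python
import math
from typing import Sequence


def dtw_distance(a: Sequence[float], b: Sequence[float]) -> float:
    """Classic discrete DTW distance between two 1-D series (illustrative sketch)."""
    n, m = len(a), len(b)
    # cost[i][j] = minimal cumulative cost of aligning a[:i] with b[:j]
    cost = [[math.inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            local = abs(a[i - 1] - b[j - 1])  # assumed local cost: absolute difference
            # extend the cheapest of the three admissible predecessor alignments
            cost[i][j] = local + min(
                cost[i - 1][j],      # a advances, b stays on the same sample
                cost[i][j - 1],      # b advances, a stays on the same sample
                cost[i - 1][j - 1],  # both advance
            )
    return cost[n][m]
```

Two series that differ only by a repeated sample, such as `[1, 2, 3]` and `[1, 2, 2, 3]`, get distance 0, which is the warping behaviour that makes DTW tolerant to differences in sampling rate.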
2. Types of word combinations: free, bound, discontinuous, verb government, continuous, collocations
3. Sizes of combinability dictionaries
Title | Size, entries
Dictionary of Russian Word Combinability, ed. P. N. Denisov and V. V. Morkovkin. 3rd ed., rev. Moscow, AST, 2002. 816 pp. | 2500
O. L. Biryuk, V. Yu. Gusev, E. Yu. Kalinina. Dictionary of Verbal Combinability of Non-Object Nouns of the Russian Language - http://dict.ruslang.ru/abstr_noun.php | 10000
O. A. Nevzorova, V. N. Nevzorov, Yu. V. Zinkina, N. V. Pyatkin. An Integrated Technology for Homonymy Resolution in the "LoTA" Text Document Analysis System | 30000
11. Finite-state automaton graph
States: 0 - initial state; 1 - waiting for a noun after a verb; 2 - waiting for a noun before a verb; 3 - waiting for a verb.
Transition labels (parts of speech): verb, preposition, adjective, noun, other.
Edge colors: green - store in the database; red - reset; black - transition.
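A minimal sketch of how a four-state automaton of this kind could extract verb+noun pairs from a tagged token stream. The slide names the states and the three edge actions (store, reset, transition), but the exact edges did not survive extraction, so the whole transition table below is an assumption, as are the tag names `V`, `N`, `ADJ`, `PREP`, `OTHER` and the function name `extract_verb_noun_pairs`.

```python
from typing import Iterable, List, Optional, Tuple

Token = Tuple[str, str]  # (word, part-of-speech tag); tag names are hypothetical


def extract_verb_noun_pairs(tokens: Iterable[Token]) -> List[Tuple[str, str]]:
    """Collect (verb, noun) pairs with an assumed 4-state automaton.

    0 - initial state
    1 - waiting for a noun after a verb
    2 - waiting for a noun before a verb
    3 - waiting for a verb
    """
    state = 0
    verb: Optional[str] = None
    noun: Optional[str] = None
    pairs: List[Tuple[str, str]] = []
    for word, tag in tokens:
        if state == 0:
            if tag == "V":
                verb, state = word, 1
            elif tag == "N":
                noun, state = word, 3
            elif tag in ("ADJ", "PREP"):
                state = 2
            # any other tag keeps the automaton in the initial state
        elif state == 1:
            if tag == "N":
                pairs.append((verb, word))  # "green" edge: store the pair
                state = 0
            elif tag == "V":
                verb = word                 # a newer verb replaces the old one
            elif tag not in ("ADJ", "PREP"):
                state = 0                   # "red" edge: reset
        elif state == 2:
            if tag == "N":
                noun, state = word, 3
            elif tag == "V":
                verb, state = word, 1
            elif tag not in ("ADJ", "PREP"):
                state = 0                   # reset
        else:  # state 3
            if tag == "V":
                pairs.append((word, noun))  # store the pair
                state = 0
            elif tag == "N":
                noun = word                 # a newer noun replaces the old one
            elif tag in ("ADJ", "PREP"):
                state = 2
            else:
                state = 0                   # reset
    return pairs
```

For example, the tagged sequence `read/V interesting/ADJ book/N` yields the pair `("read", "book")`, because adjectives and prepositions are skipped while the automaton waits for the noun.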
13. Volume of processed sources
Source | Volume, million word occurrences
Moshkov Library | 680
RIA Novosti | 156
Additional prose corpus | 120
Nezavisimaya Gazeta | 89
Lenta.ru | 33
Rossiyskaya Gazeta | 29
PCWeek | 28
RBC | 21
Compulenta | 9
Total | 1165
16. Usage statistics by part of speech
Results (by number of occurrences). The numerator is the total number of occurrences found; the denominator is the number of unique combinations.
Pair | Total occurrences, million | > 1 repetition, million | > 2 repetitions, million
Verb + noun | 65 / 8.3 | 60.3 / 3.5 | 57.7 / 2.3
Adverbial participle + noun | 3.5 / 0.88 | 2.8 / 0.31 | 2.6 / 0.18
Noun + adjective | 9.9 / 1.3 | 9.2 / 0.56 | 8.8 / 0.36
Part of speech | Took part | Total in morphology
Verb | 21500 | 26400
Noun | 53300 | 83000
Adjective | 23700 | 45300
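The numerator/denominator figures can be turned into two derived quantities with simple arithmetic: the average number of occurrences per unique combination, and the share of dictionary words that took part. The sketch below uses only the values from the slide; the variable names are illustrative, and reading "took part" as "occurred in at least one extracted combination" is an assumption.

```python
# (total occurrences, unique combinations), both in millions,
# taken from the "total occurrences" column of the slide
totals = {
    "verb+noun": (65.0, 8.3),
    "adverbial participle+noun": (3.5, 0.88),
    "noun+adjective": (9.9, 1.3),
}

# average number of occurrences per unique combination
avg_occurrences = {
    pair: round(total / unique, 2) for pair, (total, unique) in totals.items()
}

# (words that took part, words in the morphological dictionary)
vocabulary = {
    "verb": (21500, 26400),
    "noun": (53300, 83000),
    "adjective": (23700, 45300),
}

# share of dictionary words that took part (interpretation assumed, see above)
coverage = {
    pos: round(used / known, 3) for pos, (used, known) in vocabulary.items()
}

for pair, value in avg_occurrences.items():
    print(f"{pair}: about {value} occurrences per unique combination")
for pos, share in coverage.items():
    print(f"{pos}: {share:.1%} of dictionary words took part")
```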