Reading digital comic on mobile phone is demanding now. Rather that create a new mobile comic content, adaptation of existing digital comic web portal is valuable. In this paper, we proposed an automatic e-comic mobile content adaptation method for automatically create mobile comic content from existing digital comic website portal. Automatic e-comic content adaptation is based on our comic frame extraction method combine with additional process to extract comic balloon and text from digital comic page. The proposed method work as a content adaptation intermediary proxy server application, while generating a Comic XML file as an input source for mobile phone to render a specific mobile comic contents. Our proposed method is an effective and efficient method for real time implementation of reading e-comic comparing to other methods. Experimental results show that our proposed method has 100% accuracy of flat comic frame extraction, 91.48% accuracy of non-flat comic frame extraction, and about 90% processing time faster than previous method.
MOBILE PHONE APPLICATION PROGRAMMING INTERFACES FOR E-COMMERCEIJCSEA Journal
The document summarizes a research project that developed a cross-platform mobile payments application programming interface (API) for e-commerce in Kenya. The API allows merchants to integrate mobile payment options into their online stores, facilitating secure and seamless payments via mobile phones. The objectives were to create a RESTful API that is easy for developers to integrate and supports payments across different platforms. The API connects e-commerce applications to existing mobile payment systems in Kenya like M-Pesa. Testing challenges included limited resources to simulate a production environment. Further testing is needed but the API achieves its goals of enabling mobile payments for e-commerce.
Decision Tree Classifiers to determine the patient’s Post-operative Recovery ...Waqas Tariq
Machine Learning aims to generate classifying expressions simple enough to be understood easily by the human. There are many machine learning approaches available for classification. Among which decision tree learning is one of the most popular classification algorithms. In this paper we propose a systematic approach based on decision tree which is used to automatically determine the patient’s post–operative recovery status. Decision Tree structures are constructed, using data mining methods and then are used to classify discharge decisions.
it & Economic Performance a Critical Review of the Empirical DataWaqas Tariq
The present study undertakes a critical review of the research around the multi-significant issue of the correlation between the IT investments and the economic performance to both micro and macroeconomic level. The aim of this study is to shed light on the interaction of IT with the economy, at corporate, industry and national level and document it¢ s contribution to productivity and therefore to economic growth. My conclusion is that there is a positive effect of IT investments to both the above economic indicators in all aspects, but is something that needs further research so as to find a more clear and risk adjusted relation.
Planning in Markov Stochastic Task DomainsWaqas Tariq
In decision theoretic planning, a challenge for Markov decision processes (MDPs) and partially observable Markov decision processes (POMDPs) is, many problem domains contain big state spaces and complex tasks, which will result in poor solution performance. We develop a task analysis and modeling (TAM) approach, in which the (PO)MDP model is separated into a task view and an action view. In the task view, TAM models the problem domain using a task equivalence model, with task-dependent abstract states and observations. We provide a learning algorithm to obtain the parameter values of task equivalence models. We present three typical examples to explain the TAM approach. Experimental results indicate our approach can greatly improve the computational capacity of task planning in Markov stochastic domains.
Audio Art Authentication and Classification with Wavelet StatisticsWaqas Tariq
An experimental computation technique for audio art authentication is presented. Specifically, the computational techniques used by painting/drawings art authentication are transformed from twodimensional (image) into one-dimensional (audio) methods. The statistical model consists of first and higher-order wavelet statistics. Classification is performed with a multi-dimensional scaled 3D visual model. The results from the analyses of music/silence discrimination, audio art authentication, genre classification, and audio fingerprinting are demonstrated.
One of the fundamental issues in computer science is ordering a list of items. Although there is a number of sorting algorithms, sorting problem has attracted a great deal of research, because efficient sorting is important to optimize the use of other algorithms. This paper presents a new sorting algorithm (Index Sort) which runs based on the previously sorted elements.. This algorithm was analyzed, implemented and tested and the results are promising for a random data.
From TION-EMO Theory To A New Beginning Of Artificial EmotionsWaqas Tariq
This document summarizes Rafael Navia's Tion-Emo Theory (TET), which proposes a new system for building artificial emotions. The TET involves three "objects" that represent different regions - the intellectual object in the intellectual region, the social object in the semi-intellectual region, and the self-object in the non-intellectual region. Connecting these objects allows for the formation of a self-sustaining emotional system, with the self-object regulating emotional energy and censorship systems. The TET draws from studies of animal behavior, evolutionary robotics, and neuroscience to provide a framework grounded in biological principles for developing artificial emotions.
A Method to Provide Accessibility for Visual Components to Vision ImpairedWaqas Tariq
Non-textual graphical information (line graphs, bar charts, pie charts, etc.) are increasingly pervasive in digital scientific literatures and business reports which enabling readers to easily acquire the nature of the underlying information [1]. These graphical components are commonly used to present data in an easy-to interpret way. Graphs are frequently used in economics, mathematics and other scientific subjects. In general term data visualization techniques are useless for blind people. Being unable to access graphical information easily is a major obstacle to blind people in pursuing a scientific study and careers [2].This paper suggests a method to extract implicit information of Bar chart, Pie chart, Line chart and math’s graph components of an electronic document and present them to vision impaired users in audio format. The goal is to provide simple to use, efficient, and available presentation schemes for non textual which can help vision impaired users in comprehending form without needing any further devices or equipments. A software application has been developed based on this research. The output of application is a textual summary of the graphic including the core content of the hypothesized intended message of the graphic designer. The textual summary of the graphic is then conveyed to the user by Text to Speech software .The benefit of this approach is automatic providing the user with the message and knowledge that one would gain from viewing the chart.
MOBILE PHONE APPLICATION PROGRAMMING INTERFACES FOR E-COMMERCEIJCSEA Journal
The document summarizes a research project that developed a cross-platform mobile payments application programming interface (API) for e-commerce in Kenya. The API allows merchants to integrate mobile payment options into their online stores, facilitating secure and seamless payments via mobile phones. The objectives were to create a RESTful API that is easy for developers to integrate and supports payments across different platforms. The API connects e-commerce applications to existing mobile payment systems in Kenya like M-Pesa. Testing challenges included limited resources to simulate a production environment. Further testing is needed but the API achieves its goals of enabling mobile payments for e-commerce.
Decision Tree Classifiers to determine the patient’s Post-operative Recovery ...Waqas Tariq
Machine Learning aims to generate classifying expressions simple enough to be understood easily by the human. There are many machine learning approaches available for classification. Among which decision tree learning is one of the most popular classification algorithms. In this paper we propose a systematic approach based on decision tree which is used to automatically determine the patient’s post–operative recovery status. Decision Tree structures are constructed, using data mining methods and then are used to classify discharge decisions.
it & Economic Performance a Critical Review of the Empirical DataWaqas Tariq
The present study undertakes a critical review of the research around the multi-significant issue of the correlation between the IT investments and the economic performance to both micro and macroeconomic level. The aim of this study is to shed light on the interaction of IT with the economy, at corporate, industry and national level and document it¢ s contribution to productivity and therefore to economic growth. My conclusion is that there is a positive effect of IT investments to both the above economic indicators in all aspects, but is something that needs further research so as to find a more clear and risk adjusted relation.
Planning in Markov Stochastic Task DomainsWaqas Tariq
In decision theoretic planning, a challenge for Markov decision processes (MDPs) and partially observable Markov decision processes (POMDPs) is, many problem domains contain big state spaces and complex tasks, which will result in poor solution performance. We develop a task analysis and modeling (TAM) approach, in which the (PO)MDP model is separated into a task view and an action view. In the task view, TAM models the problem domain using a task equivalence model, with task-dependent abstract states and observations. We provide a learning algorithm to obtain the parameter values of task equivalence models. We present three typical examples to explain the TAM approach. Experimental results indicate our approach can greatly improve the computational capacity of task planning in Markov stochastic domains.
Audio Art Authentication and Classification with Wavelet StatisticsWaqas Tariq
An experimental computation technique for audio art authentication is presented. Specifically, the computational techniques used by painting/drawings art authentication are transformed from twodimensional (image) into one-dimensional (audio) methods. The statistical model consists of first and higher-order wavelet statistics. Classification is performed with a multi-dimensional scaled 3D visual model. The results from the analyses of music/silence discrimination, audio art authentication, genre classification, and audio fingerprinting are demonstrated.
One of the fundamental issues in computer science is ordering a list of items. Although there is a number of sorting algorithms, sorting problem has attracted a great deal of research, because efficient sorting is important to optimize the use of other algorithms. This paper presents a new sorting algorithm (Index Sort) which runs based on the previously sorted elements.. This algorithm was analyzed, implemented and tested and the results are promising for a random data.
From TION-EMO Theory To A New Beginning Of Artificial EmotionsWaqas Tariq
This document summarizes Rafael Navia's Tion-Emo Theory (TET), which proposes a new system for building artificial emotions. The TET involves three "objects" that represent different regions - the intellectual object in the intellectual region, the social object in the semi-intellectual region, and the self-object in the non-intellectual region. Connecting these objects allows for the formation of a self-sustaining emotional system, with the self-object regulating emotional energy and censorship systems. The TET draws from studies of animal behavior, evolutionary robotics, and neuroscience to provide a framework grounded in biological principles for developing artificial emotions.
A Method to Provide Accessibility for Visual Components to Vision ImpairedWaqas Tariq
Non-textual graphical information (line graphs, bar charts, pie charts, etc.) are increasingly pervasive in digital scientific literatures and business reports which enabling readers to easily acquire the nature of the underlying information [1]. These graphical components are commonly used to present data in an easy-to interpret way. Graphs are frequently used in economics, mathematics and other scientific subjects. In general term data visualization techniques are useless for blind people. Being unable to access graphical information easily is a major obstacle to blind people in pursuing a scientific study and careers [2].This paper suggests a method to extract implicit information of Bar chart, Pie chart, Line chart and math’s graph components of an electronic document and present them to vision impaired users in audio format. The goal is to provide simple to use, efficient, and available presentation schemes for non textual which can help vision impaired users in comprehending form without needing any further devices or equipments. A software application has been developed based on this research. The output of application is a textual summary of the graphic including the core content of the hypothesized intended message of the graphic designer. The textual summary of the graphic is then conveyed to the user by Text to Speech software .The benefit of this approach is automatic providing the user with the message and knowledge that one would gain from viewing the chart.
A Simple Integrative Solution For Simultaneous Localization And MappingWaqas Tariq
Simultaneous Localization and Mapping is a method used to find the location of a mobile robot while at the same time build a constructive map of its surrounding environment. This paper gives a brief description about a simple integrative SLAM technique using a Laser Range Finder (LRF) and Odometry data, primarily for indoor environments. In this project, a solution for the SLAM problem was implemented on a differential drive mobile robot equipped with a SICK laser scanner.
Mathematical Derivation of Annuity Interest Rate and its ApplicationWaqas Tariq
A fundamental task in business for investor or borrower is to know the interest rate of an annuity. In this type of problem, the size of each periodic payment(R), the term(n), and the amount(Sn) or the present value of the annuity(An) are usually given. However, a direct equation representing the Annuity Interest Rate(i) is not available, since an approximate value of the Annuity Interest Rate is obtained by interpolation method based on table showing (Sn/R) values. This paper emphasizes the real time computational problem for Annuity interest rate. It has therefore been important to derive an equation for computing the Annuity Interest rate. The evaluation of error analysis has been discussed. The new algorithm saved computational energy by approximately 99.9% than that of the tabulated one.
Active Control of Tool Position in the Presence of Nonlinear Cutting Forces i...Waqas Tariq
This work presents a practical approach to the control of tool’s position, in orthogonal cutting, in the presence nonlinear dynamic cutting forces. The controller is Linear Quadratic Gaussian (LQG) type constructed from an augmented model of both, tool-actuator dynamics, and a nonlinear dynamic model relating tool displacement to cutting forces. The latter model is obtained using black-box system identification of experimental orthogonal cutting data in which tool displacement is the input and cutting force is the output. The controller is evaluated and its performance is demonstrated
Data mining visualization to support biochemical markers for liver fibrosis i...Waqas Tariq
The reference diagnostic test to detect fibrosis is liver biopsy (LB), a procedure subject to various limitations, including risk of patient injury and sampling error. FibroTest (FT) and ActiTest (AT) are biochemical markers (noninvasive tests) used in determining the level of fibrosis and the degree of necroinflammatory activity in the liver. The objective of this work is to discover the differences in the temporal patterns between noninvasive tests and liver biopsy by visualization tools, which made it easier to understand the relations of the complicated rules. This Study ware focused on the major serum fibrosis markers (FT/AT). The test uses a combination of serum biochemical markers with visualization technique to evaluate whether biochemical markers can be used to estimate the stage of liver fibrosis and necro-inflammatory activity in the liver.
Computer Assisted System for Enhancing the Application of Ergonomics in Manuf...Waqas Tariq
The current paper focuses on the need and a plan for the development of a Computer Assisted Interactive and Intelligent Ergonomics System which, through a user friendly consulting mode presents the guidelines and formalized procedures for the application of ergonomics knowledge and data in manufacturing organizations. The system is expected to allow a production engineer or supervisor or even a worker with minimal ergonomics knowledge, to understand, analyze and find solutions to problems related to industrial ergonomics. A survey which is conducted in this regard is also described in this paper and through the out come of the survey it is shown that the poor acceptance and application of ergonomics is due to lack of exposure to ergonomics knowledge and non-availability of ergonomics knowledge in a suitable form for its application in manufacturing systems. 10
Design of A Spell Corrector For Hausa LanguageWaqas Tariq
In this article, a spell corrector has been designed for the Hausa language which is the second most spoken language in Africa and do not yet have processing tools. This study is a contribution to the automatic processing of the Hausa language. We used existing techniques for other languages and adapted them to the special case of the Hausa language. The corrector designed operates essentially on Mijinguini’s dictionary and characteristics of the Hausa alphabet. After a brief review on spell checking and spell correcting techniques and the state of art in the Hausa language processing, we opted for the data structures trie and hash table to represent the dictionary. The edit distance and the specificities of the Hausa alphabet have been used to detect and correct spelling errors. The implementation of the spell corrector has been made on a special editor developed for that purpose (LyTexEditor) but also as an extension (add-on) for OpenOffice.org. A comparison was made on the performance of the two data structures used.
Design and Implementation of Sliding Mode Algorithm: Applied to Robot Manipul...Waqas Tariq
Refer to the research, review of sliding mode controller is introduced and application to robot manipulator has proposed in order to design high performance nonlinear controller in the presence of uncertainties. Regarding to the positive points in sliding mode controller, fuzzy logic controller and adaptive method, the output in most of research have improved. Each method by adding to the previous algorithm has covered negative points. Obviously robot manipulator is nonlinear, and a number of parameters are uncertain, this research focuses on comparison between sliding mode algorithm which analyzed by many researcher. Sliding mode controller (SMC) is one of the nonlinear robust controllers which it can be used in uncertainty nonlinear dynamic systems. This nonlinear controller has two challenges namely nonlinear dynamic equivalent part and chattering phenomenon. A review of sliding mode controller for robot manipulator will be investigated in this research.
Multi User Detection in CDMA System Using Linear and Non Linear DetectorWaqas Tariq
DS-Code division multiple access is considered as the third generation of cellular mobile used in interim standard 95(IS-95) [1]and it is currently being standardized for universal mobile telecommunication systems (UMTS). CDMA offers attractive features, such as frequency reuse, soft handoff, increased capacity, and multipath combating. In a CDMA system, several users simultaneously transmit information over a common channel using pre-assigned codes. The conventional single user detector consists of a bank of filters matched to the spreading codes. This detector suffers from two problems. First, multiple access interference (MAI) produced by the other co-channel users is a significant limitation to the capacity of this detector. The second problem is the near-far effect which occurs when the relative received power of interfering signals becomes larger. A potential solution is multi-user detection which exploits the information of signals of interfering users. In the present study performance of various linear detectors like matched filter detector, MMSE detector, and adaptive LMS detector are studied. These are the linear detectors that operate linearly on the received signal statistics and are suboptimal detectors. The matched filter bank is the conventional detector and offers the simplest way of demodulating CDMA signals .The detector resulting from the MMSE (minimum mean square error) criterion shows better performance over the conventional one for low SNR value. Adaptive LMS is employed to enhance the BER performance in MUD application.Several factors motivated the research to apply neural network as multi-user detector. NN are nonlinear classifier in addition to being adaptive and computationally efficient. The performance of two layer perceptron neural network using BP learning rule is used for multi-user detection of CDMA signals in AWGN channels. The neural network detectors show improvement of BER in the comparative analysis done in the present work. and offers further research scope for solving multi-user detection problems in CDMA application.
A Business Review of E-Retailing in IndiaWaqas Tariq
Abstract As a professor in computer science I am very much interested in training my students in e-Commerce and prepared myself for an in depth research in this area and to present a quick journal about e-Retailing concepts / framework, how an organization can start e-Retailing business quickly? Its Pro’s & Con’s, how to make the e-Retailing venture successful? How retailers should plan / experience to achieve varying success by leveraging the internet technology? How to incorporate traditional retails practices with Internet technology? And strive for success in India. How internet is used by users and its use for online shopping. It serves a s a best article for all the readers across the globe. Keywords: E-Retailing in India, e-tailing, E commerce, Online store, retail, e business
An Efficient Hybrid Successive Markov Model for Predicting Web User Usage Beh...Waqas Tariq
With the continued growth and proliferation of Web services and Web based information systems, the volumes of user data have reached astronomical proportions. Analyzing such data using Web Usage Mining can help to determine the visiting interests or needs of the web user. As web log is incremental in nature, it becomes a crucial issue to predict exactly the ways how users browse websites. It is necessary for web miners to use predictive mining techniques to filter the unwanted categories for reducing the operational scope. The first-order Markov model has low accuracy in achieving right predictions, which is why extensions to higher order models are necessary. All higher order Markov model holds the promise of achieving higher prediction accuracies, improved coverage than any single-order Markov model but holds high state space complexity. Hence a Hybrid Markov Model is required to improve the operation performance and prediction accuracy significantly. The present paper introduces An Efficient Hybrid Successive Markov Prediction Model, HSMP. The HSMP model is initially predicts the possible wanted categories using Relevance factor, which can be used to infer the users’ browsing behavior between web categories. Then predict the pages in predicted categories using techniques for intelligently combining different order Markov models so that the resulting model has low state complexity, improved prediction accuracy and retains the coverage of the all higher order Markov model. These techniques eliminates low support states, evaluates the probability distribution and estimates the error associated with each state without affecting the overall accuracy as well as protection of the resulting model. To validate the proposed prediction model, several experiments were conducted and results proven this are claimed in this paper.
A Robotic Prototype System for Child MonitoringWaqas Tariq
This document describes a robotic prototype system for child monitoring. The system consists of a Khepera robot, host computer, and circuits to trigger lights and alarms. The robot uses image processing to find and follow a baby prop (Lego blocks) in a testing area with obstacles. It can detect when the baby enters danger zones and activate the circuits. Experimental testing showed the robot could successfully find and track the baby prop while avoiding obstacles in scenarios of increasing complexity. The system provides a starting point for developing mobile robotic child monitoring in the home.
Method for Real Time Text Extraction of Digital Manga ComicCSCJournals
Manga is one of popular item in Japan and also in the rest of the world. Hundreds of manga book is printed everyday in Japan, and some of printed manga book is digitized into web content for reading comic through the internet. People then make translation of Japanese language in manga into other language to share enjoy of reading manga for non Japanese reader. However, people make translation of the text on printed comic book (they call it scanlation) in manually because there is no automatic method for translate comic text image into any other language. The challenge in extracting Japanese character in manga is how to detect comic balloon and extract text in vertical direction as Japanese classic writing direction is top down and right to left. Several research projects [1-4] proposed method for text extraction from images but not specific for extraction from comic image. There are two base methods for text extraction, using region based method and texture based method. In [5], propose the concept of automatic mobile content conversion using semantic image analysis that include comic text extraction, but this paper did not explain the details for text extraction. Also, Yamada [6] proposed method for comic image decomposition for reading comic on mobile phone that including comic text extraction but not details on comic text extraction. The conventional method assuming extraction process in offline way and using scanned comic image. In the internet and mobility era, we need advance method for extraction text in online way and automatically
IRJET- Neural Style based Comics Photo-Caption GeneratorIRJET Journal
This document proposes a method to automatically generate comic book strips from ordinary photographs by using neural style transfer to convert the photos to a comic-style and generating poetic captions to describe the comic images. It first discusses existing neural style transfer techniques for transferring artistic styles onto photos. It then focuses on adapting these methods to specifically transfer a comic book style. The proposed system uses a convolutional neural network to separate the content and style of images and recombine them to generate comic-style images, along with using a generative adversarial network to produce captions matching the comic images. The goal is to automate key aspects of the comic book creation process by stylizing real photos as comics and generating accompanying text.
IMAGE TO TEXT TO SPEECH CONVERSION USING MACHINE LEARNINGIRJET Journal
This document describes a study on converting images to text to speech using machine learning. The researchers developed a system that uses optical character recognition to extract text from images, then converts the text to speech. They achieved over 99% accuracy on their test dataset of over 1 million images. Their integrated system was able to accurately extract and convert text from various real-world images like street signs and menus. The system has potential to improve accessibility for people with visual impairments by allowing printed information to be converted to audio. Future work includes handling lower quality images and expanding the system to support additional languages and applications.
IRJET- Transformation of Realistic Images and Videos into Cartoon Images and ...IRJET Journal
This document summarizes research on using a Generative Adversarial Network (GAN) called Cartoon GAN to transform real-world images and videos into cartoon images and videos. The researchers trained Cartoon GAN on 3000 real-world images to learn how to generate cartoon images by using content and adversarial loss functions. They were able to successfully convert both individual images and video clips into cartoon/animated versions. For video, they used the OpenCV library to divide videos into frames, pass each frame through the trained Cartoon GAN model, and then recombine the cartoonized frames into an output cartoon video. The researchers concluded that Cartoon GAN is an effective method for automatically transforming real media into cartoons and aims to improve the quality and resolution
IRJET- Generation of HTML Code using Machine Learning Techniques from Mock-Up...IRJET Journal
This document proposes an approach to automatically generate HTML code from mock-up images using machine learning techniques. The approach uses convolutional neural networks to analyze mock-up images and identify elements like buttons and text. A long short-term memory network is then used to generate HTML code structured according to the website page hierarchy. The networks are trained on a dataset of mock-up images and corresponding HTML code. The goal is to reduce the time and costs required for developers to manually convert mock-ups into code.
Design and development of a delta robot system to classify objects using imag...IJECEIAES
In this paper, a delta robot is designed to grasp objects in an automatic sorting system. The system consists of a delta robot arm for grasping objects, a belt conveyor for transmitting objects, a camera mounted above the conveyor to capture images of objects, and a computer for processing images to classify objects. The delta robot is driven by three direct current (DC) servo motors. The controller is implemented by an Arduino board and Raspberry Pi 4 computer. The Arduino is programmed to provide rotation to each corresponding motor. The Raspberry Pi 4 computer is used to process images of objects to classify objects according to their color. An image processing algorithm is developed to classify objects by color. The blue, green, red (BGR) image of objects is converted to HSV color space and then different thresholds are applied to recognize the object’s color. The robot grasps objects and put them in the correct position according to information received from Raspberry. Experimental results show that the accuracy when classifying red and yellow objects is 100%, and for green objects is 97.5%. The system takes an average of 1.8 s to sort an object.
Conceptual Framework for Anthropomorphic Simulation of Human Face for Interac...IRJESJOURNAL
ABSTRACT. This publication is dedicated to the experimental development of conceptual framework for interactive direct marketing network based on simulated anthropomorphic agents (human faces). Aims: to discuss the main functional open system architecture and in particular openness to track and identify the mood of the user by analyzing the captured images of his face and other features. The article presents practical results, problems and possible solutions on the particular stage of development of the prototype, including project for further investigation the recognition of standardized facial expressions of emotion (anger, fear, disgust, happiness, sadness, surprise) at a perceptual level for children on the autism spectrum.
Probabilistic Approach to Provisioning of ITV - Amos K.Amos Kohn
This white paper discusses a probabilistic approach to provisioning network and computing resources for delivering interactive TV. It develops a proprietary spreadsheet model to estimate the costs and benefits of deploying an interactive TV streaming processor. The model is based on analyzing user behavior, data packaging into MPEG streams, required bit rates, transport of data over the forward and return paths, necessary processing power, and financial projections to calculate return on investment.
Probabilistic Approach to Provisioning of ITV - By Amos_KohnAmos Kohn
This white paper discusses a probabilistic approach to provisioning network and computing resources for delivering interactive TV. It develops a proprietary spreadsheet model to estimate the costs and benefits of deploying an interactive TV streaming processor. The model is based on analyzing user behavior, data packaging into MPEG streams, required bit rates, forward and return network paths, processing needs, and financial projections to calculate return on investment.
An AI Based ATM Intelligent Security System using Open CV and YOLOYogeshIJTSRD
Nowadays most of the surveillance cameras in ATM doesn’t record with detail for analysis of incidents. Due to this most of the ATM cases gets unsolved. In this paper a system to improve ATM security is proposed. The proposed system deals with the development of a application using Open CV, YOLO and AI for automation of video surveillance in ATM machines and detect any type of potential criminal activities that might be arising. Prem Krishna | Saheel Ahamed | Roshan Kartik "An AI Based ATM Intelligent Security System using Open CV and YOLO" Published in International Journal of Trend in Scientific Research and Development (ijtsrd), ISSN: 2456-6470, Volume-5 | Issue-4 , June 2021, URL: https://www.ijtsrd.compapers/ijtsrd41232.pdf Paper URL: https://www.ijtsrd.comengineering/computer-engineering/41232/an-ai-based-atm-intelligent-security-system-using-open-cv-and-yolo/prem-krishna
A Simple Integrative Solution For Simultaneous Localization And MappingWaqas Tariq
Simultaneous Localization and Mapping is a method used to find the location of a mobile robot while at the same time build a constructive map of its surrounding environment. This paper gives a brief description about a simple integrative SLAM technique using a Laser Range Finder (LRF) and Odometry data, primarily for indoor environments. In this project, a solution for the SLAM problem was implemented on a differential drive mobile robot equipped with a SICK laser scanner.
Mathematical Derivation of Annuity Interest Rate and its ApplicationWaqas Tariq
A fundamental task in business for investor or borrower is to know the interest rate of an annuity. In this type of problem, the size of each periodic payment(R), the term(n), and the amount(Sn) or the present value of the annuity(An) are usually given. However, a direct equation representing the Annuity Interest Rate(i) is not available, since an approximate value of the Annuity Interest Rate is obtained by interpolation method based on table showing (Sn/R) values. This paper emphasizes the real time computational problem for Annuity interest rate. It has therefore been important to derive an equation for computing the Annuity Interest rate. The evaluation of error analysis has been discussed. The new algorithm saved computational energy by approximately 99.9% than that of the tabulated one.
Active Control of Tool Position in the Presence of Nonlinear Cutting Forces i...Waqas Tariq
This work presents a practical approach to the control of tool’s position, in orthogonal cutting, in the presence nonlinear dynamic cutting forces. The controller is Linear Quadratic Gaussian (LQG) type constructed from an augmented model of both, tool-actuator dynamics, and a nonlinear dynamic model relating tool displacement to cutting forces. The latter model is obtained using black-box system identification of experimental orthogonal cutting data in which tool displacement is the input and cutting force is the output. The controller is evaluated and its performance is demonstrated
Data mining visualization to support biochemical markers for liver fibrosis i...Waqas Tariq
The reference diagnostic test to detect fibrosis is liver biopsy (LB), a procedure subject to various limitations, including risk of patient injury and sampling error. FibroTest (FT) and ActiTest (AT) are biochemical markers (noninvasive tests) used in determining the level of fibrosis and the degree of necroinflammatory activity in the liver. The objective of this work is to discover the differences in the temporal patterns between noninvasive tests and liver biopsy by visualization tools, which made it easier to understand the relations of the complicated rules. This Study ware focused on the major serum fibrosis markers (FT/AT). The test uses a combination of serum biochemical markers with visualization technique to evaluate whether biochemical markers can be used to estimate the stage of liver fibrosis and necro-inflammatory activity in the liver.
Computer Assisted System for Enhancing the Application of Ergonomics in Manuf...Waqas Tariq
The current paper focuses on the need and a plan for the development of a Computer Assisted Interactive and Intelligent Ergonomics System which, through a user friendly consulting mode presents the guidelines and formalized procedures for the application of ergonomics knowledge and data in manufacturing organizations. The system is expected to allow a production engineer or supervisor or even a worker with minimal ergonomics knowledge, to understand, analyze and find solutions to problems related to industrial ergonomics. A survey which is conducted in this regard is also described in this paper and through the out come of the survey it is shown that the poor acceptance and application of ergonomics is due to lack of exposure to ergonomics knowledge and non-availability of ergonomics knowledge in a suitable form for its application in manufacturing systems. 10
Design of A Spell Corrector For Hausa LanguageWaqas Tariq
In this article, a spell corrector has been designed for the Hausa language which is the second most spoken language in Africa and do not yet have processing tools. This study is a contribution to the automatic processing of the Hausa language. We used existing techniques for other languages and adapted them to the special case of the Hausa language. The corrector designed operates essentially on Mijinguini’s dictionary and characteristics of the Hausa alphabet. After a brief review on spell checking and spell correcting techniques and the state of art in the Hausa language processing, we opted for the data structures trie and hash table to represent the dictionary. The edit distance and the specificities of the Hausa alphabet have been used to detect and correct spelling errors. The implementation of the spell corrector has been made on a special editor developed for that purpose (LyTexEditor) but also as an extension (add-on) for OpenOffice.org. A comparison was made on the performance of the two data structures used.
Design and Implementation of Sliding Mode Algorithm: Applied to Robot Manipul...Waqas Tariq
Refer to the research, review of sliding mode controller is introduced and application to robot manipulator has proposed in order to design high performance nonlinear controller in the presence of uncertainties. Regarding to the positive points in sliding mode controller, fuzzy logic controller and adaptive method, the output in most of research have improved. Each method by adding to the previous algorithm has covered negative points. Obviously robot manipulator is nonlinear, and a number of parameters are uncertain, this research focuses on comparison between sliding mode algorithm which analyzed by many researcher. Sliding mode controller (SMC) is one of the nonlinear robust controllers which it can be used in uncertainty nonlinear dynamic systems. This nonlinear controller has two challenges namely nonlinear dynamic equivalent part and chattering phenomenon. A review of sliding mode controller for robot manipulator will be investigated in this research.
Multi User Detection in CDMA System Using Linear and Non Linear DetectorWaqas Tariq
DS-Code division multiple access is considered as the third generation of cellular mobile used in interim standard 95(IS-95) [1]and it is currently being standardized for universal mobile telecommunication systems (UMTS). CDMA offers attractive features, such as frequency reuse, soft handoff, increased capacity, and multipath combating. In a CDMA system, several users simultaneously transmit information over a common channel using pre-assigned codes. The conventional single user detector consists of a bank of filters matched to the spreading codes. This detector suffers from two problems. First, multiple access interference (MAI) produced by the other co-channel users is a significant limitation to the capacity of this detector. The second problem is the near-far effect which occurs when the relative received power of interfering signals becomes larger. A potential solution is multi-user detection which exploits the information of signals of interfering users. In the present study performance of various linear detectors like matched filter detector, MMSE detector, and adaptive LMS detector are studied. These are the linear detectors that operate linearly on the received signal statistics and are suboptimal detectors. The matched filter bank is the conventional detector and offers the simplest way of demodulating CDMA signals .The detector resulting from the MMSE (minimum mean square error) criterion shows better performance over the conventional one for low SNR value. Adaptive LMS is employed to enhance the BER performance in MUD application.Several factors motivated the research to apply neural network as multi-user detector. NN are nonlinear classifier in addition to being adaptive and computationally efficient. The performance of two layer perceptron neural network using BP learning rule is used for multi-user detection of CDMA signals in AWGN channels. The neural network detectors show improvement of BER in the comparative analysis done in the present work. and offers further research scope for solving multi-user detection problems in CDMA application.
A Business Review of E-Retailing in IndiaWaqas Tariq
Abstract As a professor in computer science I am very much interested in training my students in e-Commerce and prepared myself for an in depth research in this area and to present a quick journal about e-Retailing concepts / framework, how an organization can start e-Retailing business quickly? Its Pro’s & Con’s, how to make the e-Retailing venture successful? How retailers should plan / experience to achieve varying success by leveraging the internet technology? How to incorporate traditional retails practices with Internet technology? And strive for success in India. How internet is used by users and its use for online shopping. It serves a s a best article for all the readers across the globe. Keywords: E-Retailing in India, e-tailing, E commerce, Online store, retail, e business
An Efficient Hybrid Successive Markov Model for Predicting Web User Usage Beh...Waqas Tariq
With the continued growth and proliferation of Web services and Web based information systems, the volumes of user data have reached astronomical proportions. Analyzing such data using Web Usage Mining can help to determine the visiting interests or needs of the web user. As web log is incremental in nature, it becomes a crucial issue to predict exactly the ways how users browse websites. It is necessary for web miners to use predictive mining techniques to filter the unwanted categories for reducing the operational scope. The first-order Markov model has low accuracy in achieving right predictions, which is why extensions to higher order models are necessary. All higher order Markov model holds the promise of achieving higher prediction accuracies, improved coverage than any single-order Markov model but holds high state space complexity. Hence a Hybrid Markov Model is required to improve the operation performance and prediction accuracy significantly. The present paper introduces An Efficient Hybrid Successive Markov Prediction Model, HSMP. The HSMP model is initially predicts the possible wanted categories using Relevance factor, which can be used to infer the users’ browsing behavior between web categories. Then predict the pages in predicted categories using techniques for intelligently combining different order Markov models so that the resulting model has low state complexity, improved prediction accuracy and retains the coverage of the all higher order Markov model. These techniques eliminates low support states, evaluates the probability distribution and estimates the error associated with each state without affecting the overall accuracy as well as protection of the resulting model. To validate the proposed prediction model, several experiments were conducted and results proven this are claimed in this paper.
A Robotic Prototype System for Child MonitoringWaqas Tariq
This document describes a robotic prototype system for child monitoring. The system consists of a Khepera robot, host computer, and circuits to trigger lights and alarms. The robot uses image processing to find and follow a baby prop (Lego blocks) in a testing area with obstacles. It can detect when the baby enters danger zones and activate the circuits. Experimental testing showed the robot could successfully find and track the baby prop while avoiding obstacles in scenarios of increasing complexity. The system provides a starting point for developing mobile robotic child monitoring in the home.
Method for Real Time Text Extraction of Digital Manga ComicCSCJournals
Manga is one of popular item in Japan and also in the rest of the world. Hundreds of manga book is printed everyday in Japan, and some of printed manga book is digitized into web content for reading comic through the internet. People then make translation of Japanese language in manga into other language to share enjoy of reading manga for non Japanese reader. However, people make translation of the text on printed comic book (they call it scanlation) in manually because there is no automatic method for translate comic text image into any other language. The challenge in extracting Japanese character in manga is how to detect comic balloon and extract text in vertical direction as Japanese classic writing direction is top down and right to left. Several research projects [1-4] proposed method for text extraction from images but not specific for extraction from comic image. There are two base methods for text extraction, using region based method and texture based method. In [5], propose the concept of automatic mobile content conversion using semantic image analysis that include comic text extraction, but this paper did not explain the details for text extraction. Also, Yamada [6] proposed method for comic image decomposition for reading comic on mobile phone that including comic text extraction but not details on comic text extraction. The conventional method assuming extraction process in offline way and using scanned comic image. In the internet and mobility era, we need advance method for extraction text in online way and automatically
IRJET- Neural Style based Comics Photo-Caption GeneratorIRJET Journal
This document proposes a method to automatically generate comic book strips from ordinary photographs by using neural style transfer to convert the photos to a comic-style and generating poetic captions to describe the comic images. It first discusses existing neural style transfer techniques for transferring artistic styles onto photos. It then focuses on adapting these methods to specifically transfer a comic book style. The proposed system uses a convolutional neural network to separate the content and style of images and recombine them to generate comic-style images, along with using a generative adversarial network to produce captions matching the comic images. The goal is to automate key aspects of the comic book creation process by stylizing real photos as comics and generating accompanying text.
IMAGE TO TEXT TO SPEECH CONVERSION USING MACHINE LEARNINGIRJET Journal
This document describes a study on converting images to text to speech using machine learning. The researchers developed a system that uses optical character recognition to extract text from images, then converts the text to speech. They achieved over 99% accuracy on their test dataset of over 1 million images. Their integrated system was able to accurately extract and convert text from various real-world images like street signs and menus. The system has potential to improve accessibility for people with visual impairments by allowing printed information to be converted to audio. Future work includes handling lower quality images and expanding the system to support additional languages and applications.
IRJET- Transformation of Realistic Images and Videos into Cartoon Images and ...IRJET Journal
This document summarizes research on using a Generative Adversarial Network (GAN) called Cartoon GAN to transform real-world images and videos into cartoon images and videos. The researchers trained Cartoon GAN on 3000 real-world images to learn how to generate cartoon images by using content and adversarial loss functions. They were able to successfully convert both individual images and video clips into cartoon/animated versions. For video, they used the OpenCV library to divide videos into frames, pass each frame through the trained Cartoon GAN model, and then recombine the cartoonized frames into an output cartoon video. The researchers concluded that Cartoon GAN is an effective method for automatically transforming real media into cartoons and aims to improve the quality and resolution
IRJET- Generation of HTML Code using Machine Learning Techniques from Mock-Up...IRJET Journal
This document proposes an approach to automatically generate HTML code from mock-up images using machine learning techniques. The approach uses convolutional neural networks to analyze mock-up images and identify elements like buttons and text. A long short-term memory network is then used to generate HTML code structured according to the website page hierarchy. The networks are trained on a dataset of mock-up images and corresponding HTML code. The goal is to reduce the time and costs required for developers to manually convert mock-ups into code.
Design and development of a delta robot system to classify objects using imag...IJECEIAES
In this paper, a delta robot is designed to grasp objects in an automatic sorting system. The system consists of a delta robot arm for grasping objects, a belt conveyor for transmitting objects, a camera mounted above the conveyor to capture images of objects, and a computer for processing images to classify objects. The delta robot is driven by three direct current (DC) servo motors. The controller is implemented by an Arduino board and Raspberry Pi 4 computer. The Arduino is programmed to provide rotation to each corresponding motor. The Raspberry Pi 4 computer is used to process images of objects to classify objects according to their color. An image processing algorithm is developed to classify objects by color. The blue, green, red (BGR) image of objects is converted to HSV color space and then different thresholds are applied to recognize the object’s color. The robot grasps objects and put them in the correct position according to information received from Raspberry. Experimental results show that the accuracy when classifying red and yellow objects is 100%, and for green objects is 97.5%. The system takes an average of 1.8 s to sort an object.
Conceptual Framework for Anthropomorphic Simulation of Human Face for Interac...IRJESJOURNAL
ABSTRACT. This publication is dedicated to the experimental development of conceptual framework for interactive direct marketing network based on simulated anthropomorphic agents (human faces). Aims: to discuss the main functional open system architecture and in particular openness to track and identify the mood of the user by analyzing the captured images of his face and other features. The article presents practical results, problems and possible solutions on the particular stage of development of the prototype, including project for further investigation the recognition of standardized facial expressions of emotion (anger, fear, disgust, happiness, sadness, surprise) at a perceptual level for children on the autism spectrum.
Probabilistic Approach to Provisioning of ITV - Amos K.Amos Kohn
This white paper discusses a probabilistic approach to provisioning network and computing resources for delivering interactive TV. It develops a proprietary spreadsheet model to estimate the costs and benefits of deploying an interactive TV streaming processor. The model is based on analyzing user behavior, data packaging into MPEG streams, required bit rates, transport of data over the forward and return paths, necessary processing power, and financial projections to calculate return on investment.
Probabilistic Approach to Provisioning of ITV - By Amos_KohnAmos Kohn
This white paper discusses a probabilistic approach to provisioning network and computing resources for delivering interactive TV. It develops a proprietary spreadsheet model to estimate the costs and benefits of deploying an interactive TV streaming processor. The model is based on analyzing user behavior, data packaging into MPEG streams, required bit rates, forward and return network paths, processing needs, and financial projections to calculate return on investment.
An AI Based ATM Intelligent Security System using Open CV and YOLOYogeshIJTSRD
Nowadays most of the surveillance cameras in ATM doesn’t record with detail for analysis of incidents. Due to this most of the ATM cases gets unsolved. In this paper a system to improve ATM security is proposed. The proposed system deals with the development of a application using Open CV, YOLO and AI for automation of video surveillance in ATM machines and detect any type of potential criminal activities that might be arising. Prem Krishna | Saheel Ahamed | Roshan Kartik "An AI Based ATM Intelligent Security System using Open CV and YOLO" Published in International Journal of Trend in Scientific Research and Development (ijtsrd), ISSN: 2456-6470, Volume-5 | Issue-4 , June 2021, URL: https://www.ijtsrd.compapers/ijtsrd41232.pdf Paper URL: https://www.ijtsrd.comengineering/computer-engineering/41232/an-ai-based-atm-intelligent-security-system-using-open-cv-and-yolo/prem-krishna
International Journal of Engineering Research and Applications (IJERA) is an open access online peer reviewed international journal that publishes research and review articles in the fields of Computer Science, Neural Networks, Electrical Engineering, Software Engineering, Information Technology, Mechanical Engineering, Chemical Engineering, Plastic Engineering, Food Technology, Textile Engineering, Nano Technology & science, Power Electronics, Electronics & Communication Engineering, Computational mathematics, Image processing, Civil Engineering, Structural Engineering, Environmental Engineering, VLSI Testing & Low Power VLSI Design etc.
Real Time Head Generation for Video ConferencingIRJET Journal
This document proposes a real-time video synthesis model for video conferencing that reconstructs video at the receiver end to maintain a steady experience. It extracts and retargets motion from sender video frame by frame and synthesizes video on the receiver end using the first image and motion keypoints. This allows it to transfer facial expressions, head poses, and eye movements between videos using less bandwidth than standard H.264 video. The model uses WebRTC for peer-to-peer connections and a first-order motion model to extract and apply motion from one video to another in real-time for live video conferencing.
Automated Image Captioning – Model Based on CNN – GRU ArchitectureIRJET Journal
This document presents a model for automated image captioning using deep learning techniques. The model uses a CNN-GRU architecture, where a CNN encoder extracts image features and a GRU decoder generates captions. The model is trained on the Flickr30K dataset and achieves a BLEU score of 0.5625. Experimental results show the model can accurately identify objects, animals, and relationships between objects in images and generate descriptive captions. The authors integrate text-to-speech functionality to help describe images to visually impaired people. In under 3 sentences, the document introduces an image captioning model using CNN-GRU, discusses training on Flickr30K, and highlights integration of text-to-speech for assisting the visually impaired.
With the development of autonomous development
technology, the need for additional applications to be used
inside and outside the vehicle is increasing. As a result of the
literature review, many applications have been developed to
display vehicle data directly on the monitor, with reflections
on glass, and on hardware devices. These applications have
been developed only for a defined problem and for a
particular autonomous system. In this study, a basic
autonomous vehicle software infrastructure and mobile
Augmented Reality application that can work on Android
devices have been developed. The Mobile Augmented Reality
app serves inside and outside the vehicle. In addition, this
application has been shown to support multiple autonomous
system infrastructures.
The document discusses ATMOS, a cloud-based content retrieval library. Some key points:
1) ATMOS provides the benefits of cloud computing like convenient APIs, simple metadata-based retrieval and classification, high reliability and security.
2) Using ATMOS as the basis for a content retrieval system could help solve problems with data manipulation and add more universality to retrieval, allowing both machine and manual indexing.
3) The library presented retrieves images that are visually or cognitively similar to a sample image provided by the user, ignoring weak image noises and prioritizing similar shapes and edges.
This document provides a synopsis of a six-week industrial training project called "Visualizer" that involved building a system to represent real-time data from IoT devices graphically on a website. The project involved transmitting sensor data wirelessly to a database server, processing the data, and simultaneously updating a real-time line graph. Key aspects included installing necessary software, dividing the large project into subtasks, creating a MySQL database, transmitting and acquiring the data, fetching values from the database to plot the dynamic graph, and implementing a Model-View-Controller structure for the front-end and back-end development. The project has various applications including medical breath analyzers and devices for agriculture.
GENERATION OF HTML CODE AUTOMATICALLY USING MOCK-UP IMAGES WITH MACHINE LEARN...IRJET Journal
The document discusses a technique for automatically generating HTML code from mock-up images using machine learning. Mock-up images are first created containing elements like text boxes, buttons, radio buttons etc. These images are then fed into a convolutional neural network model which detects and recognizes the elements. The coordinates of the detected elements are then used to generate the HTML code for the corresponding elements using an HTML builder algorithm. This allows rapid development of website frontends from mock-up designs in an automated, time-saving and cost-effective manner compared to manual coding. The proposed technique aims to reduce development time and costs while generating HTML code templates from visual designs.
1. Internet & Web Technologies
2. Electronic Mail (eMail)
3. Call from Mobile
4. Songs (offline on Mobile/Computer)
5. Streaming Videos
6. SMS
7. WhatsApp Messaging
8. Video Conferencing
9. Zoom Architecture
10. Next Revolution – Rich Communication Services (RCS)
Three Dimensional Database: Artificial Intelligence to eCommerce Web service ...CSCJournals
A main objective of this paper is using artificial intelligence technique to web service agents and increase the efficiency of the agent communications. In recent years, web services have played a major role in computer applications. Web services are essential, as the design model of applications are dedicated to electronic businesses. This model aims to become one of the major formalisms for the design of distributed and cooperative applications in an open environment (the Internet). Current commercial and research-based efforts are reviewed and positioned within these two fields. A web service as a software system designed to support interoperable machine-to-machine interaction over a network. It has an interface described in a machine-process able format (specifically Web Services Description Language WSDL). Other systems interact with the web service in a manner prescribed by its description using SOAP messages, typically conveyed using HTTP with an XML serialization in conjunction with other Web-related standards. Particular attention is given to the application of AI techniques to the important issue of WS composition. Within the range of AI technologies considered, we focus on the work of the Semantic Web and Agent-based communities to provide web services with semantic descriptions and intelligent behavior and reasoning capabilities. Re-composition of web services is also considered and a number of adaptive agent approaches are introduced and implemented in publication domain with three dimensional databases and one of the areas of work is eCommerce.
Similar to Automatic E-Comic Content Adaptation (20)
The Use of Java Swing’s Components to Develop a WidgetWaqas Tariq
Widget is a kind of application provides a single service such as a map, news feed, simple clock, battery-life indicators, etc. This kind of interactive software object has been developed to facilitate user interface (UI) design. A user interface (UI) function may be implemented using different widgets with the same function. In this article, we present the widget as a platform that is generally used in various applications, such as in desktop, web browser, and mobile phone. We also describe a visual menu of Java Swing’s components that will be used to establish widget. It will assume that we have successfully compiled and run a program that uses Swing components.
3D Human Hand Posture Reconstruction Using a Single 2D ImageWaqas Tariq
Passive sensing of the 3D geometric posture of the human hand has been studied extensively over the past decade. However, these research efforts have been hampered by the computational complexity caused by inverse kinematics and 3D reconstruction. In this paper, our objective focuses on 3D hand posture estimation based on a single 2D image with aim of robotic applications. We introduce the human hand model with 27 degrees of freedom (DOFs) and analyze some of its constraints to reduce the DOFs without any significant degradation of performance. A novel algorithm to estimate the 3D hand posture from eight 2D projected feature points is proposed. Experimental results using real images confirm that our algorithm gives good estimates of the 3D hand pose. Keywords: 3D hand posture estimation; Model-based approach; Gesture recognition; human- computer interface; machine vision.
Camera as Mouse and Keyboard for Handicap Person with Troubleshooting Ability...Waqas Tariq
Camera mouse has been widely used for handicap person to interact with computer. The utmost important of the use of camera mouse is must be able to replace all roles of typical mouse and keyboard. It must be able to provide all mouse click events and keyboard functions (include all shortcut keys) when it is used by handicap person. Also, the use of camera mouse must allow users troubleshooting by themselves. Moreover, it must be able to eliminate neck fatigue effect when it is used during long period. In this paper, we propose camera mouse system with timer as left click event and blinking as right click event. Also, we modify original screen keyboard layout by add two additional buttons (button “drag/ drop” is used to do drag and drop of mouse events and another button is used to call task manager (for troubleshooting)) and change behavior of CTRL, ALT, SHIFT, and CAPS LOCK keys in order to provide shortcut keys of keyboard. Also, we develop recovery method which allows users go from camera and then come back again in order to eliminate neck fatigue effect. The experiments which involve several users have been done in our laboratory. The results show that the use of our camera mouse able to allow users do typing, left and right click events, drag and drop events, and troubleshooting without hand. By implement this system, handicap person can use computer more comfortable and reduce the dryness of eyes.
A Proposed Web Accessibility Framework for the Arab DisabledWaqas Tariq
The Web is providing unprecedented access to information and interaction for people with disabilities. This paper presents a Web accessibility framework which offers the ease of the Web accessing for the disabled Arab users and facilitates their lifelong learning as well. The proposed framework system provides the disabled Arab user with an easy means of access using their mother language so they don’t have to overcome the barrier of learning the target-spoken language. This framework is based on analyzing the web page meta-language, extracting its content and reformulating it in a suitable format for the disabled users. The basic objective of this framework is supporting the equal rights of the Arab disabled people for their access to the education and training with non disabled people. Key Words : Arabic Moon code, Arabic Sign Language, Deaf, Deaf-blind, E-learning Interactivity, Moon code, Web accessibility , Web framework , Web System, WWW.
Real Time Blinking Detection Based on Gabor FilterWaqas Tariq
The document proposes a new method for real-time blinking detection based on Gabor filters. It begins by reviewing existing methods and their limitations in dealing with noise, variations in eye shape, and blinking speed. The proposed method uses a Gabor filter to extract the top and bottom arcs of the eye from an image. It then measures the distance between these arcs and compares it to a threshold: a distance below the threshold indicates a closed eye, while a distance above indicates an open eye. The document claims this Gabor filter-based approach is robust to noise, variations in eye shape and blinking speed. It presents experimental results showing the method can accurately detect blinking across different users.
Computer Input with Human Eyes-Only Using Two Purkinje Images Which Works in ...Waqas Tariq
A method for computer input with human eyes-only using two Purkinje images which works in a real time basis without calibration is proposed. Experimental results shows that cornea curvature can be estimated by using two light sources derived Purkinje images so that no calibration for reducing person-to-person difference of cornea curvature. It is found that the proposed system allows usersf movements of 30 degrees in roll direction and 15 degrees in pitch direction utilizing detected face attitude which is derived from the face plane consisting three feature points on the face, two eyes and nose or mouth. Also it is found that the proposed system does work in a real time basis.
Toward a More Robust Usability concept with Perceived Enjoyment in the contex...Waqas Tariq
Mobile multimedia service is relatively new but has quickly dominated people¡¯s lives, especially among young people. To explain this popularity, this study applies and modifies the Technology Acceptance Model (TAM) to propose a research model and conduct an empirical study. The goal of study is to examine the role of Perceived Enjoyment (PE) and what determinants can contribute to PE in the context of using mobile multimedia service. The result indicates that PE is influencing on Perceived Usefulness (PU) and Perceived Ease of Use (PEOU) and directly Behavior Intention (BI). Aesthetics and flow are key determinants to explain Perceived Enjoyment (PE) in mobile multimedia usage.
Collaborative Learning of Organisational KnolwedgeWaqas Tariq
This paper presents recent research into methods used in Australian Indigenous Knowledge sharing and looks at how these can support the creation of suitable collaborative envi- ronments for timely organisational learning. The protocols and practices as used today and in the past by Indigenous communities are presented and discussed in relation to their relevance to a personalised system of knowledge sharing in modern organisational cultures. This research focuses on user models, knowledge acquisition and integration of data for constructivist learning in a networked repository of or- ganisational knowledge. The data collected in the repository is searched to provide collections of up-to-date and relevant material for training in a work environment. The aim is to improve knowledge collection and sharing in a team envi- ronment. This knowledge can then be collated into a story or workflow that represents the present knowledge in the organisation.
Our research aims to propose a global approach for specification, design and verification of context awareness Human Computer Interface (HCI). This is a Model Based Design approach (MBD). This methodology describes the ubiquitous environment by ontologies. OWL is the standard used for this purpose. The specification and modeling of Human-Computer Interaction are based on Petri nets (PN). This raises the question of representation of Petri nets with XML. We use for this purpose, the standard of modeling PNML. In this paper, we propose an extension of this standard for specification, generation and verification of HCI. This extension is a methodological approach for the construction of PNML with Petri nets. The design principle uses the concept of composition of elementary structures of Petri nets as PNML Modular. The objective is to obtain a valid interface through verification of properties of elementary Petri nets represented with PNML.
Development of Sign Signal Translation System Based on Altera’s FPGA DE2 BoardWaqas Tariq
The main aim of this paper is to build a system that is capable of detecting and recognizing the hand gesture in an image captured by using a camera. The system is built based on Altera’s FPGA DE2 board, which contains a Nios II soft core processor. Image processing techniques and a simple but effective algorithm are implemented to achieve this purpose. Image processing techniques are used to smooth the image in order to ease the subsequent processes in translating the hand sign signal. The algorithm is built for translating the numerical hand sign signal and the result are displayed on the seven segment display. Altera’s Quartus II, SOPC Builder and Nios II EDS software are used to construct the system. By using SOPC Builder, the related components on the DE2 board can be interconnected easily and orderly compared to traditional method that requires lengthy source code and time consuming. Quartus II is used to compile and download the design to the DE2 board. Then, under Nios II EDS, C programming language is used to code the hand sign translation algorithm. Being able to recognize the hand sign signal from images can helps human in controlling a robot and other applications which require only a simple set of instructions provided a CMOS sensor is included in the system.
An overview on Advanced Research Works on Brain-Computer InterfaceWaqas Tariq
A brain–computer interface (BCI) is a proficient result in the research field of human- computer synergy, where direct articulation between brain and an external device occurs resulting in augmenting, assisting and repairing human cognitive. Advanced works like generating brain-computer interface switch technologies for intermittent (or asynchronous) control in natural environments or developing brain-computer interface by Fuzzy logic Systems or by implementing wavelet theory to drive its efficacies are still going on and some useful results has also been found out. The requirements to develop this brain machine interface is also growing day by day i.e. like neuropsychological rehabilitation, emotion control, etc. An overview on the control theory and some advanced works on the field of brain machine interface are shown in this paper.
Exploring the Relationship Between Mobile Phone and Senior Citizens: A Malays...Waqas Tariq
There is growing ageing phenomena with the rise of ageing population throughout the world. According to the World Health Organization (2002), the growing ageing population indicates 694 million, or 223% is expected for people aged 60 and over, since 1970 and 2025.The growth is especially significant in some advanced countries such as North America, Japan, Italy, Germany, United Kingdom and so forth. This growing older adult population has significantly impact the social-culture, lifestyle, healthcare system, economy, infrastructure and government policy of a nation. However, there are limited research studies on the perception and usage of a mobile phone and its service for senior citizens in a developing nation like Malaysia. This paper explores the relationship between mobile phones and senior citizens in Malaysia from the perspective of a developing country. We conducted an exploratory study using contextual interviews with 5 senior citizens of how they perceive their mobile phones. This paper reveals 4 interesting themes from this preliminary study, in addition to the findings of the desirable mobile requirements for local senior citizens with respect of health, safety and communication purposes. The findings of this study bring interesting insight to local telecommunication industries as a whole, and will also serve as groundwork for more in-depth study in the future.
Principles of Good Screen Design in WebsitesWaqas Tariq
Visual techniques for proper arrangement of the elements on the user screen have helped the designers to make the screen look good and attractive. Several visual techniques emphasize the arrangement and ordering of the screen elements based on particular criteria for best appearance of the screen. This paper investigates few significant visual techniques in various web user interfaces and showcases the results for better understanding and their presence.
This document discusses the progress of virtual teams in Albania. It provides context on virtual teams and how they differ from traditional teams in their reliance on technology for communication across distances. The document then examines the use of virtual teams in Albania, noting the growing infrastructure and technology usage that enables virtual collaboration. It highlights some virtual team examples in Albanian government and academic projects.
Cognitive Approach Towards the Maintenance of Web-Sites Through Quality Evalu...Waqas Tariq
It is a well established fact that the Web-Applications require frequent maintenance because of cutting– edge business competitions. The authors have worked on quality evaluation of web-site of Indian ecommerce domain. As a result of that work they have made a quality-wise ranking of these sites. According to their work and also the survey done by various other groups Futurebazaar web-site is considered to be one of the best Indian e-shopping sites. In this research paper the authors are assessing the maintenance of the same site by incorporating the problems incurred during this evaluation. This exercise gives a real world maintainability problem of web-sites. This work will give a clear picture of all the quality metrics which are directly or indirectly related with the maintainability of the web-site.
USEFul: A Framework to Mainstream Web Site Usability through Automated Evalua...Waqas Tariq
A paradox has been observed whereby web site usability is proven to be an essential element in a web site, yet at the same time there exist an abundance of web pages with poor usability. This discrepancy is the result of limitations that are currently preventing web developers in the commercial sector from producing usable web sites. In this paper we propose a framework whose objective is to alleviate this problem by automating certain aspects of the usability evaluation process. Mainstreaming comes as a result of automation, therefore enabling a non-expert in the field of usability to conduct the evaluation. This results in reducing the costs associated with such evaluation. Additionally, the framework allows the flexibility of adding, modifying or deleting guidelines without altering the code that references them since the guidelines and the code are two separate components. A comparison of the evaluation results carried out using the framework against published evaluations of web sites carried out by web site usability professionals reveals that the framework is able to automatically identify the majority of usability violations. Due to the consistency with which it evaluates, it identified additional guideline-related violations that were not identified by the human evaluators.
Robot Arm Utilized Having Meal Support System Based on Computer Input by Huma...Waqas Tariq
A robot arm utilized having meal support system based on computer input by human eyes only is proposed. The proposed system is developed for handicap/disabled persons as well as elderly persons and tested with able persons with several shapes and size of eyes under a variety of illumination conditions. The test results with normal persons show the proposed system does work well for selection of the desired foods and for retrieve the foods as appropriate as usersf requirements. It is found that the proposed system is 21% much faster than the manually controlled robotics.
Dynamic Construction of Telugu Speech Corpus for Voice Enabled Text EditorWaqas Tariq
In recent decades speech interactive systems have gained increasing importance. Performance of an ASR system mainly depends on the availability of large corpus of speech. The conventional method of building a large vocabulary speech recognizer for any language uses a top-down approach to speech. This approach requires large speech corpus with sentence or phoneme level transcription of the speech utterances. The transcriptions must also include different speech order so that the recognizer can build models for all the sounds present. But, for Telugu language, because of its complex nature, a very large, well annotated speech database is very difficult to build. It is very difficult, if not impossible, to cover all the words of any Indian language, where each word may have thousands and millions of word forms. A significant part of grammar that is handled by syntax in English (and other similar languages) is handled within morphology in Telugu. Phrases including several words (that is, tokens) in English would be mapped on to a single word in Telugu.Telugu language is phonetic in nature in addition to rich in morphology. That is why the speech technology developed for English cannot be applied to Telugu language. This paper highlights the work carried out in an attempt to build a voice enabled text editor with capability of automatic term suggestion. Main claim of the paper is the recognition enhancement process developed by us for suitability of highly inflecting, rich morphological languages. This method results in increased speech recognition accuracy with very much reduction in corpus size. It also adapts Telugu words to the database dynamically, resulting in growth of the corpus.
An Improved Approach for Word Ambiguity RemovalWaqas Tariq
Word ambiguity removal is a task of removing ambiguity from a word, i.e. correct sense of word is identified from ambiguous sentences. This paper describes a model that uses Part of Speech tagger and three categories for word sense disambiguation (WSD). Human Computer Interaction is very needful to improve interactions between users and computers. For this, the Supervised and Unsupervised methods are combined. The WSD algorithm is used to find the efficient and accurate sense of a word based on domain information. The accuracy of this work is evaluated with the aim of finding best suitable domain of word. Keywords: Human Computer Interaction, Supervised Training, Unsupervised Learning, Word Ambiguity, Word sense disambiguation
Parameters Optimization for Improving ASR Performance in Adverse Real World N...Waqas Tariq
From the existing research it has been observed that many techniques and methodologies are available for performing every step of Automatic Speech Recognition (ASR) system, but the performance (Minimization of Word Error Recognition-WER and Maximization of Word Accuracy Rate- WAR) of the methodology is not dependent on the only technique applied in that method. The research work indicates that, performance mainly depends on the category of the noise, the level of the noise and the variable size of the window, frame, frame overlap etc is considered in the existing methods. The main aim of the work presented in this paper is to use variable size of parameters like window size, frame size and frame overlap percentage to observe the performance of algorithms for various categories of noise with different levels and also train the system for all size of parameters and category of real world noisy environment to improve the performance of the speech recognition system. This paper presents the results of Signal-to-Noise Ratio (SNR) and Accuracy test by applying variable size of parameters. It is observed that, it is really very hard to evaluate test results and decide parameter size for ASR performance improvement for its resultant optimization. Hence, this study further suggests the feasible and optimum parameter size using Fuzzy Inference System (FIS) for enhancing resultant accuracy in adverse real world noisy environmental conditions. This work will be helpful to give discriminative training of ubiquitous ASR system for better Human Computer Interaction (HCI). Keywords: ASR Performance, ASR Parameters Optimization, Multi-Environmental Training, Fuzzy Inference System for ASR, ubiquitous ASR system, Human Computer Interaction (HCI)
Walmart Business+ and Spark Good for Nonprofits.pdfTechSoup
"Learn about all the ways Walmart supports nonprofit organizations.
You will hear from Liz Willett, the Head of Nonprofits, and hear about what Walmart is doing to help nonprofits, including Walmart Business and Spark Good. Walmart Business+ is a new offer for nonprofits that offers discounts and also streamlines nonprofits order and expense tracking, saving time and money.
The webinar may also give some examples on how nonprofits can best leverage Walmart Business+.
The event will cover the following::
Walmart Business + (https://business.walmart.com/plus) is a new shopping experience for nonprofits, schools, and local business customers that connects an exclusive online shopping experience to stores. Benefits include free delivery and shipping, a 'Spend Analytics” feature, special discounts, deals and tax-exempt shopping.
Special TechSoup offer for a free 180 days membership, and up to $150 in discounts on eligible orders.
Spark Good (walmart.com/sparkgood) is a charitable platform that enables nonprofits to receive donations directly from customers and associates.
Answers about how you can do more with Walmart!"
How to Make a Field Mandatory in Odoo 17Celine George
In Odoo, making a field required can be done through both Python code and XML views. When you set the required attribute to True in Python code, it makes the field required across all views where it's used. Conversely, when you set the required attribute in XML views, it makes the field required only in the context of that particular view.
This slide is special for master students (MIBS & MIFB) in UUM. Also useful for readers who are interested in the topic of contemporary Islamic banking.
A workshop hosted by the South African Journal of Science aimed at postgraduate students and early career researchers with little or no experience in writing and publishing journal articles.
Chapter wise All Notes of First year Basic Civil Engineering.pptxDenish Jangid
Chapter wise All Notes of First year Basic Civil Engineering
Syllabus
Chapter-1
Introduction to objective, scope and outcome the subject
Chapter 2
Introduction: Scope and Specialization of Civil Engineering, Role of civil Engineer in Society, Impact of infrastructural development on economy of country.
Chapter 3
Surveying: Object Principles & Types of Surveying; Site Plans, Plans & Maps; Scales & Unit of different Measurements.
Linear Measurements: Instruments used. Linear Measurement by Tape, Ranging out Survey Lines and overcoming Obstructions; Measurements on sloping ground; Tape corrections, conventional symbols. Angular Measurements: Instruments used; Introduction to Compass Surveying, Bearings and Longitude & Latitude of a Line, Introduction to total station.
Levelling: Instrument used Object of levelling, Methods of levelling in brief, and Contour maps.
Chapter 4
Buildings: Selection of site for Buildings, Layout of Building Plan, Types of buildings, Plinth area, carpet area, floor space index, Introduction to building byelaws, concept of sun light & ventilation. Components of Buildings & their functions, Basic concept of R.C.C., Introduction to types of foundation
Chapter 5
Transportation: Introduction to Transportation Engineering; Traffic and Road Safety: Types and Characteristics of Various Modes of Transportation; Various Road Traffic Signs, Causes of Accidents and Road Safety Measures.
Chapter 6
Environmental Engineering: Environmental Pollution, Environmental Acts and Regulations, Functional Concepts of Ecology, Basics of Species, Biodiversity, Ecosystem, Hydrological Cycle; Chemical Cycles: Carbon, Nitrogen & Phosphorus; Energy Flow in Ecosystems.
Water Pollution: Water Quality standards, Introduction to Treatment & Disposal of Waste Water. Reuse and Saving of Water, Rain Water Harvesting. Solid Waste Management: Classification of Solid Waste, Collection, Transportation and Disposal of Solid. Recycling of Solid Waste: Energy Recovery, Sanitary Landfill, On-Site Sanitation. Air & Noise Pollution: Primary and Secondary air pollutants, Harmful effects of Air Pollution, Control of Air Pollution. . Noise Pollution Harmful Effects of noise pollution, control of noise pollution, Global warming & Climate Change, Ozone depletion, Greenhouse effect
Text Books:
1. Palancharmy, Basic Civil Engineering, McGraw Hill publishers.
2. Satheesh Gopi, Basic Civil Engineering, Pearson Publishers.
3. Ketki Rangwala Dalal, Essentials of Civil Engineering, Charotar Publishing House.
4. BCP, Surveying volume 1
How to Add Chatter in the odoo 17 ERP ModuleCeline George
In Odoo, the chatter is like a chat tool that helps you work together on records. You can leave notes and track things, making it easier to talk with your team and partners. Inside chatter, all communication history, activity, and changes will be displayed.
This document provides an overview of wound healing, its functions, stages, mechanisms, factors affecting it, and complications.
A wound is a break in the integrity of the skin or tissues, which may be associated with disruption of the structure and function.
Healing is the body’s response to injury in an attempt to restore normal structure and functions.
Healing can occur in two ways: Regeneration and Repair
There are 4 phases of wound healing: hemostasis, inflammation, proliferation, and remodeling. This document also describes the mechanism of wound healing. Factors that affect healing include infection, uncontrolled diabetes, poor nutrition, age, anemia, the presence of foreign bodies, etc.
Complications of wound healing like infection, hyperpigmentation of scar, contractures, and keloid formation.
This presentation was provided by Steph Pollock of The American Psychological Association’s Journals Program, and Damita Snow, of The American Society of Civil Engineers (ASCE), for the initial session of NISO's 2024 Training Series "DEIA in the Scholarly Landscape." Session One: 'Setting Expectations: a DEIA Primer,' was held June 6, 2024.
Reimagining Your Library Space: How to Increase the Vibes in Your Library No ...Diana Rendina
Librarians are leading the way in creating future-ready citizens – now we need to update our spaces to match. In this session, attendees will get inspiration for transforming their library spaces. You’ll learn how to survey students and patrons, create a focus group, and use design thinking to brainstorm ideas for your space. We’ll discuss budget friendly ways to change your space as well as how to find funding. No matter where you’re at, you’ll find ideas for reimagining your space in this session.
LAND USE LAND COVER AND NDVI OF MIRZAPUR DISTRICT, UPRAHUL
This Dissertation explores the particular circumstances of Mirzapur, a region located in the
core of India. Mirzapur, with its varied terrains and abundant biodiversity, offers an optimal
environment for investigating the changes in vegetation cover dynamics. Our study utilizes
advanced technologies such as GIS (Geographic Information Systems) and Remote sensing to
analyze the transformations that have taken place over the course of a decade.
The complex relationship between human activities and the environment has been the focus
of extensive research and worry. As the global community grapples with swift urbanization,
population expansion, and economic progress, the effects on natural ecosystems are becoming
more evident. A crucial element of this impact is the alteration of vegetation cover, which plays a
significant role in maintaining the ecological equilibrium of our planet.Land serves as the foundation for all human activities and provides the necessary materials for
these activities. As the most crucial natural resource, its utilization by humans results in different
'Land uses,' which are determined by both human activities and the physical characteristics of the
land.
The utilization of land is impacted by human needs and environmental factors. In countries
like India, rapid population growth and the emphasis on extensive resource exploitation can lead
to significant land degradation, adversely affecting the region's land cover.
Therefore, human intervention has significantly influenced land use patterns over many
centuries, evolving its structure over time and space. In the present era, these changes have
accelerated due to factors such as agriculture and urbanization. Information regarding land use and
cover is essential for various planning and management tasks related to the Earth's surface,
providing crucial environmental data for scientific, resource management, policy purposes, and
diverse human activities.
Accurate understanding of land use and cover is imperative for the development planning
of any area. Consequently, a wide range of professionals, including earth system scientists, land
and water managers, and urban planners, are interested in obtaining data on land use and cover
changes, conversion trends, and other related patterns. The spatial dimensions of land use and
cover support policymakers and scientists in making well-informed decisions, as alterations in
these patterns indicate shifts in economic and social conditions. Monitoring such changes with the
help of Advanced technologies like Remote Sensing and Geographic Information Systems is
crucial for coordinated efforts across different administrative levels. Advanced technologies like
Remote Sensing and Geographic Information Systems
9
Changes in vegetation cover refer to variations in the distribution, composition, and overall
structure of plant communities across different temporal and spatial scales. These changes can
occur natural.
Leveraging Generative AI to Drive Nonprofit InnovationTechSoup
In this webinar, participants learned how to utilize Generative AI to streamline operations and elevate member engagement. Amazon Web Service experts provided a customer specific use cases and dived into low/no-code tools that are quick and easy to deploy through Amazon Web Service (AWS.)
Film vocab for eal 3 students: Australia the movie
Automatic E-Comic Content Adaptation
1. Kohei Arai & Herman Tolle
International Journal of Ubiquitous Computing (IJUC) Volume (1), Issue (1) 1
Automatic E-Comic Content Adaptation
Kohei Arai arai@is.saga-u.ac.jp
Information Science Department
Saga University
Saga, 840-0027, Japan
Herman Tolle emang@brawijaya.ac.id
Software Engineering Department
Brawijaya University
Malang, 65145, Indonesia
Abstract
Reading digital comic on mobile phone is demanding now. Instead of create a
new mobile comic contents, adaptation of the existing digital comic web portal is
valuable. In this paper, we proposed an automatic e-comic mobile content
adaptation method for automatically create mobile comic content from existing
digital comic website portal. Automatic e-comic content adaptation is based on
our comic frame extraction method combine with additional process to extract
comic balloon and text from digital comic page. The proposed method work as a
content adaptation intermediary proxy server application, while generating a
Comic XML file as an input source for mobile phone to render a specific mobile
comic contents. Our proposed method is an effective and efficient method for
real time implementation of reading e-comic comparing to other methods.
Experimental results show that our proposed method has 100% accuracy of flat
comic frame extraction, 91.48% accuracy of non-flat comic frame extraction, and
about 90% processing time faster than previous method.
Keywords: E-comic, Content Adaptation, Comic Frame Extraction, Text Extraction, Mobile
Application
1. INTRODUCTION
Reading comic is one of popular thing in the world, especially in Japan. Everyday hundreds of
printed comic book is produced and most of printed comic book then digitized into web contents
for reading comic through the internet. As the usage of mobile device such mobile phone, PDA
and laptops growth, reading comic through mobile device is also demanding. The recent trend is
that comic content are largely demanded and became one of the most popular and profitable
mobile contents. The challenge in providing mobile comic contents for small screen devices is
how to separate comic frames and display it in the right order to read. However, the existing
mobile comic content is mainly produced manually or automatically from offline comic book.
Instead of create a new mobile content from digitized comic book in offline way, we propose a
new method for automatically adapting digitized comic page from existing website into mobile
comic content.
Several research projects [1-5], proposed systems that automatically convert web-based
documents that were designed for desktop into appropriate format for viewed in mobile devices.
2. Kohei Arai & Herman Tolle
International Journal of Ubiquitous Computing (IJUC) Volume (1), Issue (1) 2
In [6], propose the concept of automatic mobile content conversion using semantic image
analysis, but this method still using offline comic book as a comic page sources. In [6], authors
propose automatic content conversion (ACC) ontology that using X-Y recursive cut algorithm for
extracting comic frame. Like other method on comic frame extraction [7-10], those methods
cannot detect frames when the comic balloon or picture is drawn over the frames. Then Tanaka
proposes layout analysis of comic page using density gradient method [11], which applied to
comic page with balloons or pictures drawn over the frames. However, in [11] method has some
limitation in processing of comic image and not sufficient for real time application since
computation of the process. Also success rate of frame extraction and processing time should be
improved.
In this paper, an approach for automatically adapting existing online digital comic content – or
electronic comic (e-comic) - into mobile comic contents based on automatic comic frame
extraction (ACFE) method [13] is presented. We propose a new method for automatically
extracting comic frame and frame contents such us comic balloon and text inside balloon from e-
comic page. Comic frame contents such us balloon and text inside balloon is extracted for further
purpose, for example for language translation, multimedia indexing or data mining. Our propose
method is an efficient and effective comic content adaptation method that sufficient for real time
online implementation. The experimental results of our method had shown the better results on
accuracy and processing time comparing with other methods.
The reminder of this paper is organized as follows. In section 2, a detail description of the
proposed method is given. Section 3 and 4 describe the detail process on frame content
extraction and e-comic reader application. Experimental results with comparing to conventional
method are presented in Section 5. Finally, conclusions are drawn in Section 6.
2. E-COMIC CONTENT ADAPTATION SYSTEM
Figure 1 shows the illustration of automatic e-comic content adaptation system. There are 3 parts
involved in the concept of content adaptation systems [3], part A is a content provider, part B is
an intermediary proxy server application, and part C is a mobile terminal. The concept of using
content adaptation intermediary proxy server is related with current web technology and device
independent paradigm [3]. Intermediary proxy server application will automatically adapt the
comic page from existing e-comic website into mobile content specific to display on user mobile
devices.
FIGURE 1: Illustration of E-Comic Content Adaptation System
3. Kohei Arai & Herman Tolle
International Journal of Ubiquitous Computing (IJUC) Volume (1), Issue (1) 3
FIGURE 2: Architecture of E-Comic Content Adaptation System
2.1 E-Comic Content Adaptation Intermediary Proxy Server
Figure 2 shows the architecture of the automatic e-comic content adaptation systems. The
process begins when a user (part C) uses a mobile device submit a request to the system—that
is, to the content provider via an intermediary proxy server. After that, system connected to
content provider for getting the comic page and then precedes the content adaptation to generate
mobile specific content before deliver it to user. Architecture of e-comic content adaptation
intermediary proxy server system consist 4 main parts as follows:
Comic Image Extraction, comic page is grabbed from existing e-comic websites through
HTTP connection and HTML parsing process. Database is needed to store information about
comic portal URL and data about comic pages.
Comic Content Extraction, useful information is extracted from a single comic page. The
process of detecting and extracting the information about frame position, balloon position
and text position based on e-comic content extraction method. Firstly, comic frames are
extracted from comic page, then comic balloon is extracted from each frame, and the last is
extracting text from each balloon.
Comic Content Trans-coding, transform the source comic page into mobile specific content.
There are 2 modes in our transcoding system: image transcoding and information
transcoding, describe further on next sub chapter. In this part, also text image from previous
process will recognizing as text using text recognition process. Extracted text from comic
page is useful for language translation using Google translation services, data mining or
multimedia indexing.
Mobile Comic Content Generator, based on transcoding mode chooses by user, mobile
content generator part will automatically create the output as mobile content to user mobile
device. Output is Comic XML files for data and combine with XHTML mobile profile (MP) for
presentation. After a complete process, system will store the data about comic page and
adapted results to database for future usage. When another user requests the same comic
page, system will only responds with stored data from database without any processing to
reduce server load.
4. Kohei Arai & Herman Tolle
International Journal of Ubiquitous Computing (IJUC) Volume (1), Issue (1) 4
2.2 Comic Content Adaptation Strategies
Although data rate and file size is not a significant issue in recent internet wireless technology, we
design our content adaptation system to support user with low speed internet connection likewise
to user with high speed of internet connection. We design our system for generate adaptation
results in 2 modes: image transcoding and information transcoding. Those two different
adaptation modes are processed within comic content transcoding and mobile content generator
parts.
Image transcoding mode, system will reproduce comic frame content as new images with
special treatment to fulfill user device requirement, for example: image resizing, color depth
reduction or image cropping. Reproduction comic page is stored in proxy server and replace
the original comic page. Image transcoding mode is useful for the user with limited internet
connection. In this mode, output of the systems is Comic XML files for data combines with
XHTML MP for presentation, and also generates new comic frame images.
Information transcoding mode, system will produce only XML text files that store information
about extracted frame content. Information transcoding mode is designed for user with high
speed internet connection through wireless connection, because user device will display
comic page in frame by frame using original comic page image. In this mode, an output of
the systems is Comic XML files for data combines with XHTML MP for presentation. This
Comic XML only contains information about comic frame content location within comic page.
2.3 E-Comic XML
Information about comic content is generated automatically and store in XML file for usage on
mobile phone to render a comic content. Our E-Comic XML is improved from ComicsML version
0.2 by Jason McIntosh [12]. New Comic XML included the layout information of comic frame,
balloon and text, which is not exist before. The layout information getting from comic frame
content extraction process is useful for frame by frame displaying on user mobile devices. In
information transcoding mode, layout information of comic frame content is stored as information
of rectangle start point (x1, y1) and end point (x2, y2) of frame’s blob, balloon’s blob or text’s blob.
In image transcoding mode, layout information is no need but URL location of new images of
frames, balloons or texts. Figure 3 show the data structure define in document type definition
(DTD) of E-Comic XML.
<?xml version="1.0"?>
<!ELEMENT comic(title, url,
readingorder?, language?, person+,
icon?, description?, panels*)>
<!ELEMENT title (#PCDATA)>
<!ELEMENT url(#PCDATA)>
<!ELEMENT creator(#PCDATA)>
<!ELEMENT readingorder(#PCDATA)>
<!ELEMENT language (#PCDATA)>
<!ELEMENT panels (number, panel+)>
<!– Information about frame -->
<!ELEMENT number (#PCDATA)>
<!ELEMENT text (#PCDATA)>
<!ELEMENT panel (order, panelurl,
panelpos*, balloons*)>
<!ELEMENT order (#PCDATA)>
<!ELEMENT panelurl (#PCDATA)>
<!ELEMENT panelpos (posx1, posy1,
posx2, posy2)>
<!– Information about balloon -->
<!ELEMENT balloons (balloon*)>
<!ELEMENT balloon(text?, textpost*)>
<!ELEMENT text (#PCDATA)>
<!ELEMENT textpos (posx1, posy1,
posx2, posy2)>
FIGURE 3: Document Type Definition (DTD) of E-Comic XML,
3. E-COMIC CONTENT EXTRACTION
E-comic frame content extraction is based on our previous research work on automatic comic
scene frame extraction [13]. For each comic page, we extract frames and then checking if any
overlapped frames situated on the extracted frames. If overlapped frames detected, then system
precede the overlapped frame division process. After all frame extracted, then balloons within
5. Kohei Arai & Herman Tolle
International Journal of Ubiquitous Computing (IJUC) Volume (1), Issue (1) 5
frame and texts within balloons are processed. All process is done base on modified of connected
component labeling (CCL) algorithm [14] as our comic blob extraction function.
3.1 Comic Frame Extraction
Common comic frames are separated by white pixel line or white region, so the rest of white pixel
region must be the frames. While the conventional method [7-11] tries to track the white line, our
method finds the rest area of white line. We investigated many traditional and conventional
comics those in case of there is no balloon or comic art is overlapped on frames - it is called ‘flat
comic’ hereafter-, each frame can be detected as a single blob object. In our propose method, we
define all connected white pixels as a single blob object, and then each comic frames can be
identified as an individual blob object.
We modify connected component labeling algorithm [14] for specific function on comic frame blob
extraction. Figure 4.a show the flow diagram of the process of modified CCL for comic frame blob
extraction function and Figure 4.b shows the results in step-by-step basis. Firstly, binarization is
applied to converting color comic images into black and white images. Binarization with an
appropriate threshold number will produce each frame as separate blobs. The heuristic value of
threshold is 250 (for the images with quantization bits of 8 bits) that chosen empirically based on
experiments. After that, color inversion is done to switch color between blobs and background,
because our blob extraction method assume black pixel as background color. Then blob
detection process generates blob object from each connected pixels. Last process is frame blob
selection to select only blob with minimal size that determine as comic frame. The minimal size of
selected frame blob is one sixth of the image size ([Image.Width/6] x [Image.Height/8]).
The proposed methodology has 100% of success rate for extract comic frames from complete flat
comic page like comic “Nonono Volume 55” and other flat comic pages that we use in our
experiments. The proposed method also can easily detect frames in comic pages that contain
only one frame, which is problem in Tanaka’s [11] method. The modified CCL for comic frame
blob extraction method is, however, not perfect because comic image includes not only ‘flat’
frame but also more complicated frame images those are overlapped with other comic balloons or
comic arts. Then we improved our comic frame extraction method with overlapped frame
checking and extraction using division line detection method.
Pre ProcessingInput Comic
Page Image
Threshold
Invert
Blob Detection
using modified CCL
Frame Blob Extraction
From original image
Frame Blob Selection
(a) (b)
FIGURE 4: Flow diagram of comic frame extraction using comic blob extraction method,
(b). Step-by-step process and result on frame extraction
3.2 Overlapped Frame Extraction using Division Line Detection
Using only blob extraction method, overlapped frames are not detected and will recognize as
single frame. So, each frame should pass the overlapped frame checking process to detect the
occurrence of division line between frames. If the division lines detected, then we will add new
white line overlaid to create separate line between overlapped frames. Then overlapped frame
can be extracted using our base function on blob extraction method.
6. Kohei Arai & Herman Tolle
International Journal of Ubiquitous Computing (IJUC) Volume (1), Issue (1) 6
The division line detection methods work by detecting the appearance of white area within a thick
line that assumed as frame border line. For example, it is assumed that two frames are situated
at the top and the bottom and overlapped by a comic balloon. The overlapped frame extraction
process step is as follows:
1. Find the left and right frame border line, indicated by the Xth line with maximum number of
black pixel, selected as candidate border line. X1 in the left side and X2 in the right side.
2. Find white area within along of the candidate border line (X1 and X2)
3. Decide one point in the white area of line X1 as Y1 and in line X2 as Y2. Thus we have P1 (X1,
Y1) and P2(X2, Y2.)
4. Add a white pixel line between P1 to P2 as frame separator line.
5. Implement blob extraction method to separate two frames.
First, we try to detect border line by investigate on the edge area of comic page. Assume that
edge area is N far from the edge, where N is empirically equal than one fifth of the page width.
Estimated frame border line is determined from the line with maximum number of black pixel
occurrence. After the candidate borderline, X1 and X2 are nominated, white pixel region within the
lines is investigated. If X1 and X2 is real frame border of the images, it is possible to detect the
division line between that indicated by the occurrence of white pixel areas within X1 or X2 lines.
Thus one point in left side P1 (X1, Y1) and the other one point in right side P2 (X2, Y2.) determined.
The line that connects P1 and P2 is estimated as separation line. Figure 5 shows the illustration of
division line detection process and addition of separator line.
After two points detected, addition of a new white line between P1 to P2 will create separate top
frame and bottom frame as two blobs. Then, using our blob extractions functions will successfully
extracting two connected frames. For two frames connected in horizontal direction, do the same
process while change the direction of border searching to top and bottom of the image. This
method can also work well for comic art with straight division line in specific angle.
3. Frame division point candidate
N 1. Selected line area
2. Border line candidate
4. White line added as separator
X
Y
P1 (X1, Y1) P2 (X2, Y2)
FIGURE 5: Division Line detection and line adding process.
(Source Images: Dragon Ball Volume 42 p.113)
3.3 Comic Balloon Detection
Comic balloon detection method is needed for correct the overlapped frame separation and for
the purpose of text extraction process. However, while the additional of a white line in between of
two intersections frames can separate frames properly; it sometime appears that intersection of
content like balloon text is cut off. Therefore, it cannot be read properly. In order to overcome this
situation, comic balloon detection method is proposed to detect comic balloon text areas that are
7. Kohei Arai & Herman Tolle
International Journal of Ubiquitous Computing (IJUC) Volume (1), Issue (1) 7
situated in between two comic frames. If a comic balloon detected in between of two frames, then
the area of this balloon will add to one of the intersection frames where the balloon area is
situated more than 50%.
The method for balloon detection is similar with frame blob extraction method but without
inversion process. In typical comic images, balloon text usually has a white background. So,
using base blob extraction method without inversion can detect comic balloon as a white pixel
area. Balloon blob selection is base on 3 rules for classification as follows:
1. Minimal size of the blob is about [Image.Width]/10 and [Image.Height]/8 of frame image size.
2. Minimal number of white pixels in blob is 45% of blob area.
3. At least one text image is detected.
3.4 Text Detection and Extraction
Text extraction method is proposed for extracting text content from a comic balloon. The method
for extract text from a balloon is also base on same method with frame extraction and balloon
detection method. First, we implement modified CCL with morphology filter on pre-processing to
make near word image collide as a single blob. In pre-processing, erosion and opening filter is
applied with left and right side priority rather than top and bottom side. Balloon text blob selection
base on some rules for classification as follows:
1. Minimal size of the blob is 40 pixel width and height
2. Ignored all blobs that related with border of balloon, approximately 5 pixels far from the
balloon edge.
Figure 6 show the results sample on extraction of comic frame (a), comic balloon (b) and comic
text inside balloon (c) from “Dragon Ball Chapter 192” comic. Frame and balloons are extracted in
rectangle area while text is also extracted in rectangle area in the size of each word.
FIGURE 6: Result samples on (a) Frame Extraction, (b) Balloon Extraction, (c) Text Extraction
(Source Images: Dragon Ball Chapter 192 p.10)
4. ONLINE E-COMIC READER
Online e-comic reader is a special application for mobile devices that separated from comic
content adaptation systems. People can build their own application for reading comic on mobile
phone as long as they can interpret e-comic xml file into mobile comic application. That is major
point in content adaptation method when intermediary proxy server content adaptation is applied.
Figure 7 shows our simple e-comic reader application on PDA to display comic page in frame by
frame basis. Comic image is relatively convenient to read in each frame image size rather that
whole comic page size. The illustration of an online e-comic reader application with special
features for language translation is shown in Figure 8. We can combine comic reader application
with Google language translation features to generate language translation of comic from XML
files.
(a) Example of Extracted Frame
Balloon 1 Balloon 2 Balloon 3
(b) Example of Extracted Balloons from Frame
Word 1 Word 2 Word 3
(c) Example of Extracted Text Image from Balloon 3
8. Kohei Arai & Herman Tolle
International Journal of Ubiquitous Computing (IJUC) Volume (1), Issue (1) 8
FIGURE 7: E-Comic Reader Application on PDA
(Source Images: Zettai Kareshi Manga, Vol. 1, p.171)
FIGURE 8: Illustration of Online E-Comic Reader Application with Language Translation Features
(Source Images: Garfield Comic from Gocomics)
5. EXPERIMENTS
The proposed methodology for automatic e-comic content adaptation has been evaluated using
various comic image pages in offline and online situation. We implement the proposed method in
real time online and offline situation using Microsoft.Net environment with C# as native language
for proxy server application and frame content extraction process. We use desktop computer with
Pentium Dual Core processor and 1 Mbyte of RAM. Experiment is conducting through 634 comic
pages to evaluate the success rate (accuracy) of frame extraction and processing time. Common
comic image size that we use in our experiment is 800x1200 pixels. The results of the experiment
then reported and compared with other methods.
5.1 Comic Frame Extraction Experimental Results
Experimental result of frame extraction method has shown in Table 1. The results were classified
into 3 groups such as “correctly extraction”, “missed detection” and “false detection”. The term
correctly extraction means the success frame extraction without error. The terms “missed
detection” means that system cannot extract overlapped frames, and the terms “false detection”
means that some non frame detected as a frame. From the experimental results, 91.48% average
of success rate of comic frame extraction is achieved.
9. Kohei Arai & Herman Tolle
International Journal of Ubiquitous Computing (IJUC) Volume (1), Issue (1) 9
TABLE 1: Frame extraction experimental results for 634 pages from 5 comic sources.
Digital Comic
Sources
Total
Pages
Correctly
Extraction
Missed
Detection
False
Detection
Success
Rate (%)
Dragon Ball Vol 40 175 161 12 2 92.00
Dragon Ball Vol 42 237 218 10 9 91.98
One Piece Vol 1 191 171 20 0 89.53
Nonono Vol 55 18 18 0 0 100.00
Dragon Ball Ch 196 13 12 1 0 92.31
Total 634 580 46 8 91.48
An experimental result for comparison with Tanaka’s [11] method has shown in Table 2. In our
experiments we also include some particular pages in main volume were Tanaka exclude it. So,
the total number of tested images (one image is one comic page) was different. The results were
classified into 5 groups such as “Succeeded”, “Not succeeded”, “Not tested”, “Total pages tested”
and “Total pages”. The term “Succeeded” means the total pages of success on frame extraction.
The term “Not succeeded” means the total page of failure for frame extraction. The term “Not
tested” means number of pages that not include in testing process. The term “Total Pages”
means the total number of comic page of the comic.
TABLE 2: Experimental comparison results with Tanaka’s [11] method for the
Dragon Ball Volume 42 comic image sources
Classification of
Results
Comic Page
Tanaka’s
Method Our Method
Succeeded 195 / 82% 218 / 92%
Not succeeded 22 19 / 8%
Not tested 20 0
Total Page Tested 217 237
Total Page 237 237
Our method is better than Tanaka’s method as shown on the experimental result in Table 2. By
using the same comic source images, our method is 10% better than Tanaka’s methods. Our
methods also need less computation process because the efficiently of division line detection
algorithm and blob extraction method. Once blob extraction function created, then it reused in
frame extraction, balloon extraction and balloon text extraction.
5.2 Comic Balloon and Text Extraction Experimental Results
Performance evaluation of proposed methods on balloon detection and text extraction is evaluate
for 13 comic pages from Dragon Ball Chapter 196 comic pages. Experimental result of frame
extraction method has shown in Table 3. The results were classified into 3 groups such as
“correctly extraction”, “missed detection” and “false detection”. The term correctly extraction
means the success of balloon detection or text extraction without error. The terms missed
detection means that system cannot detect balloon or text. The terms false detection means that
some non balloon or non text detected. From the experimental results, 90.70% of success rate of
comic balloon detection method and 93.63% of success rate of comic text extraction methods is
achieved.
TABLE 3: Comic Balloon and Text Extraction experimental results
Comic Content Total
Correctly
Extraction
Missed
Detection
False
Detection
Success
Rate (%)
Balloon Detection 121 86 8 2 90.70
Text Extraction 314 294 20 8 93.63
10. Kohei Arai & Herman Tolle
International Journal of Ubiquitous Computing (IJUC) Volume (1), Issue (1) 10
5.3 Evaluation of Processing Time
Time consuming in processing is main issue in real time online application. We evaluate
processing time of our method in offline and online simulation. In offline simulation, processing
time of each process on comic frame extraction is evaluated. In online simulation, total time for
processing all process is evaluated, including: comic image parsing processing, comic content
frame extraction and output generating. However, in online situation experiment, we do not
counting the time consuming for access or downloading the comic image file, and also without
text recognition and image reproducing process. The result of processing time evaluation is
shown in Table 4, with comparison with other method. Processing time experimental results
shows that our proposed method is faster that other method. Comparing with [6], processing time
of our method is about 90% faster than [6]. Online situation need more processing time
consuming rather than offline situation because of another processing within the systems, but still
acceptable as online application.
TABLE 4: Processing time experimental results and comparison
Comic Page
Processing Time (in seconds)
[6] [10]
Our
Method
1 comic page offline 3 25 0.250
1 comic page online - - 0.513
30 comic page offline 90 750 10
30 comic page online - - 16
6. CONCLUSION
We implemented a system for automatically adapt e-comic content for reading comic on mobile
devices. We proposed frame content extraction method and intermediary content adaptation
proxy server systems with new E-Comic XML. Comic frame content extraction method is based
on blob extraction method using modified of connected component labeling algorithm. The
proposed method on frame extraction does work in a real time basis so that it is possible to adapt
relatively large scale of existing digital comic image to comparatively small screen size of mobile
terminals by displaying extracted images onto the screen by frame-by-frame. It is still rather
difficult to detect balloons, images, and characters those are situated in between frames. The
proposed method allows detection of these and separates the different frames even if these
balloons, images, and characters are exist.
The proposed method has produced better results in frame extraction method and executes
faster than other methods. From the experimental results, our comic frame extraction method has
100% accuracy for flat comic and 91.48% accuracy for non-flat comic, while balloon detection
method achieves 90.7% accuracy and text extraction method achieves 93.63% accuracy. Our
comic frame extraction method has 10% improvement of [11] method and about 90% processing
time improvement of [6] methods. Our comic frame extraction method is an efficient and effective
methods comparing to conventional method, and applicable for real time online e-comic content
adaptation application.
Our system is designed to be adaptable with old and new mobile technologies, because it can
create mobile comic content based on user’s profile. Our system provides 2 mode of content
adaptation, image transcoding mode for old mobile devices with limited internet connection, and
information transcoding for new mobile devices with high speed internet connection. Also, our
system creates e-comic xml files that are being able to use by third party companies to develop
their own application of e-comic reader. The future direction of this research work is to provide a
robust algorithm for extraction e-comic content and automatically convert it into mobile specific
content. The accuracy of comic frame extraction and text extraction should be improved and
needs further exploration. By utilizing the results of our study and further exploration, the real
11. Kohei Arai & Herman Tolle
International Journal of Ubiquitous Computing (IJUC) Volume (1), Issue (1) 11
implementation of online reading of existing e-comic on mobile phone can immediately be
realized.
7. REFERENCES
1. Chen, Y., Ma, W.Y., Zhang, H.J.: “Detecting Web Page Structure for Adaptive Viewing on
Small Form Factor Devices”. In Proceedings of the International WWW Conference
Budapest, Hungary, 2003
2. Wai Yip Lum, Francis C.M. Lau, "A Context-Aware Decision Engine for Content
Adaptation". IEEE Pervasive Computing, 1(3): 41-49, 2006
3. Laakko et al., “Adapting Web Content to Mobile User Agents”, IEEE Internet Computing,
9(2):46-53, 2005
4. Dongsong Zhang, “Web content adaptation for mobile handheld devices”, Communications of
the ACM, 50(2):75-79, February 2007
5. Hsiao J.-L., Hung H.-P., Chen M.-S., “Versatile Transcoding Proxy for Internet Content
Adaptation”, IEEE Trans. on Multimedia, 10(4):646--658, June 2008.
6. Eunjung Han, et.al. “Automatic Mobile Content Conversion Using Semantic Image Analysis”,
Human-Computer Interaction HCI Intelligent Multimodal Interaction Environments, LNCS
4552, Springer, Berlin, 2007
7. Ono Toshihiko. “Optimizing two-dimensional guillotine cut by generic algorithms”. In
Proceedings of the Ninth AJOU-FIT-NUST Joint Seminar, pages 40-47, July 1999.
8. Yamada, M., Budiarto, R. and Endoo, M., “Comic image decomposition for Reading comics
on cellular phones”. IEICE transaction on information and systems, E-87-D(6):1370-1376,
June 2004.
9. D. Ishii, K. Kawamura, H. Watanabe, "A Study on a Fast Frame Decomposition of Comic
Images," National Convention of IPSJ, 1P-2, March 2007
10. Chung HC, Howard L., T. Komura, “Automatic Panel Extraction of Color Comic Images”,
Advances in Multimedia Information Processing – PCM 2007, LNCS 4810, Springer, Berlin,
2007
11. Tanaka, T., Shoji, K., Toyama, F. And Miyamichi, J.: “Layout Analysis of Tree-Structured
Scene Frames in Comic Images”. In Proceedings of IJCAI 2007, pp. 2885-2890, June 2007.
12. Jason McIntosh. “ComicsML”, an essay in http://www.jmac.org, published 2005. Accessed
March 2008.
13. Kohei, A., Tolle, H. “Method for Automatic E-Comic Scene Frame Extraction for Reading
Comic on Mobile Devices”, In Proceedings of ITNG 2010 Conference, April 2010.
Confirmation accepted
14. F. Chang, C-J. Chen, and C-J. Lu. “A Linear-Time Component-Labeling Algorithm Using
Contour Tracing Technique”, Computer Vision and Image Understanding, 93(2):pp. 206-220,
2004.
15. R. Gonzalez and R. Woods. “Digital Image Processing”, Addison-Wesley Chap.2., Publishing
Company (1992)