Published on

  • Be the first to comment

  • Be the first to like this

No Downloads
Total views
On SlideShare
From Embeds
Number of Embeds
Embeds 0
No embeds

No notes for slide


  1. 1. Harsh Khatter , Brij Mohan Kalra / International Journal of Engineering Research and Applications (IJERA) ISSN: 2248-9622 www.ijera.com Vol. 2, Issue 4, June-July 2012, pp.698-702 Blog Designing and Searching Methodologies: A Review Harsh Khatter*, Brij Mohan Kalra** (Department of Computer Science, Ajay Kumar Garg Engineering College, Ghaziabad, India)ABSTRACT Now, Blogs are getting popular day by day. Learning, analysis and usage of the users interest andBlogs are like an online dairy created by social linkage from the blog is therefore necessary toindividuals and stored on the internet. As Blog is a provide useful search faculty on the blogosphere totype of website, various blogging sites can provide bloggers and revenue generation opportunities likeexcellent information on many topics, although advertising to the blog service providers [3]. The act ofcontent can be subjective. Blogs are one of the main posting to a blog is called blogging and the distributed,components of Web 2.0. Paper consists of collective, and interlinked world of blogging is thedescription of various Blog site designs and blogosphere [4].searching methods with their research gaps. Themajor characteristics and features of blogs are also II. BLOGShighlighted. Blogs are the type of websites. Personal interests create Blogs. Based on the working andKeywords – Blogs, Internet, Searching, Web 2.0, designing of blogs, numbers of characteristics areWeb Tools. defined below. Users can create a new blog post, add blog post, share, rate, and comment the blog posts. ForI. INTRODUCTION all these operations, user has to login first. Purpose of In this growing world, Web services are the Blog is to share ideas and views among a group ofpart of everyone’s life. From the traditional Web 1.0, people all around the world. Intent of Blog is personal.read only web, which only includes chat, email, instant Discussions are done in the form of comments. All themessaging, now switches to a new Web, named Web posts are shown in reverse chronological order i.e.2.0. Web 2.0 consists various tools and services which latest blog post shown on top. There is a list ofprovides read write interface to their users. There are a potential benefits of blogs, which is mentioned below:large number of Web 2.0 tools: Blogs, Discussion  Can promote analogical thinking.Forums, Wikis, Social Networks, Social Bookmarking  Potential for increased access and exposure tosites, Podcasting, Online Communities, RSS and Atom quality information.feeds, and many more. But, apart from all these tools,  Combination of solitary and social interaction.Blogs are the only tool whose intent is personal, even  Can promote critical and analytical thinking.a lot of expertise are also present there to share their  Can promote creative, intuitive and associationalideas and views with other persons of similar interest. thinking (creative and associational thinking in relation to blogs being used as brainstorming tool Blogs are websites that allow one or more and also as a resource for interlinking,individuals to write about things they want to share commenting on interlinked ideas).with others. The universe of all blog sites is referred toas Blogosphere [1]. Blog, a contraction of the term III. REVIEW TO BLOG DESIGNING AND“web log” is a personal online diary that is frequently SEARCHING METHODOLOGIESupdated and intended for public consumption. Now to Beyond serving as online diaries, weblogssome extent it is a type of websites. People usually have evolved into complex social structures. Bloggingcreate a blog as a hobby to share their information and software allows users to publish opinions on any topicexperience on a particular subject. Entries are without any constraints on the predefined schema.commonly displayed in a reverse-chronological order[2]. Blogging software allows users to publish 3.1 Designing of Blogsopinions, views, and ideas on any topic. Analysis of Blogs might be of many types. Personalizedlinkage between blogs has indicated that community Blog is one of the most impressive categories of Blogsforming in blogosphere is not a random process but is where the blog posts shown to the user are of his owna result of shared interests binding bloggers together. interest. Some major works in this area are discussed below. 698 | P a g e
  2. 2. Harsh Khatter , Brij Mohan Kalra / International Journal of Engineering Research and Applications (IJERA) ISSN: 2248-9622 www.ijera.com Vol. 2, Issue 4, June-July 2012, pp.698-702 R. Adhikari et al. mentions that it is easy and Blogosphere and demonstrate how the frameworksimple to create blog posts and their free form and could be used to help clustering blog posts. Evaluationunedited nature have made the blogosphere a rich and of the framework with content-based extendingunique source of data, which has attracted people and approach is done. Experiment results show that thecompanies across disciplines to exploit it for varied framework does help the clustering process [8].purposes. The valuable data contained in posts from alarge number of users across geographic, demographic ZHOU Ping proposed an algorithm ofand cultural boundaries provide a rich data source not personalized blog information retrieval based on user’sonly for commercial exploitation but also for interest model. The paper discusses the systempsychological & sociopolitical research. Basically architecture of personalized blog information retrievalresearchers tried to demonstrate the plausibility of the and studies the identification module of blog webpageidea through clustering and opinion mining [2].experiment on analysis of blog posts on recent socio-political developments in the new democratic republic As per Michael Chau et al., blogs are veryof Nepal; and to elaborate the broader technical dynamic, so it isn’t as straightforward to applyframework & tools required for this kind of analysis traditional Web mining techniques to them. They[1]. suggest that a general blog framework created for different tasks must consists of a blog spider, a blog Similarly CHENG Tao et al. discusses about parser, a blog content analyzer, a blog networkthe Virtual enterprise (VE), which is an effective and analyzer, and a blog visualize [9].collaborative way to jointly face the great pressuresfrom quickly growing globalization and world–wide And a framework, BlogHarvest, for blogmarket competition. Furthermore, a wiki & blog-based mining and search is demonstrated by Joshi et al. Thisknowledge-sharing mechanism and its prototype framework extracts the interests of the blogger, findssystem are designed for supporting enterprises to inter- and recommends blogs with similar topics andcommunicate, share knowledge and manage provides blog oriented search functionality [3].knowledge within a VE environment [5]. 3.2 Searching the blog posts Various models are already evolved related to Over the past decades, various searchingthe blogs and blogosphere. Tse-Ming Tsai et al. techniques are come into existence with the growth ofrecommends applying the three dimensions of value, World Wide Web. From the starting of the Web era,semantic, and the social models to the emerging various searching methods, techniques and types ofBlogosphere and improving the user experience for the searching algorithms are introduced and as perbloggers in gathering the featured items. As per the searching requirement, they are used. There are lots ofprevious works done, approaches discussed may not searching methods in which search can be done onbe comprehensive enough since the way people use keywords, on queries, on topics, on phrases, on pages,blogs continues to evolve [6]. etc. The query based and topic based search is used in forums, whereas the page search or phrase search is Bi Chen et al. proposed three models by used in search engines where the exact finding iscombining content, temporal, social dimensions: the required. Rest of all websites use keyword basedgeneral blogging-behavior model, the profile-based searching. Keyword based searching provides anblogging-behavior model and the socialnetwork and easier way to search the contents on internet. In theprofile-based blogging-behavior model. These models same way, maximum number of websites use keywordare based on two regression techniques: Extreme based searching. A review of searching algorithms andLearning Machine (ELM), and Modified General methods in brief is given below.Regression Neural Network (MGRNN). In paper, theempirical evaluation is done on DailyKos, a political Initially, the searching is done using theblog, one of the largest blogs, which produce good Query Tree. Top-down Approach is followed to searchresults for the most active bloggers and can be used to the results. But it is a traditional method wherepredict blogging behavior [7]. indexing is use to Reduce the Complexity. In this, A* Graph algorithm is used which keeps track on visited Yin ZHANG et al. discussed that Clustered node and distance travelled [10]. After this method,Web pages, such as blog posts, could be used to next approach was Proximity Search. In this method,improve Web search. In the paper, authors proposed analysis of textual proximity of keyword is done.an extending framework using relations in the Focus is on queries based on general relationship 699 | P a g e
  3. 3. Harsh Khatter , Brij Mohan Kalra / International Journal of Engineering Research and Applications (IJERA) ISSN: 2248-9622 www.ijera.com Vol. 2, Issue 4, June-July 2012, pp.698-702among objects where proximity is defined based on of large collections of heterogeneous data is used inshortest paths between objects. Figure 1 shows the this method. Authors implemented it using graphs andworking based on this proximity approach [11]. graph indices. They conclude that this is an efficient & adaptive keyword search of all kinds of data words [18]. Georgia Koutrika et al. discusses the searching of keywords over Structured Data in a cloud. Method uses a coupling of keywords. Tags used in this method are unstructured, whereas, clouds of data contains structured data, say, Data Cloud [19]. In Fig. 1: Proximity Search Design [11]. 2011, an effectively interpreting keyword queries on RDF databases is discussed by Haizhou Fu and To search the content on internet, the method Kemafor Anyanwu. Before this method, heuristicsintroduced was “BANKS: Browsing & Keyword were used for interpreting the keyword queries. ButSearching” [12]. It is best to use for Relational heuristics fails to capture user dependent queries.databases and static data. In this method keyword Here, the sequences of structured queries are used andsearch is used. When we talk about searching the the main work is done by query interpretation. Onlycontent on web, the term semantics of data comes in the Top-k-aware queries are considered and discussedmind. A semantic web portal for ontology searching, approach is called context aware approach [20].ranking and classification is the next approach.Chintan Patel et al. discuss a model for this. Model IV. FINDINGSconsists of crawling & classification of content. Then Blogs are a source of enormous information.on the basis of page rank, Ranking has to be done. For a user it is very hard to get the relevantSearching is based on Context Oriented Query information from the huge network of World WideLanguage and a Machine Interface is well defined in Web. For bloggers and frequent blog readers, it isthe model [13]. The model has been implemented virtually impossible to keep track of the growingusing statistics, recall numbers, etc. Some minor blogosphere and hence a service recommending thechanges has been done in this model which was blogs matching their interests will seek high value.discussed by Xing Jiang and Ah-Hwee Tan. They Blogs are the important source of information, but tointroduced Description Logic and Fuzzy Description get the relevant information in an efficient time is aLogic based on the queries [14]. typical task. Blog mining is an important way for people to extract useful information. Blogs are very In previous methods and approaches to search dynamic, so it isn’t as straightforward to applythe keyword data, the problem was “keyword queries traditional Web mining techniques to them. The goalare week to express”. Gjergji Kasneci et al. discussed is to provide the user with reliable and accurate bloga framework which consists of Data Model, Query information conveniently.language, and Ranking Model. They called it NAGA,Network Assisted Genetic Algorithm. This performed After taking a complete review, the gaps andboth searching and ranking on the data [15]. The major problems are discussed in two parts. First is related tothing is to understanding the user goals for Web the designing and the architecture and working ofSearch. Daniel E. Rose and Danny Levinson discussed Blog. Second is related to the problems and gaps inthree parameters which concentrate on what the user searching of content/blog posts in Blogs.exactly wants. Parameters are Navigation,Informational, and Resource [16]. 4.1 Based on Blog designing Based on the design and the architecture of Keyword search queries might be in the blog, the efficient way to use blogs are asstructured, unstructured or semi structured form. For personalized blogs where the blog posts shown to theunstructured queries, Pavel Calado et al. suggest user are as per his own interest, irrelevant posts are notBayesian Networks. They suggest a Bayesian network shown to the user. There is no as such system, whichapproach to searching web databases through integrates both, an individual blog as well as a blogKeyword-based queries [17]. Guoliang Li et al. search engine. This kind of integration provides ansuggested an efficient 3-in-1 keyword search method additional facility to the user, which improves thewhich works for all types of data i.e. Unstructured, knowledge and searching experience of the user. TheSemi-structured and Structured. Indexing & querying 700 | P a g e
  4. 4. Harsh Khatter , Brij Mohan Kalra / International Journal of Engineering Research and Applications (IJERA) ISSN: 2248-9622 www.ijera.com Vol. 2, Issue 4, June-July 2012, pp.698-702model of the blog must be easier to operate and handle methodologies, so that the user can get what he exactlyby both, user and the developer. wants in an efficient manner and with ease of operability.4.2 Based on Searching Methods Based on searching of blog posts, the REFERENCESsearching method must be appropriate to search the [1] V. K. Singh, D. Mahata and R. Adhikari,results from all types of data i.e. structured, Mining the Blogosphere from a Socio-politicalunstructured, and semi-structured. If a search method Perspective, International Conference onwill search the results, only of single data type then Computer Information Systems and Industrialuser will not be able to fetch all relevant blog posts. Management Applications (CISIM), 2010, 365The method discussed by Guoliang Li et al. is a better – 370.option to use in Blogs [18]. Search will be efficient [2] Zhou Ping, Research on Personalized Blogand there must be an optimized query to search the Information Retrieval, International ConferenceBlog posts. Some minor changes and some add on on Web Information Systems and Miningservices, will make the best searching results. (WISM), 2010, 289 – 292. [3] Mukul Joshi and Nikhil Belsare, BlogHarvest:V. COLLABORATION OF BLOG WITH Blog Mining and Search Framework, OTHER WEB TOOLS International Conference on Management of Blog is a web tool handled by an individual. Data COMAD, 2006.There are various other web tools like Social [4] Peter Duffy and Axel Bruns, The use of blogs,Networking Sites (SNS), Discussion Forums, Wikis, wikis and RSS in education: A conversation ofOnline Communities, etc. Each tool has its own possibilities, Learning and Teachingimportant and provides better results as per user’s field Conference, 2006.of interest. Integration of these tools will provide an [5] Cheng Tao, Peng Xiaobo, Feng Ping, Duease to the user to use best services of Web 2.0. Web Jianming, Research on Design of A Wiki &2.0 is a term used for read – write Web. User can read Blog-based Knowledge-sharing Mechanism foras well as write the content on the World Wide Web Virtual Enterprise, Third International(WWW). The proper collaboration of these Web 2.0 Conference on Measuring Technology andtools will provide a new platform to its users to learn Mechatronics Automation (ICMTMA), 2011,things more easily, to search things, to communicate 1133 – 1137.with others i.e. friends, expertise, guides, or people of [6] Tse-Ming Tsai, Chia-Chun Shih, Seng-cho T.similar interests. In present scenario, various RSS and Chou, Personalized Blog RecommendationAtom feeds are available to collaborate these external Using the Value, Semantic, and Social Model,links to any other site, may be a Blog, Wiki, International Conference on Innovations inDiscussion Forum, Online Community, or Social Information Technology, 2006, 1 – 5.Networking Site. As Web 2.0 came as an evolution in [7] Bi Chen, Qiankun Zhao, Bingjun Sun, Mitra P.internet world, the collaboration of its tools will be an , Predicting Blogging Behavior Using Temporalevolution in informal eLearning world in the same and Social Networks, Seventh IEEEway. International Conference on Data Mining, ICDM, 2006, 439 – 444.VI. CONCLUSION [8] Yin Zhang, Kening Gao, Bin Zhang, Jinhua The major part of knowledge and recent Guo, Feihang Gao, Pengwei Guo, Clusteringactivities are shared using blogs. After taking a review Blog Posts Using Tags and Relations in theof designing and searching methods of Blogs, various Blogosphere, 1st International Conference onresearch gaps and their respective findings are well Information Science and Engineering (ICISE),discussed in the section IV. An innovative idea of 2010, 817 – 820.collaboration of various Web 2.0 tools with Blogs is [9] Chau, M., Lam, P., Shiu, B., Xu, J., Jinwei Cao,given. These new ideas and suggestions will surely A Blog Mining Framework, Internationalimprove the knowledge and searching experience of Journal of IT Professional, vol. 11, 2009, 36 -bloggers. As Blogs is getting popularity day by day, 41.so, in future Blogs will play an important role in [10] ennis Shasha, Jason T. L. Wang, Rosalbaincreasing the informal learning. Moreover, the Giugno, Algorithmics and applications of treecollaboration with RSS and Atom feeds, the power of and graph searching, twenty-first ACMblogs will become more than twice. Therefore, there is SIGMOD-SIGACT-SIGART symposium ona need to improve the Blog designing and searching Principles of database systems, 2002, 39 – 52. 701 | P a g e
  5. 5. Harsh Khatter , Brij Mohan Kalra / International Journal of Engineering Research and Applications (IJERA) ISSN: 2248-9622 www.ijera.com Vol. 2, Issue 4, June-July 2012, pp.698-702 [20] Haizhou Fu, and Kemafor Anyanwu,[11] Roy Goldman, Narayanan Shivakumar, Suresh Effectively interpreting keyword queries on Venkatasubramanian, Hector Garcia-Molina, RDF databases with a rear view, 10th Proximity Search in Databases, 24rd international conference on semantic web, International Conference on Very Large Data 2011, 193-208. Bases, 1998, 26 – 37.[12] B. Aditya, Gaurav Bhalotia, Soumen Chakrabarti, Arvind Hulgeri, Charuta Nakhe, Parag Parag, S. Sudarshan, Keyword searching and browsing in databases using BANKS, 28th international conference on Very Large Data Bases, 2002, 1083 – 1086.[13] Chintan Patel, Kaustubh Supekar, Yugyung Lee, E. K. Park, OntoKhoj: a semantic web portal for ontology searching, ranking and classification, 5th ACM international workshop on Web information and data management, Harsh Khatter is a postgraduate student, pursuing 2003, 58-61. Master of Technology in Computer Science and[14] Xing Jiang, Ah-Hwee Tan, OntoSearch: a full- Engineering from Mahamaya Technical University, text search engine for the semantic web, 21st Noida, India. He received his Bachelor’s degree in national conference on Artificial intelligence, 2010. As thesis subject, He is working on Web 2.0 vol. 2, 2006, 1325-1330. tool, Blogs. His research interests include Web[15] Gjergji Kasneci, Fabian M. Suchanek, Services, Data Mining and Databases. He has Georgiana Ifrim, Maya Ramanath, Gerhard published a research paper on databases and data Weikum, NAGA: Searching and Ranking mining in Elsevier international journal, one paper on Knowledge, 24th IEEE International Informal eLearning in ICIAICT international Conference on Data Engineering, 2008, pp. 953 conference, and one in national conference. He is also – 962. a member of IEEE Society.[16] Daniel E. Rose, and Danny Levinson, Understanding user goals in web search, 13th international conference on World Wide Web, 2004, 13-19.[17] Pavel Calado, Altigran S. da Silva, Alberto H.F. Laender, Berthier A. Ribeiro-Neto, Rodrigo C. Vieira, A Bayesian network approach to searching Web databases through keyword- based queries, International Journal of Information Processing and Management on Bayesian networks and information retrieval, vol. 40, 2004, 773-790. Brij Mohan Kalra is currently working as a Professor[18] Guoliang Li, Beng Chin Ooi, Jianhua Feng, and Head in the Department of Computer Science and Jianyong Wang, Lizhu Zhou. EASE: an Engineering at Ajay Kumar Garg Engineering Colege, effective 3-in-1 keyword search method for Ghaziabad, India. He has done his B.Tech. from Delhi unstructured, semi-structured and structured College of Engineering, Delhi in 1977 and completed data, International conference on Management his M.Tech from IIT, Delhi in 1991. He has vast of data ACM SIGMOD, 2008, 903-914. experience of 35 years of academia and industry in[19] Georgia Koutrika, Zahra Mohammadi Zadeh, CSE and IT fields. He is pursuing his Ph.D in CSE Hector Garcia-Molina, Data clouds: from Gautam Buddha University, Greater Noida, summarizing keyword search results over India. His research interests include eLearning, structured data, 12th International Conference Computer Networks, and Digital Logic Design. He is on Extending Database Technology: Advances also a member of several professional bodies: IEEE, in Database Technology, 2009, 391-402. CSI, and IETE. 702 | P a g e