IJSRP Logo
International Journal of Scientific and Research Publications

IJSRP, Volume 4, Issue 3, March 2014 Edition [ISSN 2250-3153]


Web Forum Crawling
      R.Priya, Ms.S.Dhanalakshmi, S.Priyadharshini
Abstract: The supervised web-scale forum crawler is to crawl relevant forum content from the web with minimum overhead. Forum threads contain information content that is the target of forum crawlers. each forums have different layouts or styles and have different forum software packages, they always have similar constant navigation paths connected by specific URL types to direct users from entry pages to thread page. we reduce the web forum crawling problem to a URL-type recognition problem. And shows how to learn accurate and effective regular expression patterns of constant navigation paths from automatically created training sets using aggregated results from weak page type classifiers. Robust page type classifiers can be experienced from as few as five annotated forums and applied to a large set of unseen forums.

Reference this Research Paper (copy & paste below code):

R.Priya, Ms.S.Dhanalakshmi, S.Priyadharshini (2018); Web Forum Crawling; Int J Sci Res Publ 4(3) (ISSN: 2250-3153). http://www.ijsrp.org/research-paper-0314.php?rp=P272397
©️ Copyright 2011-2023 IJSRP - All rights reserved. Use of this web site signifies your agreement to the terms and conditions.