Rosy Madaan
YMCA University of Science and Technology
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Rosy Madaan.
grid computing | 2010
Komal Kumar Bhatia; A. K. Sharma; Rosy Madaan
Existing search engines crawl and index surface web, ignoring hidden web which otherwise contains more than 500 times of information than PIW. In this paper, a Domain-specific Hidden Web Crawler (AKSHR) is being proposed. The framework extracts hidden web pages by accruing benefits of its three unique features: 1) automatic downloading of search interfaces to crawl hidden web databases, 2) identification of semantic mappings between search interface elements by using a novel approach called DSIM (Domain-specific Interface Mapper), and 3) the capability to automatic filling of search interfaces. The effectiveness of proposed framework has been evaluated through experiments using real web sites and encouraging preliminary results were obtained.
grid computing | 2012
Rosy Madaan; Ashok.Kumar Sharma; Ashutosh Dixit
A general crawler downloads web pages that may be of any kind, thus forming a source of information for the search engine. Blog crawler is similar to a general crawler except that it restricts its crawl boundary to the blog space, thus downloading only the blog pages and ignoring rest of the web. Since blog is an emerging phenomenon and serve as very useful source of information, a blog crawler proves to be of great help in this regard. We propose a new algorithm for blog crawler and discuss a number of related issues. Also, as the result of analysis, it has been found that our proposed blog crawler is superior to the general crawler.
international conference on contemporary computing | 2010
Rosy Madaan; Ashutosh Dixit; A. K. Sharma; Komal Kumar Bhatia
Hidden Web’s broad and relevant coverage of dynamic and high quality contents coupled with the high change frequency of web pages poses a challenge for maintaining and fetching up-to-date information. For the purpose, it is required to verify whether a web page has been changed or not, which is another challenge. Therefore, a mechanism needs to be introduced for adjusting the time period between two successive revisits based on probability of updation of the web page. In this paper, architecture is being proposed that introduces a technique to continuously update/refresh the Hidden Web repository.
Archive | 2019
Rosy Madaan; A. K. Sharma; Ashutosh Dixit; Poonam Bhatia
Search engine is a program that performs a search in the documents for finding out the response to the user’s query in form of keywords. It then provides a list of web pages comprising of those keywords. Search engines cannot differentiate between the variable documents and spams. Some search engine crawler retrieves only document title not the entire text in the document. The major objective of Question Answering system is to develop techniques that not only retrieve documents, but also provide exact answers to natural language questions. Many Question Answering systems developed are able to carry out the processing needed for attaining higher accuracy levels. However, there is no major progress on techniques for quickly finding exact answers. Existing Question Answering system is unable to handle variety of questions and reasoning-based question. In case of absence of data sources, QA system fails to answer the query. This paper investigates a novel technique for indexing the semantic Web for efficient Question Answering system. Proposed techniques include manual constructed question classifier based on , retrieval of documents specifically for Question Answering, semantic type answer extraction, answer extraction via manually constructed index for every category of Question.
International Journal of Information Retrieval Research (IJIRR) | 2014
Rosy Madaan; A. K. Sharma; Ashutosh Dixit
Question answering offers a more intuitive approach to information processing. A number of approaches have been used for answering questions. In this paper, we propose a questionansweringsystem that uses blogs as its source of information. The system deals with crawling blog pages, summarizing them, indexing and then ranking the summarized content. The user asks a question and gets answer(s) in response. The answer(s) obtained are better as compared to those provided by the existing QA systems that use the general web pages for the purpose of answering. The experimental results show that the proposed system has shown promising results and the responses given by the system are better than those given by the existing QA systems.
arXiv: Information Retrieval | 2013
Renu Mudgal; Rosy Madaan; A. K. Sharma; Ashutosh Dixit
arXiv: Information Retrieval | 2013
Rosy Madaan; A. K. Sharma; Ashutosh Dixit
International Journal of Advances in Computing and Information Technology | 2012
Rosy Madaan; Sharma A.K; Ashutosh Dixit; Deepti Kapri; Renu Mudgal
international conference on computing for sustainable global development | 2015
Rosy Madaan; A. K. Sharma; Ashutosh Dixit
Archive | 2014
Sonali Gupta; Komal Kumar Bhatia; Rosy Madaan; Subhash Medhi; Tulshi Bezboruah; Mounir Zrigui