Browsing by Author "Chen, Yuxin"
Now showing 1 - 3 of 3
Results Per Page
Sort Options
- Intelligent Fusion of Structural and Citation-Based Evidence for Text ClassificationZhang, Baoping; Goncalves, Marcos A.; Fan, Weiguo; Chen, Yuxin; Fox, Edward A.; Calado, Pavel; Cristo, Marco (Department of Computer Science, Virginia Polytechnic Institute & State University, 2004)This paper investigates how citation-based information and structural content (e.g., title, abstract) can be combined to improve classification of text documents into predefined categories. We evaluate different measures of similarity, five derived from the citation structure of the collection, and three measures derived from the structural content, and determine how they can be fused to improve classification effectiveness. To discover the best fusion framework, we apply Genetic Programming (GP) techniques. Our empirical experiments using documents from the ACM digital library and the ACM classification scheme show that we can discover similarity functions that work better than any evidence in isolation and whose combined performance through a simple majority voting is comparable to that of Support Vector Machine classifiers.
- A Novel Hybrid Focused Crawling Algorithm to Build Domain-Specific CollectionsChen, Yuxin (Virginia Tech, 2007-02-05)The Web, containing a large amount of useful information and resources, is expanding rapidly. Collecting domain-specific documents/information from the Web is one of the most important methods to build digital libraries for the scientific community. Focused Crawlers can selectively retrieve Web documents relevant to a specific domain to build collections for domain-specific search engines or digital libraries. Traditional focused crawlers normally adopting the simple Vector Space Model and local Web search algorithms typically only find relevant Web pages with low precision. Recall also often is low, since they explore a limited sub-graph of the Web that surrounds the starting URL set, and will ignore relevant pages outside this sub-graph. In this work, we investigated how to apply an inductive machine learning algorithm and meta-search technique, to the traditional focused crawling process, to overcome the above mentioned problems and to improve performance. We proposed a novel hybrid focused crawling framework based on Genetic Programming (GP) and meta-search. We showed that our novel hybrid framework can be applied to traditional focused crawlers to accurately find more relevant Web documents for the use of digital libraries and domain-specific search engines. The framework is validated through experiments performed on test documents from the Open Directory Project. Our studies have shown that improvement can be achieved relative to the traditional focused crawler if genetic programming and meta-search methods are introduced into the focused crawling process.
- Optimal Trajectory Planning for a Space Robot Docking with a Moving Target via Homotopy AlgorithmsChen, Yuxin; Watson, Layne T. (Department of Computer Science, Virginia Polytechnic Institute & State University, 1993)The mathematical formulation of optimal trajectory planning for a space robot docking with a moving target is derived. The calculus of variations is applied to the problem so that the optimal robot trajectory can be obtained directlt from the target information without first planning the trajectory of the end-effector. The nonlinear two-point boundary value problem resulting from the problem formulation is solved numerically by a globally convergent homotopy algorithm. The algorithm guarantees convergence to a solution for an arbitrarily chosen initial guess. Numerical simulation for three examples demonstrates the approach.