Facilitate Cross-Repository Big Data Discovery and Reuse
Researchers have accumulated large amount of observational, experimental, and simulation data. Much effort has been made to collect, curate, preserve, and provide open access to them, but putting the data online is only the start. Coined by Jim Gray as the fourth paradigm, the data-intensive science strives to uncover the hidden patterns and correlations across research topics and disciplines by aggregating and cross-interrogating these data silos. The productivity for e-Research may be much improved if we can provide the researchers with fast, easy, and cost-effective methods to discover and reuse these datasets in an ad-hoc and explorative manner.