• Projects

  • A declarative, rapid development framework for data-driven Ajax reports and applications. Rich visualizations and collaborative workflows require only a few lines of SQL-based code and visualization/interaction markup.

    Project Website

    Forward

  • DELPHI is a platform that enables integrated access to and analysis of all data relevant to health. The platform promotes more rapid development of empowering, data-driven health apps and tools by a broad community of health-related software developers.

    Project Website

    Delphi

  • SQL++ is a highly expressive semi-structured query language that encompasses both the SQL and the JSON data models. SQL++ is backwards-compatible with SQL. The Configurable version of SQL++ includes configuration options that formally itemize the semantic variations that language designers may choose from. We use SQL++ in FORWARD's middleware query processor.

    Project Website

    SQL++ and Middleware

  • We are redesigning database and NoSQL systems to take full advantage of next-generation non-volatile memories. We have developed novel programming models for dealing with persistent state in programs, and we are re-engineering SSDs and memory hierarchies to support and accelerate them.

    Project Website

    Reengineering DB Systems for SSDs

  • FastR: a native database that provides fully pipelined methods to answer relationship queries. FastR uses a new data organization, which allows compression schemes that do not support random access.

    Project Website

    FastR

  • A recent trend in workflow specification, called data-driven workflows, resulted from an evolution away from the traditional process-centric approach towards data-awareness.

    Specification and Verification

  • The ObjectRank system applies the random walk model, the effectiveness of which is proven by Google's PageRank, to keyword search in databases modeled as labeled graphs. The system ranks the database objects with respect to the user-provided keywords.

    Project Website

    ObjectRank
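
    The random-walk idea in the description above can be illustrated with a small sketch. It is not the ObjectRank implementation: the function name `keyword_rank`, the damping factor of 0.85, the uniform split of score across outgoing edges, and the toy graph in the usage note are all assumptions made for illustration. The walk restarts at the objects whose text contains the keyword, so objects reachable from them accumulate score.

```python
def keyword_rank(edges, text, keyword, damping=0.85, iterations=50):
    """Random walk with restarts at the objects that contain `keyword`.

    edges: list of (source, target) pairs between object ids (assumed unlabeled here)
    text:  dict mapping every object id to its text
    """
    nodes = sorted(text)
    # Base set: objects whose text contains the keyword as a whole word.
    base = [n for n in nodes if keyword.lower() in text[n].lower().split()]
    if not base:
        return {n: 0.0 for n in nodes}

    out = {n: [] for n in nodes}
    for src, dst in edges:
        out[src].append(dst)

    restart = {n: (1.0 / len(base) if n in base else 0.0) for n in nodes}
    score = dict(restart)
    for _ in range(iterations):
        # With probability (1 - damping) the walker jumps back to the base set.
        nxt = {n: (1.0 - damping) * restart[n] for n in nodes}
        for src in nodes:
            if out[src]:
                share = damping * score[src] / len(out[src])
                for dst in out[src]:
                    nxt[dst] += share
            else:
                # Dangling object: hand its score back to the base set.
                for n in base:
                    nxt[n] += damping * score[src] * restart[n]
        score = nxt
    return score
```

    For example, with the hypothetical objects `{"p1": "keyword search survey", "p2": "graph ranking", "p3": "xml streams"}` and edges `[("p1", "p2"), ("p3", "p1")]`, a query for `"keyword"` gives `p2` a positive score even though it does not contain the keyword, because `p1` links to it.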

  • The MIX project (Mediation of Information using XML) is a collaboration between the UCSD Database Laboratory and the Data-intensive Computing Environments (DICE) group at SDSC. The goal of the project, which recently became a part of the I2T NSF Digital Government effort, is to study, develop, apply and evaluate systems for querying across heterogeneous information sources using XML.

    MIX Project

  • Query Set Specification Language (QSSL) allows the concise description of sets of parameterized XPath queries. A QSS is embedded in a WSDL specification to form a specialized type of web service, called a Data Service. Data Services connect the calls that the source accepts with the underlying schema. QSSL will be enhanced to describe subsets of XQuery expressions beyond XPath ones.

    QSSL

  • MARS is a system for publishing data as XML from mixed (relational + XML) proprietary storage, while supporting redundancy in storage for tuning purposes. Starting with client XQueries formulated against the public schema, MARS achieves the combined effect of rewriting-with-views, composition-with-views, and query minimization under integrity constraints to obtain optimal reformulations against the proprietary schema.

    MARS

  • REFORM was created as a system for rewriting XML nested queries using XML nested views. It takes an input XQuery expression and a set of XQuery views and returns an XQuery expression equivalent to the original one, but formulated in terms of the views' output schemas.

    REFORM

  • Preference queries are needed in operations research and many real-life applications, but database systems cannot efficiently produce the top results of a preference query because they need to evaluate the weight function over all tuples of the relation. PREFER can pipeline and produce the top results of preference queries efficiently by using materialized views that have been preprocessed and stored.

    PREFER
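
    To make the cost described above concrete, here is a minimal sketch of the naive baseline: it scores every tuple with the weight function and keeps the best k. It is not PREFER's view-based algorithm; the function name `top_k_naive`, the choice of a linear weight function, and the sample tuples are assumptions made for illustration.

```python
import heapq


def top_k_naive(tuples, weights, k):
    """Return the k tuples with the highest weighted score.

    Every tuple is scored, so the work grows with the size of the relation.
    This is the full scan that precomputed, materialized views aim to avoid.
    """
    def score(row):
        # Linear weight function (an assumption): sum of weight * attribute value.
        return sum(w * v for w, v in zip(weights, row))

    return heapq.nlargest(k, tuples, key=score)
```

    With the hypothetical tuples `[(1, 5), (4, 4), (5, 1), (2, 2)]`, the weights `(0.8, 0.2)` favor the first attribute and the weights `(0.2, 0.8)` favor the second, so the two preference queries return different top results over the same relation.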

  • GEON started in 2002 as a project funded under the NSF Information Technology Research (ITR) program. The project began as a collaborative research effort among a dozen PI institutions to develop cyberinfrastructure in support of data sharing and integration within the Earth Sciences community.

    GEON - The GEOsciences Network

  • PIX is a system that enables flexible and efficient phrase matching in XML documents. Since XML allows structured and unstructured information to be interleaved, XML documents often contain "mixed content". Unlike phrase matching on "flat text", phrase matching on mixed content raises new challenges. In particular, phrases to match might span document structure.

    Project Website

    Phrase Matching in XML
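
    The challenge described above, a phrase that spans document structure, can be shown with a small sketch. It is not the PIX algorithm: the function name `phrase_in_mixed_content` is hypothetical, and the sketch simply ignores all markup and compares lowercase, whitespace-separated words, using only the Python standard library.

```python
import xml.etree.ElementTree as ET


def phrase_in_mixed_content(xml_text, phrase):
    """Check whether `phrase` occurs in the document's text, ignoring all tags."""
    root = ET.fromstring(xml_text)
    # itertext() yields the text pieces in document order, across element boundaries.
    words = "".join(root.itertext()).lower().split()
    target = phrase.lower().split()
    n = len(target)
    # Look for the target words as a contiguous run in the flattened word list.
    return n > 0 and any(words[i:i + n] == target for i in range(len(words) - n + 1))
```

    For the document `<p>The <b>quick</b> brown fox</p>`, a plain substring search for "quick brown" over the serialized markup fails because the closing `</b>` tag sits between the two words, while the sketch above finds the phrase.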

  • The XML Query Algebra is used to define the formal semantics of XML query languages. Our demo site provides several default XML data sources and some sample queries. You can run the sample queries and verify the query results. You can also modify any sample query, compose your own queries, or query your own data sources by supplying them in the demo's query data window.

    Project Website

    XML Query Algebra

  • The first generation of XML query processors, including the MIX processor of UCSD’s DB lab, focused on the efficient processing of XML queries issued on XML databases or XML views of information sources, such as relational databases, files, etc. The recently started XSM (Xml Stream Machine) project of UCSD narrows its focus to XML streams and expects order-of-magnitude performance improvements using a novel architecture and algorithms that reduce XQuery statements into an automaton-like structure, also called an XSM.

    Project Website

    XSM Project

  • The goal of WAVE is the Specification and Verification of Interactive Web Applications powered by an underlying database. WAVE allows web application developers to declaratively specify the behavior of an interactive web application as a function of the user input, the current state of the application, and the information in an underlying database.

    Project Website

    WAVE: Web Application VErifier

  • We have developed the Indexed Lookup Eager and Scan Eager algorithms to answer keyword search queries. Experiments have shown that the Indexed Lookup Eager algorithm outperforms the Scan Eager algorithm and the Stack algorithm from prior work, often by orders of magnitude, when the keywords in the query have quite different frequencies, and loses only by a small margin when the keywords have similar frequencies.

    Project Website

    XKSearch

  • The CLIDE System helps the owners of sources that participate in Web service-based data publishing systems publish a restricted set of parameterized queries over the schema of their sources and package them as WSDL services.

    Clide

  • XTreeNet: a distributed query dissemination engine which facilitates democratization of publishing and efficient data search among members of online communities with powerful full-text queries.

    XTreeNet

  • The goal of the DXQ project is to support development of reliable, extensible, and efficient distributed resource-management protocols. Our strategy to meet these requirements is to provide a high-level, distributed, and optimizable query language for implementing distributed resource-management protocols. By using a high-level language, a protocol's semantics is transparent, not hidden, in the implementation, which supports the reliability requirement.

    DXQ

  • Keyword search is the most popular information discovery method because the user does not need to know either a query language or the underlying structure of the data.

    Project Website

    XKeyword
