Sources of information and our dependence on them are increasing at a phenomenal rate. The most obvious example is the explosive growth and rapid evolution of the World Wide Web, but other projects in research, industry, healthcare and government also exhibit a critical dependence on the effective management and exploitation of large scale data. The kind of information available is also changing rapidly, and often includes un-structured and semi-structured data, streaming data, noisy and incomplete data, and linked datasets. Simultaneously dealing with the rapidly increasing size, complexity and heterogeneity of data presents a grand challenge for information systems research, and has created an urgent need for more capable information systems. Meeting this need will be critical to the UK's future competitiveness.
Informaion systems clearly have a key role to play in addressing these extremely complex problems, but they need to evolve to reflect the rapidly changing information landscape. This evolution is the basis for the emerging field of semantics-aware data management, which involves a synthesis of ontological reasoning and database management principles. Semantics-aware systems employ rich schemas (AKA ontologies) that allow them to deal with incomplete and semi-structured information from heterogeneous sources, and to answer queries in a way that reflects both knowledge and data, i.e., to deliver understanding from information.
We believe, however, that if such systems are to be widely applicable, then their enhanced capabilities must, be in addition to, and not instead of, the well-established features and high performance of existing database systems; moreover, we believe that they will need to incorporate techniques from many other areas of computer science, particularly those that give a complementary view of "Big Data" management, such as algorithms and machine learning, stream processing, and information retrieval. The goal of the Oxford Information Systems Group (ISG) is to develop next generation semantics-aware data management systems that fully realise the desired synthesis.