Since its creation, the Web has been a main object of research for information management, which has been primarily studied using classical paradigms. However, since the early 2000s, we are witnessing drastic changes in the area of Web data management. If we had to summarize them in one sentence, it would be: real distribution of big data.
In this new scenario, capturing the meaning of heterogeneous data and developing tools for its processing play a crucial role. The Semantic Web is an enormous initiative led by the World Wide Web Consortium whose main objective is to achieve these goals, thus transforming the current Web of documents into a Web of data, where human users and computer applications can take a better advantage of the massive amount of information stored on it. Some key steps have been made to achieve these goals. However, we are still far from having techniques that take full advantage of the semantics and the logic behind Web data, once its structure, scale and distribution –altogether– are considered as a full-fledged phenomenon.
The main goal of the Center for Semantic Web Research is to study how to effectively extract semantic data from the Web, and to develop the basic tools for such effective extraction. This is an initiative that brings together professors, researchers and students from Pontifical Catholic University of Chile, University of Chile and University of Talca, and which is funded by the Iniciativa Científica Milenio.
In this presentation, we propose a simple and expressive framework for adding metadata to CSV documents and their noisy variants. The framework is based on annotating parts of the document that can be later used for selecting data. These expressions are then combined using a set of rules in order to annotate the data. We study the computational complexity of implementing our framework and present an efficient evaluation algorithm that runs in time proportional to its output and linear in its input. As a proof of concept, we test an implementation of our framework against a large number of real world datasets and show that it can be efficiently used in practice.
The data that we record daily about ourselves through our cell phones, credit card purchases, emails, social media postings, etc., helps us connect with each other and improve our life quality. But we do not own nor control most of this data. I will discuss the good and the bad in this powerful information technology, including a number of cases – some inspiring, some terrifying – and what we can do as citizens to obtain the promise of big data while mitigating some of the worst problems. Among other things, I will trace problems with the release of "anonymized" data that is later "deanonymized".
Former Undergraduate student and current PhD student Pablo Muñoz under the supervision of Pablo Barceló obtained the "Vienna Center for Logic and Algorithms Outstanding Undergraduate Research Award". This award is given by one of the most important institutions in Computer Science in Europe and Pablo has been invited to present his research work at the center this year.
The Council of Professors and Heads of Computing (CPHC), in conjunction with the British Computer Society (BCS) and the BCS Academy of Computing has selected Dr. Juan Reutter’s dissertation as the winner of the BCS Distinguished Dissertation Award, that annually selects for publication the best British PhD/DPhil dissertation in computer science.
El sitio Web http://constitucionabierta.cl/ donde los ciudadanos pueden subir y visualizar actas de los Encuentros Locales Autoconvocados que ellos mismos decidan hacer públicas, ha tenido interés de la prensa nacional. A continuación puedes leer las notas que han sido publicadas en diferentes medios nacionales:
Nuestro Investigador Jorge Pérez presentó en Arica, una charla titulada: "Sharing Economy: Economía colaborativa, datos y algoritmos". "Tengo un mensaje más general de educación de tratar que estos temas se entiendan, más allá que exista una pelea particular sobre que haya o no un mejor servicio; la idea es que se entiendan las implicancias que van detrás de eso, sobre todo el uso de datos que tienen que ver con tecnología de computación". http://bit.ly/24P5IDz