Our projects encompass a diverse range of data-driven projects, including LiveData Projects, livePeople, liveLanguage, and liveKnowledge. These initiatives gather data on various aspects such as human behaviour, linguistic diversity, data and knowledge resources. Through partnerships worldwide, our projects produce valuable datasets accessible to all. From exploring daily behaviours to delving into cross-lingual semantics, our projects offer insights that contribute to real-world discoveries.
So far we have successfully completed
contributing to understanding everyday life activities across diverse communities. The DataScientia Community collaborates with universities globally to advance human-aware artificial intelligence initiatives, gathering data on human behavior, linguistic diversity, and knowledge resources.
Today's AI systems, fueled by data from billions of smartphone users, offer immense potential in understanding daily contexts for health, behavior studies, and personalized systems. The Livepeople Catalogue compiles diverse datasets on people's behaviors and lifestyles, gathered through various methodologies worldwide.
So far 8 Projects successfully completed involving
Across eight projects, data has been collected from 10 unique sites involving participants from 145 nationalities, with a total of 452 datasets, including 381 sensor datasets and 81 question answer datasets. The raw dataset size stands at 3.99 TB, representing six years of data collection involving 51,418 participants.
The UKC LiveLanguage Catalog, a vast multilingual lexical database emphasizing language diversity and covering numerous languages, offers open-access datasets focusing on linguistic diversity, particularly in cross-lingual lexical semantics.
15
dedicated to exploring cross-lingual lexical semantics were completed.
2371
demonstrating the breadth of linguistic diversity in projects.
2.7
Million
that help understand the meaning ascribed to a word in a given context.
39026
bridging linguistic gaps for better understanding.
The UKC LiveLanguage catalog comprises data from 15 projects dedicated to exploring cross-lingual lexical semantics with data spanning across 2371 languages, encompassing 1,903,077 words and 2,774,010 language-specific word senses. We have identified and covered 110,579 concepts, addressing 39,026 lexical gaps across languages.
The LiveKnowledge Catalogue contains metadata on various genres of knowledge resources generated from global Knowledge Engineering projects. These resources include teleologies, ontologies, teleontologies, lightweight classification ontologies, and schemas. The knowledge resources span across diverse topics like healthcare, culture, geography, society and territory, internet of things and many more. There are five namespaces and a total of 30 schemas across 15 projects.
15
covering diverse domains of knowledge e.g. healthcare, culture, geography etc.
30
representation of the organization and relationships within a knowledge domain.
15
completed that provide metadata on various genres of knowledge resources.
22
Projects
4 Data Domains
3 Languages
The DataScientia LiveData Catalog is a centralized collection of LiveData catalogs within the DataScientia community. It serves as a hub for accessing information about various data domains represented by specific LiveData catalogs. Users can navigate through these catalogs, exploring their contents directly from the main LiveData catalog.
With a total of 22 projects completed, our LiveData catalogs span across 4 data domains, additionally supporting three local languages (Italian, English, Mongolian). The catalogs are exploreable through 5 websites, one for Main LiveData while 4 For domain specific LiveData.
Get involved. Your next project is just a click away.