OUR PROJECTS

Our diverse projects provide valuable global insights.

Our projects encompass a diverse range of data-driven projects, including LiveData Projects, livePeople, liveLanguage, and liveKnowledge. These initiatives gather data on various aspects such as human behaviour, linguistic diversity, data and knowledge resources. Through partnerships worldwide, our projects produce valuable datasets accessible to all. From exploring daily behaviours to delving into cross-lingual semantics, our projects offer insights that contribute to real-world discoveries.

DIVE INTO OUR DIVERSE PROJECTS




So far we have successfully completed


60+ PROJECTS


SOME INVOLVING THOUSANDS OF PARTICIPANTS ACROSS THE GLOBE


contributing to understanding everyday life activities across diverse communities. The DataScientia Community collaborates with universities globally to advance human-aware artificial intelligence initiatives, gathering data on human behavior, linguistic diversity, and knowledge resources.







To Better understand the world around us we have divided our projects into different carefully choosen categories, each collecting extensive data and resources.












LivePeople

Today's AI systems, fueled by data from billions of smartphone users, offer immense potential in understanding daily contexts for health, behavior studies, and personalized systems. The Livepeople Catalogue compiles diverse datasets on people's behaviors and lifestyles, gathered through various methodologies worldwide.

So far 8 Projects successfully completed involving


20,000+


PARTICIPANTS WITH 145 DIFFERENT NATIONALITIES

Across eight projects, data has been collected from 10 unique sites involving participants from 145 nationalities, with a total of 452 datasets, including 381 sensor datasets and 81 question answer datasets. The raw dataset size stands at 3.99 TB, representing six years of data collection involving 51,418 participants.




LiveLanguage

The UKC LiveLanguage Catalog, a vast multilingual lexical database emphasizing language diversity and covering numerous languages, offers open-access datasets focusing on linguistic diversity, particularly in cross-lingual lexical semantics.

15

Projects

dedicated to exploring cross-lingual lexical semantics were completed.

2371

Languages

demonstrating the breadth of linguistic diversity in projects.

2.7
Million

Language-specific word senses

that help understand the meaning ascribed to a word in a given context.

39026

Lexical gaps covered

bridging linguistic gaps for better understanding.

The UKC LiveLanguage catalog comprises data from 15 projects dedicated to exploring cross-lingual lexical semantics with data spanning across 2371 languages, encompassing 1,903,077 words and 2,774,010 language-specific word senses. We have identified and covered 110,579 concepts, addressing 39,026 lexical gaps across languages.





LiveKnowledge

The LiveKnowledge Catalogue contains metadata on various genres of knowledge resources generated from global Knowledge Engineering projects. These resources include teleologies, ontologies, teleontologies, lightweight classification ontologies, and schemas. The knowledge resources span across diverse topics like healthcare, culture, geography, society and territory, internet of things and many more. There are five namespaces and a total of 30 schemas across 15 projects.

15

Topics

covering diverse domains of knowledge e.g. healthcare, culture, geography etc.

30

Schemas

representation of the organization and relationships within a knowledge domain.

15

Projects

completed that provide metadata on various genres of knowledge resources.





22
Projects




4 Data Domains




3 Languages




LiveData

The DataScientia LiveData Catalog is a centralized collection of LiveData catalogs within the DataScientia community. It serves as a hub for accessing information about various data domains represented by specific LiveData catalogs. Users can navigate through these catalogs, exploring their contents directly from the main LiveData catalog.

With a total of 22 projects completed, our LiveData catalogs span across 4 data domains, additionally supporting three local languages (Italian, English, Mongolian). The catalogs are exploreable through 5 websites, one for Main LiveData while 4 For domain specific LiveData.






Interested?

Get involved. Your next project is just a click away.