Skip to main content

Template: hello-world

hello-world is a datafact template for creating simple datasets.

Create a new project using this template

datafact new datafact-tutorial/hello-world -t hello-world
Click to show output

Now go to the project folder datafact-tutorial/hello-world.

cd datafact-tutorial/hello-world
ls
Hide Output
README.md         data.py           datafact.json     project.py        type.py

What to modify

You should modify the following files:

  • data.py
  • type.py
  • README.md

Implement a function called create_data_dict which produce the content of your dataset.

Example
from dataset_sh.constants import DEFAULT_COLLECTION_NAME


def create_main_collection():
return [
{'language': 'en', 'value': 'Hello World!'},
{'language': 'fr', 'value': "Bonjour le monde!"},
{'language': 'es', 'value': "¡Hola Mundo!"},
{'language': 'de', 'value': "Hallo Welt!"},
{'language': 'it', 'value': "Ciao Mondo!"},
{'language': 'pt', 'value': "Olá Mundo!"},
{'language': 'zh', 'value': "你好,世界!"},
{'language': 'ja', 'value': "こんにちは世界!"},
{'language': 'hi', 'value': "नमस्ते दुनिया!"},
{'language': 'ar', 'value': "مرحبا بالعالم!"},
{'language': 'ru', 'value': "Привет, мир!"},
{'language': 'ko', 'value': "안녕하세요 세계!"},
]


def create_data_dict():
return {
DEFAULT_COLLECTION_NAME: create_main_collection()
}

Build and publish

To build your dataset

python project.py build

To preview your build before publish

python project.py preview show

To publish your build

python project.py publish

Example: geo/country-code

More Examples