Leszek Wiesner a7bd8c60a5 Documentation		4 lat temu
..
inputs	cfe77cea87 New schemas, improved tooling, initialization scripts	4 lat temu
schemas	cfe77cea87 New schemas, improved tooling, initialization scripts	4 lat temu
scripts	cfe77cea87 New schemas, improved tooling, initialization scripts	4 lat temu
types	cfe77cea87 New schemas, improved tooling, initialization scripts	4 lat temu
typings	cfe77cea87 New schemas, improved tooling, initialization scripts	4 lat temu
.gitignore	cfe77cea87 New schemas, improved tooling, initialization scripts	4 lat temu
README.md	a7bd8c60a5 Documentation	4 lat temu
package.json	cfe77cea87 New schemas, improved tooling, initialization scripts	4 lat temu
tsconfig.json	cfe77cea87 New schemas, improved tooling, initialization scripts	4 lat temu
vscode-recommended.settings.json	cfe77cea87 New schemas, improved tooling, initialization scripts	4 lat temu

Content directory json schemas and inputs

Definitions

In order to make this documentation as clear as possible it is important to make a strict distinction between two types of schemas:

json-schemas mean files with .schema.json extension. This is a common standard for describing how to validate other json files or objects (ie. a package.json file may be an example of a file that needs to support a specific json-schema). A documentation of this standard can be found here: https://json-schema.org/
runtime-scheams means schemas as they are "understood" by the content-directory runtime module, so schemas that can be added to classes via api.tx.contentDirectory.addClassSchema and supported by a given entity via api.tx.contentDirectory.addSchemaSupportToEntity

Content directory input

Initializing content directory

In order to intialize the content directory on a development chain based on data that is provided in form of json files inside /inputs directory (classes, schemas and example entities - entityBatches), we can run:

yarn workspace cd-schemas initialize:dev

This will handle:

Creating a membership for ALICE (if not already created)
Setting (hiring) ALICE as content curators lead (if not already set)
Creating classes in the runtime based on inputs/classes json inputs (if the content directory is currently empty)
Creating schemas in the runtime based on inputs/schemas and adding them to the related classes
Creating entities based on inputs/entityBatches based on json files that allow specifying entities and relationships between them in a simplified way. Those inpus are then converted to one huge api.tx.contentDirectory.transaction call

Input files naming

Currently the tooling has some limitations when it comes to naming files inside the inputs directory. There is a specific pattern that has to be respected:

Each input file name should start with related class id (counting from 1, since it's assumed that the classes will be created inside an initially empty content directory) followed by and userscore and a class name (for example: 1_Language), followed by one of the following strings: Class, Schema or Batch (based on the input type, ie. 1_LanguageBatch)

`json-schemas` support for json inputs in `VSCode`

In order to link json files inside inputs directory to json-schemas inside schemas and have them validated in real-time by the IDE, follow the steps below:

If you don't have .vscode/settings.json in the root monorepo workspace yet:

Create .vscode directory inside your monorepo workspace
Copy vscode-recommended.settings.json into this .vscode directory and rename it to settings.json.

If you already have the .vscode/settings.json file in the root monorepo workspace:

Copy the settings from vscode-recommended.settings.json and merge them with the existing .vscode/settings.json

Now all the json files matching *Class.json, *Schema.json, *{EntityName}Batch.json patters will be linked to the correct json schemas. If you edit any file inside inputs or add a new one that follows the naming pattern (described in Input files naming), you should get the benefit of autocompleted properties, validated input, on-hover tooltips with property descriptions etc.

For more context, see: https://code.visualstudio.com/docs/languages/json

Validate inputs via command

Inpus inside inputs/classes and inputs/schemas can also be validated using yarn workspace cd-schemas validate command. NOTE: Validation of inputs inside inputs/entityBatches via the command is not supported yet.

The command also validates whether json-schemas inside schemas/extrinsics are actually valid json-schemas.

In the future it is expected that this command will do a complete validation of the content and structure of all json files inside content-directory-schemas, which should facilitate checking the validity of content-directory-schemas through CI.

Entity batches

The concept of entity batches (inputs/entityBatches) basically provides an easy way of describing complex input to content directory (ie. many entities related to each other in many ways) without the need to deal with lower-level, hard-to-validate runtime operations like CreateEntity and AddSchemaSupportEntity and trying to glue them together into a huge api.tx.contentDirectory.transaction call.

Instead, the script that initializes the content directory (scripts/initializeContentDir.ts) is able to generate the complex api.tx.contentDirectory.transaction call based on a more human-readable input provided in inputs/entityBatches.

This input can be provided as a simple json array of objects matching { [propertyName]: propertyValue} structure.

For example, in order to describe creating entities as simple as Language, which only has Code and Name properties, we can just create an array of objects like:

[
  { "Code": "EN", "Name": "English" },
  { "Code": "RU", "Name": "Russian" },
  { "Code": "DE", "Name": "German" }
]

(This is the actual content of inputs/entityBatches/1_LanguageBatch.json)

Related entities

There also exists a specific syntax for defining relations between entities in batches. We can do it by either using "new" or "existing" keyword.

The "new" keyword allows describing a scenario where related entity should be created along with the main entity and then referenced by it. An example of this could be Video and VideoMedia which have a one-to-one relationship and it doesn't make much sense to specify them in separate batches. Instead, we can use a syntax like:
```
{
"title": "Awsome video",
/* other Video properties... */
"media": { "new": {
"pixelWidth": 1024,
"pixelHeight": 764,
/* other VideoMedia object properties... */
}
}
```
The "existing" keyword allows referencing an entity created as part of some other (previous!) batch inside inputs/entityBatches. We can do it by specifying the value of any unique property of the referenced entity. So, for example to reference a Language entity from VideoBatch.json file, we use this syntax:
```
{
"title": "Awesome video",
/* other Video properties... */
"language": { "existing": { "Code": "EN" } }
}
```

`json-schemas` and tooling

Entity `json-schemas`

There is a script that provides an easy way of converting runtime-schemas (based on inputs from inputs/schemas) to json-schemas (.schema.json files) which allow validating the input (ie. json files) describing some specific entities. It can be run with:

yarn workspace cd-schemas generate:entity-schemas

Those json-schemas are currently mainly used for validating the inputs inside inputs/entityBatches.

The generated json-schemas include:

schemas/entities - json-schemas that provide validation for given entity (ie. Video) input. They can, for example, check if the title property in a json object is a string that is no longer than 64 characters. They are used to validate a single entity in inputs/entityBatches, but can also be re-used to provide "fontend" validtion of any entity input to the content directory (ie. input provided to/via joystream-cli).
schemas/entityReferences - json-schemas that describe how an entity of given class can be referenced. Currently they are used for providing an easy way of referencing entites between batches in inputs/entityBatches. For more details on how entities can be referenced in batches, read the Entity batches section.
schemas/entityBatches - very simple json-schemas that basically just provide array wrappers over schemas/entities. Those are the actual json-schemas that can be linked to json input files inside inputs/entityBatches (ie. via .vscode/settings.json)

Typescript support

Thanks to the json-schema-to-typescript library, we can very simply generate Typescript interfaces based on existing json-schemas. This can be done via:

yarn workspace cd-schemas generate:types

This command will generate:

types/entities based on schemas/entities, providing typescript interfaces for entities like Video etc. (note that this interface will include a peculiar way of describing entity relationships, further described in Entity batches section)
types/extrinsics based on schemas/extrinsics, providing typescript interfaces for input to extrinsics like AddClassSchema and CreateClass

The most obvious use-case of those interfaces currently is that when we're parsing any json files inside inputs using a Typescript code, we can assert that the resulting object will be of given type, ie.:

const createClassInput = JSON.parse(fs.readFileSync('/path/to/inputs/1_LanguageClass.json')) as CreateClass

Besides that, a Typescript code can be written to generate some inputs (ie. using a loop) that can then can be used to initialize classess, schemas or insert entities into the content directory.

There are a lot of other potential use-cases, but for the purpose of this documentation it should be enough to mention there exists this very easy way of converting .schema.json files into Typescript interfaces.

Current limitations

Some limitations that should be dealt with in the nearest future:

Filename restrictions described in Input files naming section
The requirement of knowing the class id (ie. when defining the references in inputs/schemas)
Some of the code runs on the assumption that there is only one schema for each class, which is very limiting
Inside input/entityBatches we cannot reference entities that are part of a latter batch (batches are ordered by class id)
Vector<Reference> property type is not yet supported when parsing entity batches

README.md