Service - Translations: Difference between revisions

From Izara Wiki
Jump to navigation Jump to search
No edit summary
 
(21 intermediate revisions by the same user not shown)
Line 1: Line 1:
= Overview =
= Overview =


Service manages translations in the service.
Service manages language translations.


= Repository =
= Repository =
Line 15: Line 15:
<syntaxhighlight lang="JavaScript">
<syntaxhighlight lang="JavaScript">
{
{
configKey: "TranslationsGraphServiceName"
configKey: "TranslationGraphServiceName"
configTag: "TranslationsGraphServiceName"
configTag: "TranslationGraphServiceName"
configValue: xxx // eg: "TranslationsGraph"
configValue: xxx // eg: "TranslationGraph"
}
}
</syntaxhighlight>
</syntaxhighlight>
== LogicalResults ==
Stores results for any requests to perform logical searches on media links


<syntaxhighlight lang="JavaScript">
<syntaxhighlight lang="JavaScript">
{
{
configKey: "GraphTag"
resultId: xxx // eg: filterMainId for a single logical element
configTag: "translation"
dataId: xxx // one translationLinkId or one subject nodes identifier (the logical request will set what property should be used)
configValue: xxx // eg: "translation"
}
}
</syntaxhighlight>
</syntaxhighlight>


<syntaxhighlight lang="JavaScript">
* partition key: resultId
{
* sort key: dataId
configKey: "GraphTag"
 
configTag: "possiblyTranslatesTo"
= Graph database =  
configValue: xxx // eg: "possiblyTranslatesTo"
 
}
== [[Service - Translations Graph]] ==
</syntaxhighlight>
 
* {textTag} is the name of the text being translated, eg a Catalog subject node will have a textTag "catalogName"
 
=== Nodes ===


<syntaxhighlight lang="JavaScript">
<syntaxhighlight lang="JavaScript">
{
{
configKey: "GraphTag"
nodeLabel: "{TranslationSharedLib.translationLinkNodeLabel()}", //eg: translationLink
configTag: "translatesTo"
schema: {
configValue: xxx // eg: "translatesTo"
identifier: true,
restrictProperties: true,
restrictRelationships: true,
properties: {
translationLinkId: {
identifier: true, // create unique id from request details
}
languageId: {
immutable: true,
}
textTag: {
immutable: true, // tag of what text is being translated
}
weight: {
}
},
}
}
}
</syntaxhighlight>
</syntaxhighlight>
* creates a link between a subject node and a translation text
* when recalculating current translation for a languageCode we add the calculated weighted value to this node as a property


<syntaxhighlight lang="JavaScript">
<syntaxhighlight lang="JavaScript">
{
{
configKey: "GraphTag"
nodeLabel: {TranslationSharedLib.TRANSLATION_GRAPH_NODE_LABEL}, //eg: translation
configTag: "defaultTranslatesTo"
schema: {
configValue: xxx // eg: "defaultTranslatesTo"
identifier: true,
restrictProperties: true,
restrictRelationships: true,
properties: {
text: {
identifier: true,
},
},
}
}
}
</syntaxhighlight>
</syntaxhighlight>


= Neptune graph database =  
==== (subject nodes) ====


== [[Service - Translations Graph]] ==
Subject node schemas are managed by each service that needs translations, normally as a basic schema with identifier properties only.


=== Vertices ===
* nodeIdentifierLabels: matches that specific object being translated, eg: catalog
* nodeIdentifierProperties: matches that specific object being translated, eg: catalogId
* nodeProperties: Can store additional properties, not set by translation service
* node schema should set identifier = true, immutable = true (which includes elementCanBeRemoved = false)


==== (subject vertices) ====
=== Relationships ===


vertexId: {tag}_{uniqueId} (not set by translation service)
<syntaxhighlight lang="JavaScript">
{
relationshipType: "{TranslationSharedLib.translationLinkHasRelType()", // eg: has_translationLink
schema: {
immutable: true,
restrictProperties: true,
properties: {
originTimestamp: //timestamp the request to create/change this relationship was sent
},
}
}
</syntaxhighlight>
* every translationLink will have this relationship
* is never removed, but those with low weighted links can be ignored over time


Properties: (not set by translation service)
<syntaxhighlight lang="JavaScript">
{
relationshipType: "{TranslationSharedLib.translationLinkCurrentRelType()}", // eg: current_translationLink
schema: {
elementCanBeRemoved: true,
restrictProperties: true,
properties: {
originTimestamp: //timestamp the request to create/change this relationship was sent
},
}
}
</syntaxhighlight>
* the currently used translationLink for the language the link points to
* only one should exist per subject node and textTag/languageCode combination, but each language for each textTag will have it's own current relationship
* this relationship will not exist for languages that have no translations
* can be removed/added when RecalculateCurrentTranslation


==== translation ====
<syntaxhighlight lang="JavaScript">
{
relationshipType: "{TranslationSharedLib.translationLinkDefaultRelType()}", // eg: default_translationLink
schema: {
elementCanBeRemoved: true,
restrictProperties: true,
properties: {
originTimestamp: //timestamp the request to create/change this relationship was sent
},
}
}
</syntaxhighlight>
* sets the default translationLink to use when no translationLink for the requested language/s exist
* can be changed but each subject/textTag must have 1
* initially set to the first translation created, later can move it around eg to English if English gets added later
* could create logic that goes through a sorted list of languages and applies the first languageCode found as the default


vertexId: translation_{uuid}
<syntaxhighlight lang="JavaScript">
{
relationshipType: "{TranslationSharedLib.isTranslationDefaultRelType()}",
schema: {
elementCanBeRemoved: false,
allPropertiesImmutable: true,
restrictProperties: true,
properties: {
originTimestamp: //timestamp the request to create/change this relationship was sent
},
}
}
</syntaxhighlight>


Properties:
= Complex Filter requests =


# translation
<syntaxhighlight lang="JavaScript">
#: the text of the translation
{
 
filterType: "XXX" // up to calling service
=== Edges ===
type: "group",
 
elements:
==== possiblyTranslatesTo ====
[
 
{
* All possible translations are linked to their subjectId with an edge of this type
type: "logical",
* A users support for a translation is connected to this edge (or to the translation vertex if unable to link to an edge)
logicalTag: "textTag_languageId_text",
* When recalculating current translation for a language we add the calculated weighted value for each possible translation to this edge as a property
resultType: "mediaLinkProperty"
* This type of edge is never removed, but those with low weights can be ignored over time
textTag: "mediaLinkPropertyValue",
 
languageId: "en",
==== translatesTo ====
text: "Blue",
 
subjectIdentifierPropertyName: "propertyId",
* Matches one possible translation as the currently used translation for one language
caseSensitive: true
* Only one should exist per subjectId per language
},
* Languages that have no translations will not have one
]
* Can be removed and replaced when RecalculateCurrentTranslation
}
 
</syntaxhighlight>
==== defaultTranslatesTo ====
- searches for specific and full text, optional case sensitive
 
- finds an identifier property on the subjects node and stores in LogicalResults for the request
* Used when no desired language translation exists
- resultType must exist in request because want it to match filterType and Translation has no way of knowing filterType of request
* Initially set to the first translation created
* Only one should exist per subjectId
* Can create admin logic that goes through a sorted list of languages and applies the first language found as the default


= SQS queues =
= SQS queues =
Line 101: Line 188:
== RecalculateCurrentTranslation ==
== RecalculateCurrentTranslation ==


Add to this queue the translationLabel, languageCode, subjectId, translationId:
Add to this queue the subject nodeIdentifierLabels, subject nodeIdentifierProperties, languageCode


* translationLabel: Name for the entity being translated, eg: ''categoryName'', is used to create vertex and edge labels in Translations graph database
* subject nodeIdentifierLabels
* subject nodeIdentifierProperties
* languageCode: see below
* languageCode: see below
* subjectId: is the vertex id in the Translations graph database for the entity that has a relationship with the translation, eg catalogCategory{unique identifier}
* translationId: is the vertex id in the Translations graph database for the translation, eg translation{random uuid}


This queue does not have a Lambda trigger, we could poll it when resource costs really cheap as it is low importance (and/or have an API endpoint that polls and processes a batch).
This queue does not have a Lambda trigger, we could poll it when resource costs really cheap as it is low importance (and/or have an API endpoint that polls and processes a batch).
Line 113: Line 199:


Considering using ISO 639-3 codes and designing a way to substring them to automatically go up the hierarchy if no lower level variants match, an alternative would be to allow users to create ordered lists of preferred translations and share these.
Considering using ISO 639-3 codes and designing a way to substring them to automatically go up the hierarchy if no lower level variants match, an alternative would be to allow users to create ordered lists of preferred translations and share these.
= Translation graph relationships/edges =
Links between possible translations and the object they are translating are stored in the Translations graph database as edges. Each possible translation is a vertex.


= How translations are found for users =
= How translations are found for users =


Plan is to allow users to create ordered lists of prefered languages (and perhaps automatic translating as a last option?), these lists can be shared, and new users are automatically set to a list depending on their location when signing up.
Plan is to allow users to create ordered lists of prefered languages (and perhaps optionally automatic translating as a last option?), and new users are automatically set to a list depending on their location when signing up.


For each text to translate work through the list and find the first matching translation, if none found fall back onto the TranslationDefault option.
For each text to translate: work through the list and find the first matching translation, if none found fall back onto the default option.


Will need to cache results for efficient resource use.
cache results for efficient resource use.


= System text translations =
= System text translations =


System text follows the same Tag + UniqueId system to identify specific translation subjects (one system text output), these are set in npm modules.
System text follows the same Label + UniqueIdProperty system to identify specific translation subjects (one system text output), the labels and unique ids can set in npm modules.


'''Tag''' example: [[NPM module - izara-shared#SharedLib]].constants.systemTextGraphTag
* Label example: hard coded or as a constant in NavBar service: "sysTxtNavBar"
* UniqueIdProperty example: "sysTxtTag", value: "SignOut" (can set as a constant in NavBar service)


'''UniqueId''' example: [[NPM module - izara-market-shared#CatalogManagerLib]].constants.CatalogStandardServiceNameGraphId
* Label example: [[NPM module - izara-market-shared#CatalogManagerLib]].constants.sysTxtCatalogManager ("sysTxtCatalogManager")
* UniqueIdProperty example: "CatalogName", value: [[NPM module - izara-market-shared#CatalogManagerLib]].constants.CatalogStandard.catalogName


= Working documents =
= Working documents =

Latest revision as of 15:54, 17 September 2021

Overview

Service manages language translations.

Repository

https://bitbucket.org/stb_working/translations/src/master/

DynamoDB tables

Standard Config Table Per Service

Configuration tags

{
	configKey: "TranslationGraphServiceName"
	configTag: "TranslationGraphServiceName"
	configValue: xxx // eg: "TranslationGraph"
}

LogicalResults

Stores results for any requests to perform logical searches on media links

{
	resultId: xxx // eg: filterMainId for a single logical element
	dataId: xxx // one translationLinkId or one subject nodes identifier (the logical request will set what property should be used)
}
  • partition key: resultId
  • sort key: dataId

Graph database

Service - Translations Graph

  • {textTag} is the name of the text being translated, eg a Catalog subject node will have a textTag "catalogName"

Nodes

{
	nodeLabel: "{TranslationSharedLib.translationLinkNodeLabel()}", //eg: translationLink
	schema: {
		identifier: true,
		restrictProperties: true,
		restrictRelationships: true,
		properties: {
			translationLinkId: {
				identifier: true, // create unique id from request details
			}
			languageId: {
				immutable: true,
			}
			textTag: {
				immutable: true, // tag of what text is being translated
			}
			weight: {
			}
		},
	}
}
  • creates a link between a subject node and a translation text
  • when recalculating current translation for a languageCode we add the calculated weighted value to this node as a property
{
	nodeLabel: {TranslationSharedLib.TRANSLATION_GRAPH_NODE_LABEL}, //eg: translation
	schema: {
		identifier: true,
		restrictProperties: true,
		restrictRelationships: true,
		properties: {
			text: {
				identifier: true,
			},
		},
	}
}

(subject nodes)

Subject node schemas are managed by each service that needs translations, normally as a basic schema with identifier properties only.

  • nodeIdentifierLabels: matches that specific object being translated, eg: catalog
  • nodeIdentifierProperties: matches that specific object being translated, eg: catalogId
  • nodeProperties: Can store additional properties, not set by translation service
  • node schema should set identifier = true, immutable = true (which includes elementCanBeRemoved = false)

Relationships

{
	relationshipType: "{TranslationSharedLib.translationLinkHasRelType()", // eg: has_translationLink
	schema: {
		immutable: true,
		restrictProperties: true,
		properties: {
			originTimestamp: //timestamp the request to create/change this relationship was sent
		},
	}
}
  • every translationLink will have this relationship
  • is never removed, but those with low weighted links can be ignored over time
{
	relationshipType: "{TranslationSharedLib.translationLinkCurrentRelType()}", // eg: current_translationLink
	schema: {
		elementCanBeRemoved: true,
		restrictProperties: true,
		properties: {
			originTimestamp: //timestamp the request to create/change this relationship was sent
		},
	}
}
  • the currently used translationLink for the language the link points to
  • only one should exist per subject node and textTag/languageCode combination, but each language for each textTag will have it's own current relationship
  • this relationship will not exist for languages that have no translations
  • can be removed/added when RecalculateCurrentTranslation
{
	relationshipType: "{TranslationSharedLib.translationLinkDefaultRelType()}", // eg: default_translationLink
	schema: {
		elementCanBeRemoved: true,
		restrictProperties: true,
		properties: {
			originTimestamp: //timestamp the request to create/change this relationship was sent
		},
	}
}
  • sets the default translationLink to use when no translationLink for the requested language/s exist
  • can be changed but each subject/textTag must have 1
  • initially set to the first translation created, later can move it around eg to English if English gets added later
  • could create logic that goes through a sorted list of languages and applies the first languageCode found as the default
{
	relationshipType: "{TranslationSharedLib.isTranslationDefaultRelType()}",
	schema: {
		elementCanBeRemoved: false,
		allPropertiesImmutable: true,
		restrictProperties: true,
		properties: {
			originTimestamp: //timestamp the request to create/change this relationship was sent
		},
	}
}

Complex Filter requests

{
	filterType: "XXX" // up to calling service
	type: "group",
	elements: 
	[
		{
			type: "logical",
			logicalTag: "textTag_languageId_text",
			resultType: "mediaLinkProperty"
			textTag: "mediaLinkPropertyValue",
			languageId: "en",
			text: "Blue",
			subjectIdentifierPropertyName: "propertyId",
			caseSensitive: true
		},
	]
}

- searches for specific and full text, optional case sensitive - finds an identifier property on the subjects node and stores in LogicalResults for the request - resultType must exist in request because want it to match filterType and Translation has no way of knowing filterType of request

SQS queues

RecalculateCurrentTranslation

Add to this queue the subject nodeIdentifierLabels, subject nodeIdentifierProperties, languageCode

  • subject nodeIdentifierLabels
  • subject nodeIdentifierProperties
  • languageCode: see below

This queue does not have a Lambda trigger, we could poll it when resource costs really cheap as it is low importance (and/or have an API endpoint that polls and processes a batch).

Language codes

Considering using ISO 639-3 codes and designing a way to substring them to automatically go up the hierarchy if no lower level variants match, an alternative would be to allow users to create ordered lists of preferred translations and share these.

How translations are found for users

Plan is to allow users to create ordered lists of prefered languages (and perhaps optionally automatic translating as a last option?), and new users are automatically set to a list depending on their location when signing up.

For each text to translate: work through the list and find the first matching translation, if none found fall back onto the default option.

cache results for efficient resource use.

System text translations

System text follows the same Label + UniqueIdProperty system to identify specific translation subjects (one system text output), the labels and unique ids can set in npm modules.

  • Label example: hard coded or as a constant in NavBar service: "sysTxtNavBar"
  • UniqueIdProperty example: "sysTxtTag", value: "SignOut" (can set as a constant in NavBar service)

Working documents

Working_documents - Translations