feat: allow to translate dataset text on chart #648

Ian2012 · 2024-03-11T17:26:56Z

Description

This PR allows translating text in datasets, such as fixed strings, by implementing a jinja filter that converts fixed text (e.g. 'audit') to its translation. It uses a CASE clause in the following format:

CASE
   WHEN {{column_name}} = {{fixed_string}} THEN {{translation}}
   ...
   ELSE {{column_name}}
END
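As a rough illustration of the CASE format above, here is a minimal standalone sketch. It is not the code from this PR: the `translations` dict is a hypothetical stand-in for the `get_translation` lookup, the sample Spanish string is only an example value, and the quote doubling is plain SQL escaping that a specific database may need to extend.

```python
def translate_column(column_name, translations):
    """Build a SQL CASE expression mapping fixed strings to translations.

    `translations` is a hypothetical dict of {source_string: translated_string}
    used here only for illustration.
    """
    if not translations:
        # Nothing to translate: fall back to the bare column.
        return column_name

    def quote(value):
        # Double single quotes so the value is a valid SQL string literal.
        return "'" + value.replace("'", "''") + "'"

    cases = "\n".join(
        f"    WHEN {column_name} = {quote(source)} THEN {quote(target)}"
        for source, target in translations.items()
    )
    return f"CASE\n{cases}\n    ELSE {column_name}\nEND"


print(translate_column("enrollment_mode", {"audit": "auditoría"}))
```

The ELSE branch keeps any value without a translation unchanged, so untranslated rows still show up in the chart.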

The jinja filter can be used like this in the chart dimensions: {% raw %}{{translate_column('enrollment_mode')}}{% endraw %}. You can set it up visually and export it (asset serialization will try to keep the proper format).

This is an example of a Spanish translation working.

This PR also updates some metric names to match what they mean.

openedx-webhooks · 2024-03-11T17:27:01Z

Thanks for the pull request, @Ian2012! Please note that it may take us up to several weeks or months to complete a review and merge your PR.

Feel free to add as much of the following information to the ticket as you can:

supporting documentation
Open edX discussion forum threads
timeline information ("this must be merged by XX date", and why that is)
partner information ("this is a course on edx.org")
any other information that can help Product understand the context for the PR

All technical communication about the code itself will be done via the GitHub pull request interface. As a reminder, our process documentation is here.

Please let us know once your PR is ready for our review and all tests are green.

bmtcril

Looks good overall, just some questions and thoughts on performance

bmtcril · 2024-03-11T18:04:41Z

tutoraspects/templates/aspects/apps/superset/pythonpath/openedx_jinja_filters.py

+    case_format = """CASE \n {cases} \n ELSE {column_name} \n END"""
+    single_case_format = "WHEN {column_name} = '{string}' THEN '{translation}'"
+    cases = "\n".join(
+        single_case_format.format(column_name=column_name, string=string, translation=get_translation(string, lang))


When get_translation is called, localization.py will load all of the translations for all languages into memory. I'm not sure how long that will stick around for, but loading a bunch of files off disk repeatedly might cause performance issues. I doubt we'll have so many strings that it will make much of a dent in server memory, but we might want to compare before and after this change to see.

With this change, it's using 347.1 MiB. Before, it was 307 MiB. We could load the file only when needed, but the disk operation is slower. The file content will stay in memory permanently once the first request asks for translations.
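For reference, the "load once, then keep in memory" behaviour discussed above can be sketched roughly as follows. This is only an illustration, not what localization.py does: `load_translations` and `lookup_translation` are hypothetical names, and the JSON file layout of `{lang: {string: translation}}` is an assumption.

```python
import json
from functools import lru_cache


@lru_cache(maxsize=None)
def load_translations(path):
    """Read the translations file once; later calls reuse the cached dict."""
    with open(path, encoding="utf-8") as handle:
        return json.load(handle)


def lookup_translation(string, lang, path):
    """Return the translation of `string` for `lang`, or `string` itself."""
    # Only the first call for a given path touches the disk.
    return load_translations(path).get(lang, {}).get(string, string)
```

The trade-off is the one described in this thread: the parsed file stays resident for the life of the process in exchange for avoiding repeated disk reads.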

SoryRawyer · 2024-03-12T17:46:53Z

tutoraspects/templates/aspects/apps/superset/docker/docker-bootstrap.sh

@@ -47,7 +47,7 @@ elif [[ "${1}" == "beat" ]]; then
  celery --app=superset.tasks.celery_app:app beat --pidfile /tmp/celerybeat.pid -l INFO -s "${SUPERSET_HOME}"/celerybeat-schedule
 elif [[ "${1}" == "app" ]]; then
  echo "Starting web app..."
-  flask run -p 8088 --with-threads --reload --debugger --host=0.0.0.0
+  flask run -p 8088 --with-threads --reload --debugger --debug --host=0.0.0.0


Is the --debug flag meant to stay in?

Yes, this will only affect the dev environment; local and k8s work the same as before.

SoryRawyer · 2024-03-12T17:54:50Z

I'm assuming the changes to the chart and dataset configurations are due to re-exporting charts that now have translated column names. Is that right?

Ian2012 · 2024-03-12T21:23:24Z

@SoryRawyer Yes. Also, some metrics were moved from the charts to the dataset and given proper names so they can be translated.

bmtcril

So excited to see this problem solved!

openedx-webhooks · 2024-03-13T15:36:44Z

@Ian2012 🎉 Your pull request was merged! Please take a moment to answer a two question survey so we can improve your experience in the future.

openedx-webhooks added the "open-source-contribution" label (PR author is not from Axim or 2U) Mar 11, 2024

bmtcril requested review from SoryRawyer and bmtcril March 11, 2024 17:41

Ian2012 added 7 commits March 11, 2024 12:53

chore: enable superset hot-reload

d6369e1

chore: omit more vars in charts

02eae45

feat: allow to translate dataset text on chart

5081220

chore: mark dataset strings for translations

42c2df3

fix: remove forced time_range filter on all charts

21de07d

chore: allow to test against dev branches

6f71c60

fix: remove raw expression from clickhouse url

859c901

Ian2012 force-pushed the cag/dataset-translation branch from 4284120 to 859c901 Compare March 11, 2024 17:53

bmtcril reviewed Mar 11, 2024

Ian2012 added 2 commits March 11, 2024 14:36

chore: remove localization logging

6eb71cc

fix: add SUPERSET_ENV variable

4c8c1a7

Base automatically changed from cag/translations to main March 11, 2024 19:44

Ian2012 added 2 commits March 11, 2024 15:09

fix: use openedx api to get language preference

6cc6c2e

chore: quality fixes

9541153

Ian2012 force-pushed the cag/dataset-translation branch from fe71f9c to 9541153 Compare March 11, 2024 20:43

SoryRawyer reviewed Mar 12, 2024

Ian2012 added 5 commits March 12, 2024 13:14

fix: bimount assets and translations

55b96b0

fix: translate enrollment events

2ad6530

fix: return column_name when there is not column translation

f567c9f

fix: dev environment assets

b8d39af

fix: replace chart metrics with dataset metrics

86f8e76

Ian2012 requested review from bmtcril and SoryRawyer March 12, 2024 21:23

bmtcril approved these changes Mar 13, 2024

Ian2012 merged commit 59ce454 into main Mar 13, 2024
10 checks passed

Ian2012 deleted the cag/dataset-translation branch March 13, 2024 15:36

edx-semantic-release mentioned this pull request Mar 13, 2024

chore: preparing release 0.84.0 #650

Merged
