Talend Data Prep / TDP-5244

Compatibility issue with Studio and Dataprep server when using tDataprepRun Spark

Details

    • Bug
    • Status: closed
    • Critical
    • Resolution: Fixed
    • None
    • 2.4.0 (Winter '18)
    • None
    • All
    • Sprint 23 TDP (feb 26), Sprint 24 TDP (mar 19)
    • Small

    Description

      Everything here applies only if you plan to use the tDataprepRun component in a Spark job.

      To work with Dataprep Cloud Winter, you'll need either:

      1. Studio 6.5 for Cloud (available on the download page...)
      2. A patched Studio 6.5

      To use Studio with both environments (Data Prep on-prem and cloud), you'll need two Studio installations:

      1. one patched for the Cloud
      2. one not patched for on-prem

      Steps to reproduce the problem:

      • Create a Spark job.
      • Create an input and pass its data into a tDataprepRun component.
      • Use a preparation that has a step relying on DQ analytics, such as "delete the rows with invalid cells".
      • Check the output.

      Run the job. You should get the following error:

      "org.talend.dataprep.actions.RemoteResourceGetter$RemoteConnectionException: Unable to retrieve dictionaries."

            People

              Assignee: Unassigned
              Reporter: stephane mallet (smallet)