beam-commits mailing list archives

From "Chamikara Jayalath (JIRA)" <j...@apache.org>
Subject [jira] [Created] (BEAM-2294) Initial size estimation fails for mobile gaming examples for DataflowRunner when run in Windows
Date Sun, 14 May 2017 23:37:04 GMT
Chamikara Jayalath created BEAM-2294:
----------------------------------------

             Summary: Initial size estimation fails for mobile gaming examples for DataflowRunner when run in Windows
                 Key: BEAM-2294
                 URL: https://issues.apache.org/jira/browse/BEAM-2294
             Project: Beam
          Issue Type: Bug
          Components: sdk-py
            Reporter: Chamikara Jayalath
            Priority: Minor


This happens for the pipelines user_score.py and hourly_team_score.py.

Jobs complete successfully, but initial size estimation fails with the following error.

INFO:root:Could not estimate size of source <apache_beam.io.textio._TextSource object at 0x03E34FF0> due to an exception: Traceback (most recent call last):
  File "C:\Users\chamikara\pythontest1\rc4_test\env_rc4_1\lib\site-packages\apache_beam\runners\dataflow\dataflow_runner.py", line 554, in run_Read
    transform.source.estimate_size())
  File "C:\Users\chamikara\pythontest1\rc4_test\env_rc4_1\lib\site-packages\apache_beam\internal\gcp\json_value.py", line 59, in get_typed_value_descriptor
    raise TypeError('Cannot get a type descriptor for %s.' % repr(obj))
TypeError: Cannot get a type descriptor for 23899840340L.

This seems to be due to a combination of Python on Windows returning a long for a value of this size [1] and the SDK not handling the long type at [2].

[1] http://stackoverflow.com/questions/22513445/python-handles-long-ints-differently-on-windows-and-unix

[2] https://github.com/apache/beam/blob/master/sdks/python/apache_beam/internal/gcp/json_value.py#L35
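
A minimal sketch of one way the type check could accept long values, assuming the failure comes from an isinstance test that only covers int. The function describe_type and the descriptor names it returns are hypothetical stand-ins for illustration, not the actual code in json_value.py. Checking against numbers.Integral covers both int and long on Python 2 (and int on Python 3), so a value such as 23899840340 would no longer raise.

```python
import numbers


def describe_type(obj):
    """Hypothetical stand-in for a type-descriptor lookup (not Beam's code)."""
    # bool is a subclass of int, so it must be checked before Integral.
    if isinstance(obj, bool):
        return 'boolean_value'
    # numbers.Integral matches int, and also long on Python 2, so large
    # values reported as long on Windows are handled the same way as int.
    if isinstance(obj, numbers.Integral):
        return 'integer_value'
    if isinstance(obj, float):
        return 'float_value'
    if isinstance(obj, str):
        return 'string_value'
    raise TypeError('Cannot get a type descriptor for %s.' % repr(obj))


# The size from the traceback above is now classified as an integer.
print(describe_type(23899840340))
```

Whether the real fix should treat long exactly like int depends on what the consumer of the descriptor expects, which is not stated in this report.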


cc: [~altay]



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)
