hive-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Eugene Koifman (JIRA)" <>
Subject [jira] [Commented] (HIVE-19750) Initialize NEXT_WRITE_ID. NWI_NEXT on converting an existing table to full acid
Date Sun, 03 Jun 2018 23:53:00 GMT


Eugene Koifman commented on HIVE-19750:

This is very odd.  Test bot reports a failure:

Failing for the past 1 build (Since Failed#11481 )
Took 13 sec.
Error Message
{"writeid":0,"bucketid":536936448,"rowid":0} 1 2 file:/home/hiveptest/
java.lang.AssertionError: {"writeid":0,"bucketid":536936448,"rowid":0}	1	2	file:/home/hiveptest/
	at org.junit.Assert.assertTrue(
	at org.apache.hadoop.hive.ql.TestTxnCommands.testNonAcidToAcidConversion01(
	at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)

but {{TestTxnCommandsWithSplitUpdateAndVectorization.testNonAcidToAcidConversion01}} does
not exist.
 {{TestTxnCommandsWithSplitUpdateAndVectorization extends TestTxnCommands2}} not {{TestTxnCommands}}

> Initialize NEXT_WRITE_ID. NWI_NEXT on converting an existing table to full acid
> -------------------------------------------------------------------------------
>                 Key: HIVE-19750
>                 URL:
>             Project: Hive
>          Issue Type: Bug
>          Components: Transactions
>    Affects Versions: 3.0.0
>            Reporter: Eugene Koifman
>            Assignee: Eugene Koifman
>            Priority: Critical
>             Fix For: 3.1.0
>         Attachments: HIVE-19750.01.patch, HIVE-19750.02.patch
> Need to set this to a reasonably high value the the table.
> This will reserve a range of write IDs that will be treated by the system as committed.
> This is needed so that we can assign unique ROW__IDs to each row in files that already
exist in the table.  For example, if the value is initialized to the number of files currently
in the table, we can think of each file as written by a separate transaction and thus a free
to assign bucketProperty (BucketCodec) of ROW_ID in whichever way is convenient.
> it's guaranteed that all rows get unique ROW_IDs this way.

This message was sent by Atlassian JIRA

View raw message