spark-dev mailing list archives

From "Nicholas Marion" <>
Subject RE: [VOTE] Release Spark 2.4.7 (RC1)
Date Wed, 19 Aug 2020 16:11:17 GMT

It appears all 3 issues slated for Spark 2.4.7 have been merged. Should we
be looking at getting RC2 ready?

 NICHOLAS T. MARION
 IBM Open Data Analytics for z/OS - CPO and Service Team Lead
 Phone: 1-845-433-5010 | Tie-Line: 293-5010
 2455 South Rd
 Poughkeepsie, New York 12601-5400
 United States

From:	Xiao Li <>
To:	Prashant Sharma <>
Cc:	Takeshi Yamamuro <>, dev
Date:	08/17/2020 11:33 AM
Subject:	[EXTERNAL] Re: [VOTE] Release Spark 2.4.7 (RC1)

got merged. This is to fix a correctness bug in DSV2 of Spark 2.4. Please
include it in the upcoming Spark 2.4.7 release.



On Sun, Aug 9, 2020 at 10:26 PM Prashant Sharma <>
  Thanks for letting us know. So this vote is cancelled in favor of RC2.

  On Sun, Aug 9, 2020 at 8:31 AM Takeshi Yamamuro <>
   Thanks for letting us know about the two issues above, Dongjoon.

   I've checked the release materials (signatures, tag, ...) and it looks
   fine, too.
   Also, I ran the tests on my local Mac (java 1.8.0) with the options
   `-Pyarn -Phadoop-2.7 -Phive -Phive-thriftserver -Pmesos -Pkubernetes
   and they passed.


   On Sun, Aug 9, 2020 at 11:06 AM Dongjoon Hyun <>
     Another instance is SPARK-31703, which was filed on May 13th; the PR
     arrived two days ago.

         [SPARK-31703][SQL] Parquet RLE float/double are read incorrectly
     on big endian platforms

     It seems that the patch is already ready in this case.
     I raised the priority of SPARK-31703 to `Blocker` for both Apache
     Spark 2.4.7 and 3.0.1.


     On Sat, Aug 8, 2020 at 6:10 AM Holden Karau <>
      I'm going to go ahead and vote -0 based on that, then.

      On Fri, Aug 7, 2020 at 11:36 PM Dongjoon Hyun <> wrote:
        Hi, All.

        Unfortunately, there is an on-going discussion about the new
        decimal correctness.

        Although we fixed one correctness issue at master and backported it
        partially to 3.0/2.4, it turns out that it needs more patches to be
        complete.

        Please see the following for the on-going discussion for both
        3.0/2.4.

            [SPARK-32018][SQL][3.0] UnsafeRow.setDecimal should set null
        with overflowed value

        I also confirmed that 2.4.7 RC1 is affected.


        On Thu, Aug 6, 2020 at 2:48 PM Sean Owen <> wrote:
          +1 from me. The same as usual. Licenses and sigs look OK, build
          passes tests on a standard selection of profiles.

          On Thu, Aug 6, 2020 at 7:07 AM Prashant Sharma <> wrote:
          > Please vote on releasing the following candidate as Apache
          > Spark version 2.4.7.
          > The vote is open until Aug 9th at 9AM PST and passes if a
          > majority +1 PMC votes are cast, with a minimum of 3 +1 votes.
          > [ ] +1 Release this package as Apache Spark 2.4.7
          > [ ] -1 Do not release this package because ...
          > To learn more about Apache Spark, please see

          > There are currently no issues targeting 2.4.7 (try project =
          > SPARK AND "Target Version/s" = "2.4.7" AND status in (Open,
          > Reopened, "In Progress"))
          > The tag to be voted on is v2.4.7-rc1 (commit
          > The release files, including signatures, digests, etc. can be
          > found at:
          > Signatures used for Spark RCs can be found in this file:
          > The staging repository for this release can be found at:

          > The documentation corresponding to this release can be found
          > The list of bug fixes going into 2.4.7 can be found at the
          > following URL:
          > This release is using the release script of the tag v2.4.7-rc1.
          > FAQ
          > =========================
          > How can I help test this release?
          > =========================
          > If you are a Spark user, you can help us test this release by
          > taking an existing Spark workload and running on this release
          > candidate, then reporting any regressions.
          > If you're working in PySpark you can set up a virtual env and
          > install the current RC and see if anything important breaks. In
          > Java/Scala, you can add the staging repository to your project's
          > resolvers and test with the RC (make sure to clean up the artifact
          > cache before/after so you don't end up building with an out-of-date
          > RC going forward).
          > ===========================================
          > What should happen to JIRA tickets still targeting 2.4.7?
          > ===========================================
          > The current list of open tickets targeted at 2.4.7 can be found
          > in JIRA by searching for "Target Version/s" = 2.4.7
          > Committers should look at those and triage. Extremely important
          > bug fixes, documentation, and API tweaks that impact compatibility
          > should be worked on immediately. Everything else please retarget to
          > an appropriate release.
          > ==================
          > But my bug isn't fixed?
          > ==================
          > In order to make timely releases, we will typically not hold the
          > release unless the bug in question is a regression from the
          > previous release. That being said, if there is something which is a
          > regression that has not been correctly targeted please ping me or a
          > committer to help target the issue.
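
For the resolver-based testing route described in the FAQ above, a minimal sbt sketch might look like the following. It is an illustration only: the resolver name and URL are placeholders (the staging repository link is not preserved in this message), and the Scala version and the choice of the `spark-sql` module are assumptions about the project being tested.

```scala
// build.sbt -- a sketch for compiling/testing a project against a Spark RC.

// ASSUMPTION: Scala 2.11 is used here; adjust to whatever your project
// already builds with.
scalaVersion := "2.11.12"

// ASSUMPTION: placeholder URL. Replace it with the staging repository URL
// given in the vote e-mail for the RC you want to test.
resolvers += "Apache Spark RC staging (placeholder)" at "https://example.invalid/spark-rc-staging/"

// ASSUMPTION: the RC artifacts are published under the final version
// number, so the dependency is declared as 2.4.7 rather than 2.4.7-rc1.
libraryDependencies += "org.apache.spark" %% "spark-sql" % "2.4.7"
```

Because the RC and the eventual release would share the same version string under that assumption, a locally cached copy of the RC artifacts can shadow the final ones, which is why the FAQ asks testers to clear the artifact cache before and after testing.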




   Takeshi Yamamuro

