rlei [Tue, 7 Aug 2018 09:43:23 +0000 (17:43 +0800)]
HAWQ-1647. Update HAWQ version to 2.4.0.0
- Update HAWQ version from 2.3.0.0 to 2.4.0.0.
- Removed incubating/incubator strings from HAWQ versions.
Ben Christel [Thu, 26 Jul 2018 18:43:49 +0000 (11:43 -0700)]
HAWQ-1622. Cache UGI objects and clean them periodically
Co-authored-by: Lav Jain <ljain@pivotal.io>
Co-authored-by: Ben Christel <bchristel@pivotal.io>
Co-authored-by: Alex Denissov <adenissov@pivotal.io>
Co-authored-by: Shivram Mani <smani@pivotal.io>
Co-authored-by: Francisco Guerrero <aguerrero@pivotal.io>
Co-authored-by: Divya Bhargov <dbhargov@pivotal.io>
oushu1wangziming1 [Thu, 5 Jul 2018 02:12:27 +0000 (10:12 +0800)]
HAWQ-1636. Fix compile apache hawq failure due to unsupported syntax in libyarn on osx 10.11
interma [Wed, 20 Jun 2018 05:02:44 +0000 (13:02 +0800)]
HAWQ-1627. Support setting the max protocol message size when talking
with HDFS
Sergei Lebedev [Tue, 12 Jun 2018 17:55:02 +0000 (19:55 +0200)]
HAWQ-1624. Change libhdfs3 to be ABI compatible with libhdfs
Weinan Wang [Fri, 29 Jun 2018 05:29:57 +0000 (13:29 +0800)]
HAWQ-1633. Add parameter for maven package hawq-hadoop
interma [Wed, 30 May 2018 06:26:01 +0000 (14:26 +0800)]
HAWQ-1618. Segment panic at workfile_mgr_close_file() when transaction ROLLBACK
Weinan Wang [Wed, 6 Jun 2018 04:37:29 +0000 (12:37 +0800)]
HAWQ-1619. Fix Vectorized Execution bugs
Ivan Leskin [Thu, 8 Feb 2018 13:11:19 +0000 (16:11 +0300)]
HAWQ-1599. Add PXF-Ignite plug-in.
(close #1344)
The PXF-Ignite plug-in allows to read data from Ignite database.
The syntax is similar to one of PXF-JDBC plug-in.
This implementation uses Ignite REST API. It can be easily modified to access any other databases or services via REST API.
This commit does not change `pxf-service` links and classpath files. They should be added manually in order for the PXF-Ignite to work.
See pxf-ignite/README.md for instructions on how to use this plug-in.
PXF Ignite plug-in: Implement write access
This commit adds the support of write operations to the PXF Ignite plug-in.
PXF Ignite plug-in: tests and fixes
PXF Ignite plug-in: fixes proposed by contributors
PXF Ignite plug-in: Fixes proposed by @hornn, @denalex
PXF Ignite plug-in: Fixed nullptr exception and invalid bufferWrite operations
PXF Ignite plug-in: Fixes proposed by @denalex, @sansanichfb
Use 'apache.commons.compress.utils.ByteUtils' instead of a custom 'ByteUtil' library to process fragment metadata;
This also removes the dependence of 'apache.commons.lang.ArrayUtils'.
This fix was proposed by @denalex.
Change signature of the 'buildFragmenterSql()' function in 'IgnitePartitionFragmenter', so that it returns void.
This fix was proposed by @sansanichfb.
PXF Ignite plug-in: minor codestyle improvements
PXF Ignite plug-in: README.md update
PXF Ignite plug-in: Fix handling of TIMESTAMP
Fix handling of TIMESTAMP values with milli- and microseconds
SimpleDateFormat objects are now ThreadLocal (due to safety reasons)
Divya Bhargov [Thu, 31 May 2018 18:20:27 +0000 (11:20 -0700)]
HAWQ-1621. Call FileSystem.closeAllForUGI at the end of request when PXF impersonation is on
Authored-by: Divya Bhargov <dbhargov@pivotal.io>
(close #1374)
Weinan Wang [Wed, 30 May 2018 08:49:08 +0000 (16:49 +0800)]
HAWQ-1508. fix travis broken
Wen Lin [Thu, 31 May 2018 06:52:16 +0000 (14:52 +0800)]
HAWQ-1620. Push down target list information(pi_targetlist in structure ProjectionInfo) to scan when create Bloomfilter structure.
Wen Lin [Mon, 28 May 2018 01:40:44 +0000 (09:40 +0800)]
HAWQ-1616. Fix the wrong result of hash join when enable Bloom filter,
because the projection information of join keys hasn't been pushed down to parquet scan correctly.
Wen Lin [Mon, 21 May 2018 07:44:36 +0000 (15:44 +0800)]
HAWQ-1615. Fix accessing invalid memory when run a hash-join query with Bloomfilter enable. The BloomFilter structure in RuntimeFilterState should be allocated, instead of using the address of HashJoinTable's BloomFilter, since it may be released when function FreeScanRuntimefilterState() tries to access it.
Shujie Zhang [Tue, 15 May 2018 04:39:04 +0000 (12:39 +0800)]
HAWQ-1612. Implement vectorized aggregate functions, such as avg/sum/count.
Wen Lin [Mon, 14 May 2018 09:39:38 +0000 (17:39 +0800)]
HAWQ-1608. Implement Printing Runtime Filter Information For "explain analyze". Change GUC hawq_hashjoin_bloomfilter to bool.
Michael Goddard [Fri, 11 May 2018 18:27:35 +0000 (11:27 -0700)]
Updates to PXF Parquet to support S3 eventually
Weinan Wang [Tue, 8 May 2018 05:01:19 +0000 (13:01 +0800)]
HAWQ-1613. Create Date based Vectorized type and relevant expression
Wen Lin [Wed, 9 May 2018 09:05:46 +0000 (17:05 +0800)]
HAWQ-1607. This commit implements applying Bloom filter during Scan outer table.
1. Pash down Bloom filter structure to outer table scan(only support parquet);
2. Check if the tuple from outer table is found in Bloom filter structure.
3. Add a GUC hawq_hashjoin_bloomfilter_sampling_number. This guc value controls the Bloom filter sampling number, while scanning outer table, for first N tuples of the outer table, if the ratio is larger than hawq_hashjoin_bloomfilter_ratio, the remain tuples will not be checked by Bloom filter.
4. If there is any expression on outer join keys except T_Var(projection), such as, fact.c1 + 1 = dim.c1, or if there are multiple join keys, e.g. fact.c1 = dim.c1 and fact.c2 = dim.c2, Bloomfilter won't be created. Since these two cases involve pushing down expression and project information to scan, which will be implemented later.
Weinan Wang [Fri, 4 May 2018 06:50:18 +0000 (14:50 +0800)]
HAWQ-1609. Update regression test configuration
Weinan Wang [Tue, 24 Apr 2018 07:38:12 +0000 (15:38 +0800)]
HAWQ-1609. Implement Vectorized Motion Node
Shujie Zhang [Thu, 26 Apr 2018 06:52:03 +0000 (14:52 +0800)]
HAWQ-1611. refactor the vtype in order to advance the performance (by Wang Weinan)
Shujie Zhang [Wed, 4 Apr 2018 08:10:07 +0000 (16:10 +0800)]
HAWQ-1603. Add new hook API
Wen Lin [Wed, 18 Apr 2018 03:57:25 +0000 (11:57 +0800)]
HAWQ-1606. Fix "make unittest-check" error and set GUC error
Wen Lin [Tue, 17 Apr 2018 06:05:23 +0000 (14:05 +0800)]
HAWQ-1606. This commit implements deciding to create Bloom Filter during query plan and create Bloom filter for inner table, including:
1. Introduce a GUC, hawq_hashjoin_bloomfilter_max_memory_size, controls the maximum memory size for one bloom filter in hash join.
2. Introduce a GUC, hawq_hashjoin_bloomfilter_ratio, when the ratio of (the estimated number of hash join tuples)/(number of tuples of outer table) is lower than the GUC, then Bloom filter can be used in hash join.
3. Decide whether to create Bloom filter during query plan phase.
4. During query execution phase, create Bloom filter structure and poputlate it for tuples from inner table.
Wen Lin [Sun, 8 Apr 2018 10:10:12 +0000 (18:10 +0800)]
HAWQ-1604. Add A New GUC hawq_hashjoin_bloomfilter to indicate if use Bloom filter for hash join.
Remove gp_hashjoin_bloomfilter and bloom filter in hash join table, this part of legacy codes has been verified that it won't improve hash join performance.
Weinan Wang [Fri, 30 Mar 2018 08:45:51 +0000 (16:45 +0800)]
HAWQ-1602. AO table data vectorized scan
Weinan Wang [Fri, 30 Mar 2018 02:05:56 +0000 (10:05 +0800)]
HAWQ-1600. Parquet table data vectorized scan
Weinan Wang [Mon, 26 Mar 2018 10:07:21 +0000 (18:07 +0800)]
HAWQ-1598. Vectorized Scan Node Framework initialization
Shujie Zhang [Mon, 19 Mar 2018 08:14:59 +0000 (16:14 +0800)]
HAWQ-1593. Vectorized execution condition check in plan tree
Weinan Wang [Fri, 16 Mar 2018 06:31:45 +0000 (14:31 +0800)]
HAWQ-1591. Common tuple batch structure for VecExe
Shujie Zhang [Wed, 14 Mar 2018 07:11:28 +0000 (15:11 +0800)]
HAWQ-1592. Vectorized data types initialization and relevant functions definition
interma [Wed, 14 Mar 2018 10:23:19 +0000 (18:23 +0800)]
HAWQ-1594. Memory leak in standby master (gpsyncagent process)
Ruilong Huo [Wed, 21 Feb 2018 02:34:01 +0000 (10:34 +0800)]
HAWQ-1590. bump hawq version to 2.3 in contrib/hawq-ambari-plugin/build.properties for Apache HAWQ 2.3.0.0-incubating Release
Ruilong Huo [Tue, 20 Feb 2018 13:09:54 +0000 (21:09 +0800)]
HAWQ-1589. bump hawq version to 2.3 in pom.xml for Apache HAWQ 2.3.0.0-incubating Release
Weinan Wang [Mon, 29 Jan 2018 02:51:57 +0000 (10:51 +0800)]
HAWQ-1583. Vectorization execution framework init.
shivzone [Sat, 10 Feb 2018 01:53:24 +0000 (17:53 -0800)]
HAWQ-1587. Fix metadata parameters handling in PXF
Alexander Denissov [Fri, 8 Dec 2017 01:17:32 +0000 (17:17 -0800)]
HAWQ-1036. Implement user impersonation in PXF
(close #1339)
Yi [Thu, 8 Feb 2018 08:03:55 +0000 (19:03 +1100)]
HAWQ-1586. Update version from 2.2.0.0 to 2.3.0.0
Alex Diachenko [Wed, 7 Feb 2018 21:04:00 +0000 (13:04 -0800)]
HAWQ-1585. Fixed Javadoc generation warnings and errors.
shivzone [Wed, 7 Feb 2018 00:26:05 +0000 (16:26 -0800)]
HAWQ-1581. Separate PXF system parameters from user parameters
Lav Jain [Tue, 6 Feb 2018 22:31:09 +0000 (14:31 -0800)]
HAWQ-1584. Propogate bridge endIteration exception during write to client
Kuien Liu [Mon, 22 Jan 2018 08:38:04 +0000 (16:38 +0800)]
HAWQ-1557. Concurrent drop should not report error for drop IF EXISTS.
Alex Diachenko [Tue, 6 Feb 2018 03:08:33 +0000 (19:08 -0800)]
HAWQ-1575. Implemented readable Parquet profile for PXF.
Weinan Wang [Wed, 31 Jan 2018 08:47:04 +0000 (16:47 +0800)]
HAWQ-1514. TDE feature makes libhdfs3 require openssl1.1
Shubham Sharma [Fri, 15 Sep 2017 23:05:19 +0000 (16:05 -0700)]
HAWQ-1527 Added feature to enable partition filtering for integral data types
YoungForest [Wed, 24 Jan 2018 02:46:18 +0000 (10:46 +0800)]
HAWQ-1582. hawq ssh cmd bug when pipe in cmd
Lav Jain [Tue, 23 Jan 2018 19:04:44 +0000 (11:04 -0800)]
HAWQ-1580. Set PXF port number to 5888 for GPDB
Dmitriy Dorofeev [Tue, 16 Jan 2018 23:54:45 +0000 (15:54 -0800)]
HAWQ-1579. Fix pxf logging issue with null Metadata
Chiyang Wan [Fri, 5 Jan 2018 00:21:27 +0000 (08:21 +0800)]
HAWQ-1578. Regression Test (Feature->Ranger)Failed
Kuien Liu [Thu, 28 Dec 2017 08:16:17 +0000 (16:16 +0800)]
HAWQ-1125. Running pl/python related feature_test cases in parallel
Prior commit:
dc14ecbafc368a45cda5806ef7680ffe0825aef1
Kuien Liu [Fri, 15 Dec 2017 04:51:07 +0000 (12:51 +0800)]
HAWQ-1573. Clear debug_query_string in proc_exit to avoid crash
Backport following commit from PostgreSQL:
commit
e1eb7c81192bec3735eed3228202b400f31c8010
Author: Tom Lane <tgl@sss.pgh.pa.us>
Date: Sat Mar 20 00:58:21 2010 +0000
Report from Xiaowen Zheng <xiaowen.zxw@alibaba-inc.com>
Chiyang Wan [Tue, 2 Jan 2018 01:23:16 +0000 (09:23 +0800)]
HAWQ-1566. Include Pluggable Storage Format Framework in External Table Insert
Add the external table insert and copy from(write into external table) related feature here.
Chiyang Wan [Thu, 7 Dec 2017 15:05:40 +0000 (23:05 +0800)]
HAWQ-1565. Include Pluggable Storage Format Framework in External Table Scan
Rewrite the tuple construct and consume working flow in the external table, which leads to data copy cost.
Shubham Sharma [Fri, 15 Dec 2017 00:53:33 +0000 (16:53 -0800)]
HAWQ-1572 Fix travis ci build failure. Thrift/Boost incompatibility
Kyle Roberts [Sat, 2 Dec 2017 01:44:42 +0000 (17:44 -0800)]
HAWQ-1560. Changed log level to DEBUG1 for data locality messages
modified: src/backend/cdb/cdbdatalocality.c
shivzone [Wed, 13 Dec 2017 21:35:51 +0000 (13:35 -0800)]
Removed explicit serde check with HiveLineBreakAccessor
Chiyang Wan [Tue, 5 Dec 2017 01:57:02 +0000 (09:57 +0800)]
HAWQ-1564. Add Pluggable Storage Dependent Information
The info added are mainly about external URI, block location, file splits and formatter action.
Larry Hamel [Mon, 4 Dec 2017 22:00:16 +0000 (14:00 -0800)]
HAWQ-1563. Adapt to run PXF Makefile in /bin/sh
- Since each line of Makefile runs in its own shell, pushd and popd are not necessary
- Also, pushd is not supported in /bin/sh, default shell in Ubuntu
Signed-off-by: Goutam Tadi <gtadi@pivotal.io>
Chiyang Wan [Tue, 28 Nov 2017 01:16:48 +0000 (09:16 +0800)]
HAWQ-1555. Add access interfaces for protocol and format in pluggable storage framework
Kuien Liu [Thu, 30 Nov 2017 09:43:12 +0000 (17:43 +0800)]
HAWQ-1532. test cases on 'Asia' timezone
Kuien Liu [Thu, 19 Oct 2017 09:18:11 +0000 (17:18 +0800)]
HAWQ-1532. Fix conflict in Asia timezone
Timezone abbreviation 'IST' is multiply defined in Asia.txt, which
is backported from latest PostgreSQL. In this commit we fix this
issue and add a standard timezone set 'Asia' in OVERRIDE mode.
Tom Lane [Wed, 24 Dec 2014 21:35:23 +0000 (16:35 -0500)]
HAWQ-1532. recognize timezone 'CST' correctly in China
On some platforms, CST (China Standard Time) is used as time string
suffix of GMT+8, especially for users in China. I tracked that change in
Postgres to this commit:
commit
5b89473d870dc2a9fec0926c5afccf53042dbb0a
Author: Tom Lane <tgl@sss.pgh.pa.us>
Date: Wed Dec 24 16:35:23 2014 -0500
Add CST (China Standard Time) to our lists of timezone abbreviations.
For some reason this seems to have been missed when the lists in
src/timezone/tznames/ were first constructed. We can't put it in Default
because of the conflict with US CST, but we should certainly list it among
the alternative entries in Asia.txt. (I checked for other oversights, but
all the other abbreviations that are in current use according to the IANA
files seem to be accounted for.) Noted while responding to bug #12326.
Then, we may set timezone_abbreviations = 'Asia' to recognize CST
naturally.
Shubham Sharma [Tue, 21 Nov 2017 01:54:07 +0000 (17:54 -0800)]
HAWQ-1553 Add option to hawq extract to specify log directory
Shubham Sharma [Tue, 21 Nov 2017 01:28:44 +0000 (17:28 -0800)]
HAWQ-1368 Add option to hawq register to specify log directory
Shubham Sharma [Fri, 1 Dec 2017 01:19:06 +0000 (17:19 -0800)]
HAWQ-1559 - Fix travis ci build failure due to missing apr package
Shubham Sharma [Tue, 14 Nov 2017 03:02:26 +0000 (19:02 -0800)]
HAWQ-1549 Enable standby resync using -M smart/fast/immediate
interma [Mon, 6 Nov 2017 09:01:59 +0000 (17:01 +0800)]
HAWQ-1544. prompt file count doesn't match hash bucket number when reorganize table
lavjain [Wed, 8 Nov 2017 00:22:34 +0000 (16:22 -0800)]
HAWQ-1545. Fix pxf-private.classpath file for Cloudera tarball distro
Ed Espino [Wed, 4 Oct 2017 22:06:06 +0000 (15:06 -0700)]
HAWQ-1496. Add DISCLAIMER, LICENSE and NOTICE files into PXF rpm.
The files will be installed into /opt/pxf-${project.version} directory.
Yi [Wed, 8 Nov 2017 05:02:54 +0000 (16:02 +1100)]
HAWQ-1530. Illegally killing a JDBC select query causes locking problems
shivzone [Fri, 3 Nov 2017 23:12:16 +0000 (16:12 -0700)]
HAWQ-1543. Make pxf configurable upon restart through pxf-env
Alexander Denissov [Tue, 31 Oct 2017 19:22:50 +0000 (12:22 -0700)]
HAWQ-1542. Test fix for JDK 7 for PXF Demo profile test.
Alexander Denissov [Thu, 26 Oct 2017 18:44:10 +0000 (11:44 -0700)]
HAWQ-1542. PXF Demo profile should support write use case.
shivzone [Mon, 30 Oct 2017 23:36:13 +0000 (16:36 -0700)]
HAWQ-1541. Removed pxf configs from webapp
Alexander Denissov [Wed, 18 Oct 2017 20:57:03 +0000 (13:57 -0700)]
HAWQ-1540. PXF should not accept parameters for init command
shivzone [Thu, 19 Oct 2017 21:47:49 +0000 (14:47 -0700)]
HAWQ-1539. Adding metrics-core jar to PXF classpath for HBase
Oleksandr Diachenko [Tue, 10 Oct 2017 00:18:25 +0000 (17:18 -0700)]
HAWQ-1538. Install internal profiles definition file in conf directory.
Oleksandr Diachenko [Mon, 9 Oct 2017 22:46:54 +0000 (15:46 -0700)]
HAWQ-1537. Sort files explicitly in PartitionedJsonParserNoSeekTest.
lavjain [Fri, 6 Oct 2017 20:40:36 +0000 (13:40 -0700)]
HAWQ-1533. Fix compiler warnings when building PXF code
shivzone [Fri, 6 Oct 2017 20:21:32 +0000 (13:21 -0700)]
HAWQ-1535. Link PXF Library during install
Oleksandr Diachenko [Thu, 5 Oct 2017 20:40:15 +0000 (13:40 -0700)]
HAWQ-1534. Change user to PXF_USER only for HAWQ database.
Oleksandr Diachenko [Fri, 29 Sep 2017 19:33:50 +0000 (12:33 -0700)]
HAWQ-1531. Templetized PXF user and log directories.
Kuien Liu [Mon, 25 Sep 2017 08:40:54 +0000 (16:40 +0800)]
HAWQ-1529. Fix segment resource manager hang when postmaster died.
If PostmasterIsAlive() is under implicit declaration, %eax (32-bits) will be used for comparison rather than %al (8-bits), BUT PostmasterIsAlive() only set the lower 8-bits (because 'bool' is really a 'char'). Then segment resource manager will never exit after postmaster died.
Oleksandr Diachenko [Fri, 22 Sep 2017 18:26:49 +0000 (11:26 -0700)]
HAWQ-1526. Added support for functions in WHERE clause for PXF tables.
amyrazz44 [Thu, 14 Sep 2017 10:18:03 +0000 (18:18 +0800)]
HAWQ-1518. Add a UDF for showing whether the data directory is an encryption zone.
Shubham Sharma [Thu, 14 Sep 2017 23:24:27 +0000 (16:24 -0700)]
HAWQ-1198. Fix to filter out irrelevant fragments while accessing Hive.
wcl14 [Thu, 14 Sep 2017 06:35:24 +0000 (14:35 +0800)]
HAWQ-1525. Segmentation fault occurs if reindex database when loading data from Hive to HAWQ using hcatalog
Shubham Sharma [Mon, 3 Jul 2017 23:40:19 +0000 (16:40 -0700)]
HAWQ-1495 Updated Readme with a note to run tests parallely to avoid answer file mismatch
interma [Fri, 1 Sep 2017 07:57:34 +0000 (15:57 +0800)]
HAWQ-1520. Create filespace should also skip hdfs trash directory
Shubham Sharma [Tue, 18 Jul 2017 01:42:51 +0000 (18:42 -0700)]
HAWQ-1504 - Fixed namenode hang during docker container restart
Shubham Sharma [Thu, 31 Aug 2017 15:52:21 +0000 (08:52 -0700)]
HAWQ-1524 - Fix travis ci build failure after protobuf upgrade to 3.4
interma [Tue, 29 Aug 2017 07:46:24 +0000 (15:46 +0800)]
HAWQ-1520. gpcheckhdfs should skip hdfs trash directory
Lav Jain [Tue, 29 Aug 2017 23:15:08 +0000 (16:15 -0700)]
HAWQ-1523. Provide ability to bundle PXF artifacts into a single tarball
shivzone [Mon, 28 Aug 2017 19:20:08 +0000 (12:20 -0700)]
Update pxf classpath file to make it generic
amyrazz44 [Mon, 8 May 2017 09:27:07 +0000 (17:27 +0800)]
HAWQ-1458. Fix share input scan bug for writer part.
interma [Mon, 14 Aug 2017 08:30:45 +0000 (16:30 +0800)]
HAWQ-1510. Add TDE-related functionality into hawq init command
amyrazz44 [Fri, 11 Aug 2017 12:31:46 +0000 (22:31 +1000)]
HAWQ-1310. Reformat resource_negotiator().
Yi [Fri, 11 Aug 2017 12:03:33 +0000 (22:03 +1000)]
HAWQ-1498. Segments keep open file descriptors for deleted files
amyrazz44 [Wed, 9 Aug 2017 06:10:20 +0000 (14:10 +0800)]
HAWQ-1511. Add TDE-related properties into hdfs-client.xml