Jacques Nadeau [Fri, 12 Sep 2014 15:09:01 +0000 (08:09 -0700)]
Roll to 0.5.0-incubating release version
Jacques Nadeau [Wed, 3 Sep 2014 16:30:56 +0000 (09:30 -0700)]
DRILL-1372: Separate category B licenses into their own directory in the binary distribution.
Adding a statement about the category B licensed software in the NOTICE files and adding the appropriate reference to MIT licensed source in the LICENSE file for the source distribution.
Jacques Nadeau [Wed, 3 Sep 2014 05:00:17 +0000 (22:00 -0700)]
DRILL-1369: Constrain code compilation cache.
Jacques Nadeau [Mon, 1 Sep 2014 05:37:41 +0000 (22:37 -0700)]
Freeze hive fmpp > freemarker plugin dependency.
Jacques Nadeau [Mon, 1 Sep 2014 05:28:00 +0000 (22:28 -0700)]
Updating fmpp so that is uses a specific version of the free marker plugin, avoiding build failures in situations where freemarker has a failed snapshot release.
Jacques Nadeau [Sun, 31 Aug 2014 17:54:08 +0000 (10:54 -0700)]
Update READMEs
Steven Phillips [Sun, 31 Aug 2014 06:03:34 +0000 (23:03 -0700)]
Fix partition bug in hive storage
Jacques Nadeau [Sun, 31 Aug 2014 03:24:06 +0000 (20:24 -0700)]
Decrease slice target to 100,000 records.
Steven Phillips [Sun, 31 Aug 2014 02:12:13 +0000 (19:12 -0700)]
Fix allocation errors and bug in external sort
Aman Sinha [Wed, 27 Aug 2014 20:32:03 +0000 (13:32 -0700)]
DRILL-1171: Create Drill's implementation of ReduceAggregatesRule, including a new CastHigh function.
DRILL-1342: Fix nullability handling of aggregate functions.
Jacques Nadeau [Sat, 30 Aug 2014 02:35:07 +0000 (19:35 -0700)]
Maven cleanup & pom fixes
Aditya Kishore [Fri, 29 Aug 2014 14:06:38 +0000 (19:36 +0530)]
DRILL-1366: HBaseRecordReader does not set rowcount correctly if vectors run out of memory in the middle of the row.
Jason Altekruse [Sat, 30 Aug 2014 02:45:22 +0000 (19:45 -0700)]
Fix for Parquet reader in larger files brought about by more aggressive memory management.
Hanifi Gunes [Sat, 30 Aug 2014 01:34:07 +0000 (18:34 -0700)]
DRILL-1365: output unified coordinates at profile view
Jason Altekruse [Mon, 25 Aug 2014 21:56:27 +0000 (14:56 -0700)]
DRILL-1307: add support for fixed binary columns in parquet reader.
DRILL-1314: Fix issue reading impala produced files
DRILL-1304: Regression selecting a single column from a parquet file.
Fixed issue with var length dictionary reading.
Reduced memory usage by freeing buffers after we finish reading a page (except for dictionary pages which need to be kept in memory until the entire row group has been read)
Rebased onto merge branch.
Successfully backed out the changes that had changed the structure of the nullable column readers. This re-introduced some redundancy but avoided a bug that was holding up the release. Ended up falling back on the higher level reader API, only in the case where we are reading a dictionary column and then the next page is not dictionary encoded. This can be fixed to use the optimized read instead, but it doesn't conform to the overall structure of the current reader and is a bit difficult to fix.
Sudheesh Katkam [Tue, 5 Aug 2014 16:51:42 +0000 (09:51 -0700)]
DRILL-1274: NOTICE and LICENSE for binary distribution (includes DRILL-1267).
Hanifi Gunes [Fri, 29 Aug 2014 23:51:28 +0000 (16:51 -0700)]
DRILL-1334: Make sure to copy operator id while cloning ParquetGroupScan
Steven Phillips [Tue, 19 Aug 2014 00:23:23 +0000 (17:23 -0700)]
DRILL-1329: External sort memory fixes
Jinfeng Ni [Fri, 29 Aug 2014 21:17:13 +0000 (14:17 -0700)]
DRILL-1299: Enable short circuit evaluation for case expressions in the run-time generated code.
Aditya Kishore [Fri, 29 Aug 2014 16:01:07 +0000 (21:31 +0530)]
DRILL-1346: Use HBase table size information to improve scan parallelization
Jacques Nadeau [Fri, 29 Aug 2014 21:50:52 +0000 (14:50 -0700)]
DRILL-1364: Remove extraneous old jersey-* jars from distribution.
Jacques Nadeau [Fri, 29 Aug 2014 21:09:12 +0000 (14:09 -0700)]
DRILL-1315: Allow complex zookeeper string for JDBC connections, same as ODBC.
Jacques Nadeau [Fri, 29 Aug 2014 05:58:31 +0000 (22:58 -0700)]
DRILL-1340: Add support for JDBC metadata.
Parth Chandra [Fri, 29 Aug 2014 05:23:56 +0000 (22:23 -0700)]
DRILL-1359 Fix reading and writing of repeated scalar types in Parquet
Jinfeng Ni [Thu, 28 Aug 2014 22:27:11 +0000 (15:27 -0700)]
DRILL-1265: Drill may incorrectly return null when evaluates an expression of repeated list. The fix is to correctly set offsets vectors for repeated list vector.
Jinfeng Ni [Wed, 6 Aug 2014 01:47:24 +0000 (18:47 -0700)]
DRILL-1258 : Fix compile error in generated code when Drill evaluates an expression returning a repeated list
Jinfeng Ni [Thu, 28 Aug 2014 14:21:06 +0000 (07:21 -0700)]
DRILL-1355: Ensure Drill optimizer will use storage plugin specific rules, when a new storage plugin is added.
Hanifi Gunes [Thu, 28 Aug 2014 17:21:07 +0000 (10:21 -0700)]
DRILL-1309: Implement ProjectPastFilterPushdown and update DrillScanRel cost model so that exclusive column so that star query is more expensive than exclusive column projection. Various fixes affecting record reaaders to handle `*` column as well as fixes to some test cases.
exclude parquet files from rat check
Jacques Nadeau [Thu, 28 Aug 2014 19:49:44 +0000 (12:49 -0700)]
Update to latest parquet zero copy version.
Mehant Baid [Wed, 27 Aug 2014 23:47:54 +0000 (16:47 -0700)]
DRILL-1354: Fix date truncate functions to use toString helper functions.
Minor cleanup: move the truncate logic in a separate template
Mehant Baid [Wed, 27 Aug 2014 21:27:31 +0000 (14:27 -0700)]
DRILL-1353: Fix decimal regressions
Jason Altekruse [Mon, 18 Aug 2014 23:57:16 +0000 (16:57 -0700)]
DRILL-1313: All text mode for json reader
Current implementation handles nulls that appear while in text mode differently depending if they appear in lists or maps. This allows for a null where a list or map is expected to act the same way it does without text mode enabled. For an expected map it just assumes that the field didn't exist, in which case the leaves below become null filled, and for a list it will default to showing an empty list.
If we are actually inside of a list, a null in JSON will be treated the same as the string "null", which improves over the previous behavior of just dropping the null value all together, as we do not support null values within any of the repeated primitive vectors currently.
Patch has been rebased on top of merge branch.
Jason Altekruse [Tue, 5 Aug 2014 22:44:42 +0000 (15:44 -0700)]
DRILL-1283: JSON project pushdown.
Allows for users to avoid reading columns of a JSON file, including those that include elements of JSON that drill does not currently support. This can be used to query a subset of an existing file while avoiding elements like schema changes in some columns or nulls in lists that are currently not compatible with Drill.
Patch was revised based on Hanifi's review comments, and then rebased off of the merge branch.
Parth Chandra [Thu, 28 Aug 2014 01:07:35 +0000 (18:07 -0700)]
DRILL-1352: C++ Client. Update the decoding of nullable value vectors to read a byte for every nullable bit.
Jacques Nadeau [Thu, 28 Aug 2014 00:53:55 +0000 (17:53 -0700)]
enable mapr profile hbase
Jacques Nadeau [Thu, 28 Aug 2014 00:31:59 +0000 (17:31 -0700)]
Update RPC version number to correspond to changes in nullable buffer from bitmap to bytes.
Parth Chandra [Thu, 12 Jun 2014 01:10:34 +0000 (18:10 -0700)]
DRILL-1021: Windows build
Jacques Nadeau [Wed, 27 Aug 2014 17:31:56 +0000 (10:31 -0700)]
DRILL-998: Limit amount of memory used by drill C++ client API
Jinfeng Ni [Sat, 21 Jun 2014 23:04:12 +0000 (16:04 -0700)]
DRILL-1335: Fix merge join operator when compare null against null value.
Jinfeng Ni [Tue, 26 Aug 2014 00:08:49 +0000 (17:08 -0700)]
DRILL-1337 : Fix incorrect query result for TPCH Q13. Stop pulling up local filter from an outer join condition.
Jinfeng Ni [Sat, 16 Aug 2014 00:21:38 +0000 (17:21 -0700)]
DRILL-595, DRILL-811, DRILL-1029: Support select * in view, subquery, table expression in with clause. Most fixes in Optiq.
Aditya Kishore [Wed, 27 Aug 2014 18:05:07 +0000 (23:35 +0530)]
Bundle the HBase 0.94 jar compiled with Hadoop 2 profile
+ Removed the HBase shaded jar module.
Mehant Baid [Tue, 26 Aug 2014 09:53:46 +0000 (02:53 -0700)]
Support for @inject in aggregation functions
Jacques Nadeau [Tue, 26 Aug 2014 05:30:37 +0000 (22:30 -0700)]
Fix initCap function to set outbound position correctly.
Jacques Nadeau [Tue, 26 Aug 2014 04:49:46 +0000 (21:49 -0700)]
Fix issues with Hive function generation to support DrillBuf
Aditya Kishore [Mon, 25 Aug 2014 18:18:18 +0000 (23:48 +0530)]
Fix test case failures in TestParquetWriter
* Use different table names for different test cases.
Aditya Kishore [Mon, 25 Aug 2014 15:40:36 +0000 (21:10 +0530)]
Set the BitVector memory to zero on allocation.
Aditya Kishore [Mon, 25 Aug 2014 15:39:16 +0000 (21:09 +0530)]
Fixed the index check in DrillBuf.
Jacques Nadeau [Wed, 9 Jul 2014 21:01:35 +0000 (14:01 -0700)]
Switch to DrillBuf
Add @Inject DrillBuf
Move comparison functions to memory sensitive ones
Add scalar replacement functionality for value holders
Simplify date parsing function
Add local compiled code caching
Jason Altekruse [Fri, 15 Aug 2014 16:31:19 +0000 (09:31 -0700)]
DRILL-982: Return nulls for non-existent columns in parquet reader.
Mehant Baid [Fri, 22 Aug 2014 00:31:33 +0000 (17:31 -0700)]
DRILL-1312: Use correct index to get partition vectors in ScanBatch
Aman Sinha [Fri, 22 Aug 2014 19:01:39 +0000 (12:01 -0700)]
DRILL-1132: WriterPrel should not care about collation trait of the child.
Aman Sinha [Mon, 18 Aug 2014 23:32:20 +0000 (16:32 -0700)]
DRILL-1310: Fix assertion in ProjectRecordBatch for certain types of star queries.
Mehant Baid [Fri, 22 Aug 2014 08:03:47 +0000 (01:03 -0700)]
DRILL-1322: Fix memory leak when all records in a batch are filtered
Jacques Nadeau [Sun, 24 Aug 2014 19:02:30 +0000 (12:02 -0700)]
Update TestViewSupport so that it doesn't have a null column.
Jacques Nadeau [Sun, 24 Aug 2014 17:45:01 +0000 (10:45 -0700)]
Turn off proguard verbose output.
Jacques Nadeau [Sun, 24 Aug 2014 17:04:47 +0000 (10:04 -0700)]
Fix issue introduced by DRILL-1202 where allocators are being closed after reporting success.
Update ScreenRoot to cleanup before returning success.
Update ScanBatch to cleanup reader in case of limit query to avoid memory leak in ParquetReader.
Update allocators so that we don't have memory leak when using debug options.
Update project record batch so that it doesn't try to return a released remainder.
Parth Chandra [Sun, 24 Aug 2014 04:48:04 +0000 (21:48 -0700)]
Pass correct uncompressed data size to BytesInput ctor in ComplexParquetReader
Jacques Nadeau [Sun, 24 Aug 2014 15:28:25 +0000 (08:28 -0700)]
Skip static fields in PojoDataType.
Parth Chandra [Wed, 20 Aug 2014 20:37:27 +0000 (13:37 -0700)]
Use Direct Memory in Parquet Writer
Steven Phillips [Mon, 18 Aug 2014 21:16:26 +0000 (14:16 -0700)]
DRILL-1281: Shaded hbase jar
Steven Phillips [Mon, 11 Aug 2014 18:58:43 +0000 (11:58 -0700)]
DRILL-1281: Use ByteBuffer read codepath in complex parquet reader
Parth Chandra [Mon, 28 Jul 2014 20:44:52 +0000 (13:44 -0700)]
DRILL-1281: Read into Direct Memory in Parquet Reader. Requires Hadoop 2.4 or above
Mehant Baid [Sun, 17 Aug 2014 00:15:33 +0000 (17:15 -0700)]
DRILL-1295: Select returns null for non pivot fields with a nested select
* Fix DrillTextRecordReader to handle the case when requested columns includes the entire group and a subset.
Aman Sinha [Mon, 4 Aug 2014 16:59:50 +0000 (09:59 -0700)]
DRILL-1293: Fix assertion when selecting star column from view that also has star column.
Patrick Wong [Wed, 13 Aug 2014 01:02:50 +0000 (18:02 -0700)]
DRILL-1292: Standalone JDBC jar is in distribution.
* The code was already there, it just needed to be uncommented.
Mehant Baid [Tue, 12 Aug 2014 01:04:30 +0000 (18:04 -0700)]
DRILL-1278: Fix selecting scalar field from a map with join clause.
Mehant Baid [Mon, 11 Aug 2014 18:37:06 +0000 (11:37 -0700)]
DRILL-1277: Rewrite compound identifiers in 'ON' clause as item operator
Sudheesh Katkam [Mon, 11 Aug 2014 23:54:42 +0000 (16:54 -0700)]
DRILL-1275: Added disclaimer file.
Aman Sinha [Thu, 14 Aug 2014 23:22:39 +0000 (16:22 -0700)]
DRILL-1241: Ensure that Limit produces at least 1 batch with the output schema.
Sudheesh Katkam [Mon, 11 Aug 2014 23:49:16 +0000 (16:49 -0700)]
DRILL-1240: Updated NOTICE to 2013-2014.
Hanifi Gunes [Wed, 13 Aug 2014 01:57:51 +0000 (18:57 -0700)]
DRILL-1205: group by causes access to DeadBuf
* Fix an issue that causes calls to splitAndTransferTo malfunction on repeated vector types
Sudheesh Katkam [Fri, 1 Aug 2014 23:28:39 +0000 (16:28 -0700)]
DRILL-1178: Double copy in setSafe for VariableLengthVectors
Sudheesh Katkam [Tue, 12 Aug 2014 17:07:44 +0000 (22:37 +0530)]
DRILL-1163: Estimate memory and re-planning query + system options UI.
Mehant Baid [Sat, 16 Aug 2014 00:04:41 +0000 (17:04 -0700)]
DRILL-1039: Preserve schema even when no rows are copied in RemovingRecordBatch
norrislee [Wed, 30 Jul 2014 20:31:20 +0000 (13:31 -0700)]
DRILL-1226: C++ Client Decimal ignores leading zeros for decimal 9 and 18
Xiao Meng [Tue, 29 Jul 2014 22:10:10 +0000 (15:10 -0700)]
DRILL-1222: C++ Client. Fix DRILL-1144 compilation on Mac OS
Xiao Meng [Fri, 11 Jul 2014 23:26:16 +0000 (16:26 -0700)]
DRILL-1137: C++ Client. Support setting default schema in the connection phase.
Hanifi Gunes [Mon, 11 Aug 2014 22:55:20 +0000 (15:55 -0700)]
DRILL-1245: throw more descriptive error messages if i) json file is invalid ii) also display max record size allowed
Hanifi Gunes [Mon, 11 Aug 2014 18:40:54 +0000 (11:40 -0700)]
DRILL-1249: ensure ResultBatchLoader to check group count to determine if incoming batch has empty values for repeated fields
Jacques Nadeau [Sun, 10 Aug 2014 22:14:08 +0000 (15:14 -0700)]
Reenable producer/consumer.
Jinfeng Ni [Tue, 5 Aug 2014 20:13:52 +0000 (13:13 -0700)]
DRILL-1250 : Fix compilation error in generated code when Drill evaluates an expression that returns a repeated scalar value.
Steven Phillips [Fri, 1 Aug 2014 06:46:26 +0000 (23:46 -0700)]
DRILL-1252: Implement Complex parquet and json writers
Mehant Baid [Wed, 6 Aug 2014 22:20:25 +0000 (15:20 -0700)]
DRILL-1270: Resize buffers used in workspace vectors for aggregate functions dynamically
Hanifi Gunes [Wed, 6 Aug 2014 00:05:54 +0000 (17:05 -0700)]
DRILL-1239: stop writing to vector if the vector runs out of memory
Aditya Kishore [Sun, 3 Aug 2014 06:12:24 +0000 (23:12 -0700)]
DRILL-632: JDBC single jar build failing, too many classes
akumarb2010 [Fri, 1 Aug 2014 12:58:20 +0000 (18:28 +0530)]
DRILL-1235: Fix null handling in REST interface
Aditya Kishore [Thu, 31 Jul 2014 22:24:14 +0000 (15:24 -0700)]
DRILL-1236: Create script to verify release artifacts
Hanifi Gunes [Wed, 30 Jul 2014 01:39:11 +0000 (18:39 -0700)]
DRILL-1202: fixes memory leak issues: i) ProducerConsumerBatch should clean up resources ii) FragmentExecutor should clean up gracefully at faulty & non-faulty runs regardless
Mehant Baid [Wed, 6 Aug 2014 07:39:13 +0000 (00:39 -0700)]
DRILL-1253: Allocate new container for workspace vectors before exhausting the current container.
Mehant Baid [Wed, 6 Aug 2014 04:38:55 +0000 (21:38 -0700)]
DRILL-1214: Disable SV4 handling in FilterRecordBatch
Mehant Baid [Tue, 5 Aug 2014 06:29:36 +0000 (23:29 -0700)]
DRILL-1207: Use similar initial allocation size in ObjectVector as used by other vectors
Mehant Baid [Sat, 2 Aug 2014 00:35:14 +0000 (17:35 -0700)]
DRILL-1232: Fix unsupported exception while using 'where' clause with partition names
Mehant Baid [Fri, 1 Aug 2014 02:22:06 +0000 (19:22 -0700)]
DRILL-1247: Override toString() method in Time (java.sql.Time) class to print millis
Mehant Baid [Fri, 1 Aug 2014 01:22:34 +0000 (18:22 -0700)]
DRILL-1246: Fix rounding when scale is zero while casting from varchar ->decimal28/decimal38
Mehant Baid [Wed, 30 Jul 2014 23:43:14 +0000 (16:43 -0700)]
DRILL-1216: Fix decimal multiplication when precision can exceed 38
Mehant Baid [Thu, 31 Jul 2014 03:21:02 +0000 (20:21 -0700)]
DRILL-1233: Use ObjectHolder interface for Max, Min decimal aggregate functions
Steven Phillips [Wed, 6 Aug 2014 19:40:04 +0000 (12:40 -0700)]
DRILL-1063: Fix bug in merge join when value is repeating across batch boundary
Aditya Kishore [Thu, 31 Jul 2014 17:33:16 +0000 (10:33 -0700)]
DRILL-1234: Building Drill from source distribution fails [addendum]
Aditya Kishore [Thu, 31 Jul 2014 09:02:45 +0000 (02:02 -0700)]
DRILL-1234: Building Drill from source distribution fails