Jon [Mon, 9 May 2022 23:43:56 +0000 (16:43 -0700)]
add ::value_type and ::comparator to REQ for consistency across quantiles
Jon [Mon, 9 May 2022 23:37:34 +0000 (16:37 -0700)]
Add KS Test to python, updating docs where appropriate
Alexander Saydakov [Mon, 9 May 2022 17:03:03 +0000 (10:03 -0700)]
Merge pull request #274 from apache/common_kolmogorov_smirnov
Common Kolmogorov-Smirnov test
AlexanderSaydakov [Sat, 7 May 2022 00:54:42 +0000 (17:54 -0700)]
higher resolution
AlexanderSaydakov [Fri, 6 May 2022 23:29:04 +0000 (16:29 -0700)]
common Kolmogorov-Smirnov test, some minor fixes
Jon Malkin [Fri, 6 May 2022 18:20:01 +0000 (11:20 -0700)]
Merge pull request #273 from apache/sorted_view
move quantiles and req from quantile_calculator to sorted_view
Jon Malkin [Fri, 6 May 2022 16:16:02 +0000 (09:16 -0700)]
add sanity check on item counts in deserialization constructor
Jon [Fri, 6 May 2022 03:05:54 +0000 (20:05 -0700)]
move quantiles and req from quantile_calculator to sorted_view (and fix a quantiles deser bug)
Jon Malkin [Thu, 5 May 2022 18:18:17 +0000 (11:18 -0700)]
Merge pull request #271 from apache/quantiles
Quantiles
Jon Malkin [Thu, 5 May 2022 18:05:30 +0000 (11:05 -0700)]
Merge branch 'master' into quantiles
Jon [Thu, 5 May 2022 17:27:15 +0000 (10:27 -0700)]
improve language around the inclusive flag
Jon [Thu, 5 May 2022 09:54:28 +0000 (02:54 -0700)]
test get_quantile with and without inclusive
Jon [Thu, 5 May 2022 02:57:48 +0000 (19:57 -0700)]
remove asserts, mostly replacing with std::logic_error
Jon [Thu, 5 May 2022 02:32:38 +0000 (19:32 -0700)]
changes based on review comments aside from asserts
Jon Malkin [Wed, 4 May 2022 23:27:40 +0000 (16:27 -0700)]
Merge pull request #272 from apache/fix_wheels
Fix wheels
Jon Malkin [Wed, 4 May 2022 21:51:07 +0000 (14:51 -0700)]
specify Python3 rather than Python in MSVC-specific block
Jon Malkin [Wed, 4 May 2022 21:41:08 +0000 (14:41 -0700)]
restore wheel matrix, remove verbosity, condition find_package(Python3) on cmake version to address issue
Jon Malkin [Wed, 4 May 2022 18:35:43 +0000 (11:35 -0700)]
trace-expand cmake logs
Jon Malkin [Wed, 4 May 2022 18:26:52 +0000 (11:26 -0700)]
have cibuildwheel install cmake>=3.18 before build
Jon Malkin [Wed, 4 May 2022 17:46:51 +0000 (10:46 -0700)]
more debug messages
Jon Malkin [Wed, 4 May 2022 17:34:48 +0000 (10:34 -0700)]
fix verbosity
Jon Malkin [Wed, 4 May 2022 17:19:51 +0000 (10:19 -0700)]
fix sdist build
Jon Malkin [Wed, 4 May 2022 17:05:13 +0000 (10:05 -0700)]
disble mac/windows and 32bit builds for speed, increase logging and verbosity
Jon Malkin [Tue, 3 May 2022 23:24:29 +0000 (16:24 -0700)]
tell pybind to use newer python-finding approach
Jon Malkin [Tue, 3 May 2022 22:25:32 +0000 (15:25 -0700)]
try letting pybind11 find python on its own
Jon Malkin [Tue, 3 May 2022 19:48:23 +0000 (12:48 -0700)]
try backing off to cmake 3.16 for greater compatibility
Jon Malkin [Tue, 3 May 2022 16:52:44 +0000 (09:52 -0700)]
ensure pyproject.toml has consistent cmake version
Jon [Tue, 3 May 2022 10:05:21 +0000 (03:05 -0700)]
bump cmake version to specify development.module w/in python
Jon [Tue, 3 May 2022 09:46:29 +0000 (02:46 -0700)]
combine a couple branches to address issues with python wheels
Jon Malkin [Mon, 2 May 2022 20:08:13 +0000 (13:08 -0700)]
Merge pull request #269 from apache/serde_instance
Serde instance
Jon [Wed, 27 Apr 2022 20:50:13 +0000 (13:50 -0700)]
fix CMakeLists to include correct quantile calculator filenames
Jon Malkin [Tue, 26 Apr 2022 07:17:00 +0000 (00:17 -0700)]
clone and minorly adapt python kll testing to classic quantiles
Jon Malkin [Tue, 26 Apr 2022 06:59:42 +0000 (23:59 -0700)]
more test coverage
Jon Malkin [Tue, 26 Apr 2022 06:55:28 +0000 (23:55 -0700)]
improve test coverage
Jon [Sat, 23 Apr 2022 07:09:37 +0000 (00:09 -0700)]
clean up a few warnings
Jon [Sat, 23 Apr 2022 06:38:27 +0000 (23:38 -0700)]
finish merge, update types to be able handle const input sketchs instead of just exact matches
Jon [Wed, 20 Apr 2022 23:25:24 +0000 (16:25 -0700)]
incomplete merge implementation
AlexanderSaydakov [Tue, 19 Apr 2022 00:06:55 +0000 (17:06 -0700)]
wrong offset
Jon Malkin [Wed, 6 Apr 2022 23:11:29 +0000 (16:11 -0700)]
quantile calculator missing algorithm
Jon Malkin [Wed, 6 Apr 2022 23:04:25 +0000 (16:04 -0700)]
add missing cassert include -- should probably ultimately remove asserts instead
Jon Malkin [Wed, 6 Apr 2022 22:43:07 +0000 (15:43 -0700)]
add python wrapper (w/o merging as that's not implemented yet)
Jon Malkin [Tue, 5 Apr 2022 23:49:14 +0000 (16:49 -0700)]
ensure k is power of 2 to match java
Jon Malkin [Tue, 5 Apr 2022 20:10:21 +0000 (13:10 -0700)]
Read correct sketch from java for testing
Jon Malkin [Tue, 5 Apr 2022 19:11:38 +0000 (12:11 -0700)]
Add more tests ported from kll test suite (all except merging)
Jon Malkin [Tue, 5 Apr 2022 18:54:11 +0000 (11:54 -0700)]
remove unnecessary move, sort BB as side-effect on serialize despite const method since always compact
Jon Malkin [Tue, 5 Apr 2022 08:15:38 +0000 (01:15 -0700)]
minor cleanup
Jon Malkin [Tue, 5 Apr 2022 08:11:39 +0000 (01:11 -0700)]
(de)serializatio fixes and compatibility with java, including older serialization versions
AlexanderSaydakov [Mon, 28 Mar 2022 23:59:49 +0000 (16:59 -0700)]
no class-level serde, support serde instance
AlexanderSaydakov [Mon, 28 Mar 2022 21:34:32 +0000 (14:34 -0700)]
no class-level serde, support serde instance
Jon Malkin [Sat, 26 Mar 2022 02:20:11 +0000 (19:20 -0700)]
more warning fixes
Jon Malkin [Sat, 26 Mar 2022 02:13:31 +0000 (19:13 -0700)]
updates to reduce warnings
Jon Malkin [Fri, 25 Mar 2022 22:37:18 +0000 (15:37 -0700)]
naming consistency for serde template parameter
AlexanderSaydakov [Fri, 25 Mar 2022 21:40:50 +0000 (14:40 -0700)]
support serde instance
AlexanderSaydakov [Fri, 25 Mar 2022 21:09:28 +0000 (14:09 -0700)]
unnecessary
AlexanderSaydakov [Fri, 25 Mar 2022 20:17:23 +0000 (13:17 -0700)]
support serde instance
AlexanderSaydakov [Fri, 25 Mar 2022 18:54:43 +0000 (11:54 -0700)]
better for transition
Alexander Saydakov [Thu, 24 Mar 2022 18:19:55 +0000 (11:19 -0700)]
Merge pull request #267 from apache/hll-deserialize-test
serialize-deserialize-serialize tests
Alexander Saydakov [Thu, 24 Mar 2022 18:19:16 +0000 (11:19 -0700)]
Merge pull request #268 from apache/sorted_view_performance
quantile sorted view performance improvement
Jon Malkin [Thu, 24 Mar 2022 00:40:02 +0000 (17:40 -0700)]
remove extraneous serde template parameter
Jon Malkin [Thu, 24 Mar 2022 00:25:58 +0000 (17:25 -0700)]
move serde from class onto only methods needing it
Jon Malkin [Thu, 24 Mar 2022 00:00:13 +0000 (17:00 -0700)]
deserialize void*
AlexanderSaydakov [Wed, 23 Mar 2022 23:10:44 +0000 (16:10 -0700)]
docs
AlexanderSaydakov [Wed, 23 Mar 2022 23:10:16 +0000 (16:10 -0700)]
support serde instance
Jon Malkin [Wed, 23 Mar 2022 21:39:14 +0000 (14:39 -0700)]
checkpoint: streaming deserialization
AlexanderSaydakov [Tue, 22 Mar 2022 20:37:30 +0000 (13:37 -0700)]
better name - not always a pointer
AlexanderSaydakov [Thu, 17 Mar 2022 21:57:00 +0000 (14:57 -0700)]
faster merge
AlexanderSaydakov [Mon, 14 Mar 2022 22:06:03 +0000 (15:06 -0700)]
serialize-deserialize-serialize tests
Jon Malkin [Sat, 12 Mar 2022 00:12:06 +0000 (16:12 -0800)]
Merge pull request #266 from etseidl/patch-1
is CouponHashSet deserialization incorrect?
etseidl [Wed, 9 Mar 2022 16:46:56 +0000 (08:46 -0800)]
is CouponHashSet deserialization incorrect?
When deserializing an updatable CouponHashSet from a byte array, wouldn't you need to copy the entire coupons array rather than just using couponCount since the hash array is not populated sequentially like the CouponList?
Alexander Saydakov [Tue, 8 Mar 2022 03:26:48 +0000 (19:26 -0800)]
Merge pull request #264 from apache/kll_forwarding
Reduced code duplication in KLL
AlexanderSaydakov [Mon, 7 Mar 2022 20:47:09 +0000 (12:47 -0800)]
added forwarding
Alexander Saydakov [Tue, 1 Mar 2022 00:21:43 +0000 (16:21 -0800)]
Merge pull request #265 from aseure/fix-python-build
fix: add missing <stdexcept> includes
Anthony Seure [Mon, 28 Feb 2022 15:32:41 +0000 (16:32 +0100)]
fix: add missing <stdexcept> includes
AlexanderSaydakov [Fri, 25 Feb 2022 22:54:10 +0000 (14:54 -0800)]
added necessary include
AlexanderSaydakov [Fri, 25 Feb 2022 19:39:06 +0000 (11:39 -0800)]
fixed docs
AlexanderSaydakov [Wed, 23 Feb 2022 22:17:46 +0000 (14:17 -0800)]
reduced code duplication
AlexanderSaydakov [Wed, 23 Feb 2022 00:41:08 +0000 (16:41 -0800)]
reduced code duplication
Jon Malkin [Fri, 18 Feb 2022 10:08:16 +0000 (02:08 -0800)]
add serialization (self-consistent, not fully tested)
Alexander Saydakov [Tue, 8 Feb 2022 21:23:22 +0000 (13:23 -0800)]
Merge pull request #259 from apache/quantile_sketch_sorted_view
Quantile sketch sorted view
AlexanderSaydakov [Mon, 7 Feb 2022 21:13:25 +0000 (13:13 -0800)]
use the same type as in the sorted view
AlexanderSaydakov [Sat, 5 Feb 2022 00:44:25 +0000 (16:44 -0800)]
return a value from get_quantile() for arithmetic types
AlexanderSaydakov [Fri, 4 Feb 2022 22:16:21 +0000 (14:16 -0800)]
slightly larger margin
AlexanderSaydakov [Fri, 4 Feb 2022 20:14:11 +0000 (12:14 -0800)]
include floating point types, simplify comparator
AlexanderSaydakov [Fri, 4 Feb 2022 20:08:54 +0000 (12:08 -0800)]
singular
AlexanderSaydakov [Fri, 4 Feb 2022 19:34:59 +0000 (11:34 -0800)]
removed some unnecessary complexity
AlexanderSaydakov [Mon, 31 Jan 2022 22:04:12 +0000 (14:04 -0800)]
copy items of integral types instead of using pointers
AlexanderSaydakov [Wed, 26 Jan 2022 01:04:18 +0000 (17:04 -0800)]
include cmath for std::isnan
AlexanderSaydakov [Wed, 26 Jan 2022 01:03:53 +0000 (17:03 -0800)]
adjusted python wrapper
AlexanderSaydakov [Tue, 25 Jan 2022 23:49:10 +0000 (15:49 -0800)]
implicit conversion
AlexanderSaydakov [Tue, 25 Jan 2022 23:41:46 +0000 (15:41 -0800)]
quantile sketch sorted view
Jon Malkin [Thu, 20 Jan 2022 09:05:33 +0000 (01:05 -0800)]
use common quantiles calculator between req and classic quantiles
Jon Malkin [Sat, 15 Jan 2022 03:13:28 +0000 (19:13 -0800)]
fix backwards logic on a comparator, simplify to reduce redundancy, add additional asserts for debugging
Jon Malkin [Fri, 14 Jan 2022 17:15:09 +0000 (09:15 -0800)]
add missing header, fix comparator usage
Jon Malkin [Fri, 14 Jan 2022 17:04:59 +0000 (09:04 -0800)]
include chrono in the right place
Jon Malkin [Fri, 14 Jan 2022 10:29:35 +0000 (02:29 -0800)]
use the same random_bit implementation across all quantiles sketches, in common_defs.hpp
Alexander Saydakov [Thu, 13 Jan 2022 22:13:19 +0000 (14:13 -0800)]
Merge pull request #258 from apache/kll_inclusive_rank
KLL inclusive rank
AlexanderSaydakov [Tue, 11 Jan 2022 00:54:21 +0000 (16:54 -0800)]
test inclusive PMF and CDF
AlexanderSaydakov [Mon, 10 Jan 2022 21:24:44 +0000 (13:24 -0800)]
adjusted the python wrapper
AlexanderSaydakov [Sat, 8 Jan 2022 01:35:28 +0000 (17:35 -0800)]
kll inclusive rank
Alexander Saydakov [Wed, 5 Jan 2022 23:52:54 +0000 (15:52 -0800)]
Merge pull request #255 from apache/theta_common_reading_bytes
Theta: common code for wrap and deserialize