Laura Schanno [Tue, 17 Sep 2019 15:45:25 +0000 (11:45 -0400)]
Move performance tests to accumulo-testing #1200 (#90)
* Move ManySplitIT as HighSplitCreationPT.
* Move BalanceFasterIT as SplitBalancingPT.
* Move DeleteTableDuringSplitIT as TableDeletionDuringSplitPT.
* Move RollWALPerformanceIT as RollWALPT.
* Use Result.result() instead of Result.parameter() when reporting
resulting values so that they will be put in the appropriate section
when the JSON files are written.
Related: #1200
Mike Miller [Thu, 12 Sep 2019 17:06:55 +0000 (13:06 -0400)]
Fix BulkImport concurrent rwalk
Mike Miller [Wed, 11 Sep 2019 19:49:09 +0000 (15:49 -0400)]
Fix error in Dockerfile. Closes #109
Mike Miller [Thu, 15 Aug 2019 21:51:29 +0000 (17:51 -0400)]
Drop example files (#100)
* Drop .example files
* Remove references to these files from documentation
Christopher Tubbs [Sun, 4 Aug 2019 19:51:11 +0000 (15:51 -0400)]
Use Accumulo 2.0.0 in testing repo
Christopher Tubbs [Tue, 30 Jul 2019 21:19:01 +0000 (17:19 -0400)]
Fix warnings and clean up pom.xml
POM updates:
* Update plugins
* Add m2e profile for ensuring compiler compliance and ignoring plugins
in Eclipse that Eclipse doesn't understand
* Require minimum java version of 1.8 and Maven 3.5
* Remove unneeded plugins (felix bundle and exec-maven-plugin)
Fix warnings in code:
* Use try-with-resources for TestEnv
* Use Double.parseDouble instead of new Double
* Compute iterable sizes from scanners without unused variables
* Suppress deprecation warnings where needed
* Update BulkPlusOne RW test to use new bulk import API
* Add missing case statement for GET_SUMMARIES in TableOp
* Remove unused variables
Christopher Tubbs [Tue, 30 Jul 2019 20:25:46 +0000 (16:25 -0400)]
Revert to Java 8 for accumulo-testing
Allow accumulo-testing to build and run with Java 8, but support the
release flag on JDK9 and later.
Keith Turner [Tue, 30 Jul 2019 18:17:24 +0000 (14:17 -0400)]
Use non blocking secure random for seeding. (#106)
When attempting to generate bulk data to ingest, map reduce jobs were
timing out. The jobs were blocked in SecureRandom code for over 10
minutes (likely trying to read /dev/random, but not sure about this).
This change creates a SecureRandom that should be non-blocking
with Java's default configs. With this change generating bulk data no
longer times out.
Christopher Tubbs [Tue, 30 Jul 2019 00:18:07 +0000 (20:18 -0400)]
Update accumulo-testing to use Java 11
Jeffrey Manno [Fri, 26 Jul 2019 18:55:22 +0000 (14:55 -0400)]
Clean up and Minor Changes to ConditionalMutationsPT (#105)
Christopher Tubbs [Thu, 25 Jul 2019 00:10:33 +0000 (20:10 -0400)]
Avoid exceeding maximum log length in Travis
Suppress Downloading messages from Maven to avoid exceeding Travis CI's
maximum log length of 4MB.
Christopher Tubbs [Wed, 24 Jul 2019 23:52:14 +0000 (19:52 -0400)]
Use openjdk8 in Travis (oraclejdk8 gone)
Keith Turner [Wed, 24 Jul 2019 23:50:06 +0000 (19:50 -0400)]
Improve bulk CI docs (#104)
Mike Miller [Wed, 24 Jul 2019 17:00:17 +0000 (13:00 -0400)]
Replace Thrift types with proper API Exception (#99)
elinaawise [Wed, 24 Jul 2019 00:33:21 +0000 (20:33 -0400)]
Fix #30 Test performance of Conditional Mutations (#79)
Keith Turner [Tue, 23 Jul 2019 16:14:17 +0000 (12:14 -0400)]
Update documented expected CI bulk ingest counts (#103)
After the changes in #101 the counts produced by CI bulk ingest are more predictable.
Christopher Tubbs [Tue, 23 Jul 2019 16:12:34 +0000 (12:12 -0400)]
BulkIngest CI improvements (#101)
* Add job name
* Seed Random with strong seed
* Fix linked list to chain adjacent elements, rather than skip over
adjacent elements to link to the next element
Mike Miller [Tue, 23 Jul 2019 15:15:33 +0000 (11:15 -0400)]
Fixes from shellcheck. Closes #69 (#102)
Mike Miller [Mon, 22 Jul 2019 20:05:34 +0000 (16:05 -0400)]
Improve scripts (#91)
* Made scripts all call ./bin/build
* Added check for env vars in env.sh
* Print out where the env vars are coming from
Keith Turner [Fri, 19 Jul 2019 00:37:52 +0000 (20:37 -0400)]
Update ci bulk ingest docs (#98)
Laura Schanno [Tue, 16 Jul 2019 22:03:29 +0000 (18:03 -0400)]
Fix #88 incorrect argument array indexes (#95)
Update reference to correct position in array for the table name.
Keith Turner [Mon, 1 Jul 2019 18:42:38 +0000 (14:42 -0400)]
update bulk docs (#85)
Christopher Tubbs [Wed, 26 Jun 2019 20:48:32 +0000 (16:48 -0400)]
Fix shellcheck issues in conf dir
Christopher Tubbs [Wed, 26 Jun 2019 20:47:22 +0000 (16:47 -0400)]
Fix #83 Add hadoop conf to CLASSPATH for defaultFS (#87)
Ensure test client code has core-site.xml on its class path so that when
client code calls "new Configuration()" (for example, to qualify HDFS
paths without a scheme), it will use the fs.defaultFS to qualify them.
This fixes issues in bulk import test code that does not properly
qualify Paths as hdfs:// paths, and instead assumes file:// incorrectly.
Christopher Tubbs [Tue, 25 Jun 2019 20:32:05 +0000 (16:32 -0400)]
Fix split point code for bulk ingest
Mike Miller [Tue, 25 Jun 2019 19:30:00 +0000 (15:30 -0400)]
Add type safety to State (#86)
* Clean up code through reuse of new type safe State methods in bulk and concurrent modules
* Allowed removal of a bunch of warning suppresses
Mike Miller [Thu, 20 Jun 2019 17:42:47 +0000 (13:42 -0400)]
Some improvements to rwalk script (#82)
* Some improvements to ./bin/rwalk
* Fix script name
Keith Turner [Wed, 19 Jun 2019 20:06:03 +0000 (16:06 -0400)]
add gcs to docker
Keith Turner [Tue, 18 Jun 2019 20:07:25 +0000 (16:07 -0400)]
add gcs to test list in readme
Keith Turner [Tue, 18 Jun 2019 18:59:39 +0000 (14:59 -0400)]
Add new garbage collection simulation test (#81)
Mike Miller [Wed, 12 Jun 2019 21:08:27 +0000 (17:08 -0400)]
Update checkstyle (#80)
* Fixes CVE-2019-9658
Jeffrey Manno [Mon, 3 Jun 2019 16:55:14 +0000 (12:55 -0400)]
Remove Snappy Property (#77)
Jeffrey Zeiberg [Thu, 30 May 2019 17:55:41 +0000 (13:55 -0400)]
Updated Hadoop and Zookeeper versions in the pom file (#76)
Jeffrey Manno [Tue, 21 May 2019 18:58:00 +0000 (14:58 -0400)]
Add Durability Performance Test to Fix Durability IT (#74)
Keith Turner [Mon, 6 May 2019 18:40:05 +0000 (14:40 -0400)]
Use Uno in upgrade test script and update for 2.0 (#73)
Mike Miller [Thu, 28 Mar 2019 19:06:18 +0000 (15:06 -0400)]
Cleanup Continuous Ingest (#71)
* Add SLF4J logging
* Some minor cleanup
Mike Miller [Wed, 27 Mar 2019 20:35:22 +0000 (16:35 -0400)]
Replace C-style arrays (#70)
Mike Walch [Thu, 21 Mar 2019 20:05:39 +0000 (16:05 -0400)]
Refactored RowHash and TeraSortIngest (#68)
Mike Walch [Mon, 18 Mar 2019 14:55:32 +0000 (10:55 -0400)]
Update Accumulo version to 2.0.0-SNAPSHOT (#66)
Mike Walch [Thu, 14 Mar 2019 22:47:35 +0000 (18:47 -0400)]
Update ClientOpts (#65)
* Reduce the number of options
* Create clients using Accumulo.newClient()
* Remove use MapReduce opts from Accumulo
Keith Turner [Fri, 8 Mar 2019 23:03:07 +0000 (18:03 -0500)]
Made CI bulk import configurable and scriptable (#64)
Christopher Tubbs [Thu, 7 Mar 2019 23:03:40 +0000 (18:03 -0500)]
Apply formatter changes on build
Christopher Tubbs [Thu, 7 Mar 2019 22:59:14 +0000 (17:59 -0500)]
Update pom.xml
Update apache parent pom and update formatting plugins
Mike Miller [Thu, 7 Mar 2019 22:54:51 +0000 (17:54 -0500)]
Creates continuous BulkIngest (#63)
* A new Bulk import MapReduce job run using "cingest bulk"
* Will create a million random keys in Rfiles in the same format as CI
* The keys are in a linked list with the value being the prev row
* Works with CI createTable and verify
Mike Walch [Wed, 27 Feb 2019 20:07:29 +0000 (15:07 -0500)]
Clean up CLI parsing and stop depending on Accumulo internals (#61)
* Remove dependencies on Accumulo internal CLI parsing code
by addeing ClientOpts and Help
* Remove trace code from TestIngest
Jeffrey Manno [Wed, 27 Feb 2019 17:21:59 +0000 (12:21 -0500)]
Factor RFile and CryptoServiceFactory from Import-Control (#60)
Keith Turner [Fri, 22 Feb 2019 18:44:27 +0000 (13:44 -0500)]
Update CI compaction strat settings (#58)
Update the continuous ingest table settings to reflect the changes made
in apache/accumulo#935
Keith Turner [Fri, 22 Feb 2019 18:43:05 +0000 (13:43 -0500)]
Improve yield+scan exec performance test (#57)
Before this commit the filters running in the background had a very low
chance of returning data. This did not execercise the use case of an
iterator returning data very slowly. Returning data very slowly results
in slowly filling a buffer. The test was modified to run the filter
scans with multiple probabilites which causes data to return with
different velocities. Also the yield filter was modfied to support
yielding across returning a key/value.
The design of the yield filter was improved so that subclasses do not
have to override init().
Mike Miller [Fri, 8 Feb 2019 20:21:24 +0000 (15:21 -0500)]
Update japi readme and master config (#55)
* Update japi readme and master config
* Update japi-accumulo-master.xml
Remove old jar
Keith Turner [Fri, 8 Feb 2019 20:05:06 +0000 (15:05 -0500)]
Added perf test for yielding+scan dispatching (#56)
Test running lots of short scans while many filters that return little
data are running in the background. If nothing is done these filters
will prevent the short scans from running. This test configures
Accumulo so that the short scans should get a chance to run. It does
three things to facilitate the short scans : the filters yield, there
are two scan executors, and a scan dispatcher sends long running scans
to the second executor. If yielding and dispatching are working
correctly then the short scans should have very short response times.
This happens because the filters should end up in a separate thread pool
than the short scan.
Also fixed some bugs with the performance test framework.
Mike Miller [Fri, 8 Feb 2019 17:48:32 +0000 (12:48 -0500)]
Update japi-accumulo-master.xml
Remove old jar
Mike Miller [Thu, 7 Feb 2019 23:23:38 +0000 (18:23 -0500)]
Update japi readme and master config
Keith Turner [Thu, 31 Jan 2019 18:41:58 +0000 (13:41 -0500)]
Allowed configuring table props for CI table (#54)
Keith Turner [Wed, 30 Jan 2019 17:31:07 +0000 (12:31 -0500)]
Stopped using non public API ClientContext
Needed to do this inorder to build against 2.0.0-alpha-2
Mike Miller [Tue, 29 Jan 2019 19:09:19 +0000 (14:09 -0500)]
Update checkstyle to use import-control for API check (#52)
* Update checkstyle to use import-control for checking use of accumulo API
* Add checkstyle plugin to the build execution to always run
* Add exceptions to API checking to hold the line until we can fix
* Use Guava peeking iterator in place of accumulo peeking iterator
Mike Miller [Mon, 28 Jan 2019 16:05:44 +0000 (11:05 -0500)]
Remove CachedConfiguration to fix build
Keith Turner [Sat, 26 Jan 2019 22:03:41 +0000 (17:03 -0500)]
Improvements made for 2.0.0-alpha-2 CI testing (#51)
* Improvements made for 2.0.0-alpha-2 CI testing
Added ability to set test options on command line. This makes running different
types of ingest from the same docker image easy. Without this change a new
docker image would need to be created for each different ingest type.
Documented commands for running a long running ingest test.
Made the ingest flush size configurable. This made it possible to write really
small amounts of data and then pause. This changes makes testing
for the problem in apache/accumulo#854 possible.
Added docs for how to copy docker image to other nodes if you don't have
a repository handy.
Keith Turner [Wed, 23 Jan 2019 23:17:49 +0000 (18:17 -0500)]
Add commons codec depdency to pom
Tried building against alpha-2 release and it failed to find commons
codec. Maybe it used to get it transitively from Accumulo.
Mike Walch [Thu, 10 Jan 2019 17:38:56 +0000 (12:38 -0500)]
Updates for starting MapReduce jobs from randomwalk (#49)
* Set additional MapReduce configuration and Hadoop username
* These settings work if running randomwalk on user machine
or in Docker
Mike Walch [Wed, 2 Jan 2019 18:47:15 +0000 (13:47 -0500)]
Update copyright date to 2019
Mike Walch [Wed, 2 Jan 2019 17:50:52 +0000 (12:50 -0500)]
Updates due to Accumulo 2.0 changes (#48)
* Use new MapReduce API and client classloader
* Use createClient and close client
* Updated codestyle.xml to match file in Accumulo repo
Mike Walch [Fri, 7 Dec 2018 21:02:15 +0000 (16:02 -0500)]
Updated M/R code due to changes in Accumulo
Mike Walch [Fri, 7 Dec 2018 15:07:20 +0000 (10:07 -0500)]
Reduced use of ClientInfo (#47)
Mike Walch [Fri, 30 Nov 2018 15:37:29 +0000 (10:37 -0500)]
Closed clients that weren't being closed (#46)
* Stopped using changeUser
* Closed clients using try-with-resources blocks
Mike Walch [Wed, 28 Nov 2018 22:13:16 +0000 (17:13 -0500)]
Minor bug fixes (#45)
* Fixed bug in creating TEST_LOG4J in env.sh
* Used full HDFS uri in RW bulk import test
Mike Walch [Wed, 28 Nov 2018 19:12:28 +0000 (14:12 -0500)]
Build shaded jar without Hadoop for M/R (#44)
* Hadoop jars get added to CLASSPATH when needed
Mike Walch [Tue, 27 Nov 2018 21:17:40 +0000 (16:17 -0500)]
Major refactor to support running in Docker (#43)
* Multiple test clients are now run in Docker instead of YARN
* Tests can still be run locally but accumulo-testing script
has been removed and replaced by seperate scripts for
cingest, rwalk, and agitator
* Project no longer contains multiple Maven modules
* Renamed package to remove 'core' from it
* Renamed accumulo-testing.env to env.sh
* Created Dockerfile
Mike Walch [Mon, 19 Nov 2018 16:52:06 +0000 (11:52 -0500)]
Fixed running continuous ingest in YARN (#41)
* Excluded license directory from shaded jar to prevent
Twill from failing
* Upgraded Twill to 0.13 and made minor improvement to code.
Mike Walch [Sat, 17 Nov 2018 17:32:32 +0000 (12:32 -0500)]
Updates due to changes in Accumulo 2.0-SNAPSHOT (#40)
* Fixes due to impl package renames
* Fixes due to AccumuloClient builer method renames
Keith Turner [Sat, 13 Oct 2018 17:33:00 +0000 (13:33 -0400)]
fixed peformance test (#39)
Mike Walch [Wed, 26 Sep 2018 14:10:54 +0000 (10:10 -0400)]
Refactor usages of Connector to AccumuloClient (#36)
Mike Walch [Tue, 25 Sep 2018 19:20:23 +0000 (15:20 -0400)]
Updates due to Accumulo 2.0 changes (#35)
- using accumulo-client.properties to create clients
Mike Miller [Mon, 24 Sep 2018 18:29:32 +0000 (14:29 -0400)]
Replace Connector with AccumuloClient (#34)
Sean Busbey [Fri, 21 Sep 2018 16:29:29 +0000 (11:29 -0500)]
Update JAPI Compliance Checker definitions for releases 1.8.0 and 1.9.2, and current master branch. (#33)
Mike Miller [Thu, 30 Aug 2018 17:08:21 +0000 (13:08 -0400)]
Remove HdfsZooInstance & formatting
Mike Miller [Tue, 21 Aug 2018 17:20:29 +0000 (13:20 -0400)]
Fix typo in README
Keith Turner [Thu, 2 Aug 2018 17:37:35 +0000 (13:37 -0400)]
Use hints (apache/accumulo#555) in scan executor test (#31)
Keith Turner [Tue, 31 Jul 2018 15:58:35 +0000 (11:58 -0400)]
Add performance test for small scans (#24)
Keith Turner [Mon, 23 Jul 2018 21:34:48 +0000 (17:34 -0400)]
Created performance test framework (#21)
Mike Walch [Thu, 12 Jul 2018 18:51:29 +0000 (14:51 -0400)]
Merge branch '1.9'
Mike Walch [Thu, 12 Jul 2018 18:49:11 +0000 (14:49 -0400)]
Fix package names (#19)
Mike Walch [Thu, 12 Jul 2018 18:48:30 +0000 (14:48 -0400)]
Fixes #16 - Could not run continuous ingest (#18)
* Upgraded Twill to 0.12.1 to fix issue
Keith Turner [Thu, 12 Jul 2018 16:38:18 +0000 (12:38 -0400)]
Clean up shaded jar conflicts (#17)
Mike Walch [Thu, 5 Jul 2018 17:01:08 +0000 (13:01 -0400)]
Merge branch '1.9'
Mike Walch [Thu, 5 Jul 2018 17:00:44 +0000 (13:00 -0400)]
Mike Walch [Thu, 5 Jul 2018 15:09:26 +0000 (11:09 -0400)]
Added random pausing to continuous ingest (#15)
Mike Walch [Tue, 26 Jun 2018 20:55:12 +0000 (16:55 -0400)]
Merge pull request #14 from mikewalch/minor-fixes
Minor bug fixes
Mike Walch [Tue, 26 Jun 2018 20:28:21 +0000 (16:28 -0400)]
Added TravisCI links
Mike Walch [Tue, 26 Jun 2018 20:10:22 +0000 (16:10 -0400)]
Minor bug fixes
* Removed Accumulo properties that were removed from 2.0
* Hdfs root default now uses 8020
* Fixed Array index out of bounds exception
Mike Walch [Mon, 25 Jun 2018 16:38:30 +0000 (12:38 -0400)]
Updates to work with Accumulo 1.9.1
Keith Turner [Wed, 25 Apr 2018 15:44:33 +0000 (11:44 -0400)]
Updates to Undefined Analyzer made for apache/accumulo#441
Mike Miller [Thu, 19 Apr 2018 16:05:01 +0000 (12:05 -0400)]
Update mapreduce dependency
Christopher Tubbs [Thu, 5 Apr 2018 17:50:48 +0000 (13:50 -0400)]
ACCUMULO-4833 Disable broken randomwalk task
Comment out the replication task in the Concurrent test configuration
for the randomwalk test framework.
Mike Miller [Wed, 14 Mar 2018 21:01:57 +0000 (17:01 -0400)]
ACCUMULO-4848 Fix ClientConfiguration errors
Mike Walch [Fri, 3 Nov 2017 19:09:16 +0000 (15:09 -0400)]
ACCUMULO-4736 Created CONTRIBUTING.md (#10)
Mike Miller [Wed, 25 Oct 2017 22:40:38 +0000 (18:40 -0400)]
ACCUMULO-4717 Update TableOp to handle error codes
Mike Miller [Wed, 25 Oct 2017 21:29:14 +0000 (17:29 -0400)]
ACCUMULO-4717 Fix boolean checks in AlterTablePerm
Mike Miller [Wed, 25 Oct 2017 19:40:10 +0000 (15:40 -0400)]
ACCUMULO-4717 Refactor WalkingSecurity to use API (#9)
Simplified WalkingSecurity by removing the inheritance from server
classes and removing unused code. Now it is simply a helper class and
the other classes in the randomwalk framework call the API directly.
Also added try catch blocks where exceptions are now thrown.
Mike Miller [Wed, 25 Oct 2017 18:34:02 +0000 (14:34 -0400)]
ACCUMULO-4718 Add Hadoop common to shade pom
Mike Miller [Thu, 19 Oct 2017 18:34:00 +0000 (14:34 -0400)]
ACCUMULO-4725 Add modules dir to script help (#8)