Lewis John McGibbney [Thu, 25 Jan 2018 18:43:56 +0000 (10:43 -0800)]
[maven-release-plugin] prepare branch 2.2
Lewis John McGibbney [Thu, 25 Jan 2018 18:43:09 +0000 (10:43 -0800)]
Prepare for Any23 2.2 RC#1
Lewis John McGibbney [Thu, 25 Jan 2018 18:32:21 +0000 (10:32 -0800)]
ANY23-210 Address 1.0 Release Review Discrepancies
Hans [Thu, 25 Jan 2018 05:15:41 +0000 (23:15 -0600)]
ANY23-227, ANY23-268, ANY23-317, ANY23-271, ANY23-273, ANY23-326, ANY23-267 Wrote tests to ensure that all of these issues were fixed by PR #59.
Lewis John McGibbney [Thu, 25 Jan 2018 04:54:23 +0000 (20:54 -0800)]
Merge branch 'ANY23-291' of https://github.com/HansBrende/any23
Hans [Thu, 25 Jan 2018 01:58:25 +0000 (19:58 -0600)]
ANY23-291 Allow JSONLD scripts to be located anywhere in document
Hans [Wed, 24 Jan 2018 12:26:40 +0000 (06:26 -0600)]
ANY23-326 fixed rdfa issue with unclosed input & meta tags
Hans [Tue, 23 Jan 2018 18:18:18 +0000 (12:18 -0600)]
ANY23-324 Added license to TagSoupParsingConfiguration
Hans [Thu, 18 Jan 2018 21:08:27 +0000 (15:08 -0600)]
ANY23-324 Changed default html parser from NekoHTML to Jsoup. This also indirectly fixes ANY23-317, ANY23-273, ANY23-267, and ANY23-326.
Lewis John McGibbney [Mon, 8 Jan 2018 14:35:24 +0000 (09:35 -0500)]
ANY23-309 'Scraper' misspelled as 'Scarper' on Downloads webpage
Lewis John McGibbney [Mon, 8 Jan 2018 13:16:00 +0000 (08:16 -0500)]
Merge branch 'ANY23-320'
Frankie Robertson [Mon, 8 Jan 2018 07:09:22 +0000 (09:09 +0200)]
Fix HTTP repository link
Lewis John McGibbney [Wed, 3 Jan 2018 00:19:05 +0000 (00:19 +0000)]
Resolve merge conflict between master and ANY23-320
Jacek Grzebyta [Tue, 2 Jan 2018 19:27:55 +0000 (19:27 +0000)]
Ref ANY23-316
- update number of cases in the schema test after update yaml schema
Signed-off-by:Jacek Grzebyta <grzebyta.dev@gmail.com>
Jacek Grzebyta [Tue, 2 Jan 2018 18:30:17 +0000 (18:30 +0000)]
Merge branch 'ANY23-316'
Jacek Grzebyta [Mon, 1 Jan 2018 14:16:53 +0000 (14:16 +0000)]
Solved ANY23-316
- remove commented line in test
- RDFUtils: fixed isRelative method
Signed-off-by:Jacek Grzebyta <grzebyta.dev@gmail.com>
Jacek Grzebyta [Mon, 1 Jan 2018 12:56:37 +0000 (12:56 +0000)]
Merge branch 'master' into ANY23-316
Lewis John McGibbney [Mon, 1 Jan 2018 02:58:36 +0000 (02:58 +0000)]
ANY23-320 Address @Ignore tests in Any23 and ANY23-131 Nested Microdata are not extracted
Lewis John McGibbney [Sat, 30 Dec 2017 23:26:15 +0000 (23:26 +0000)]
Fix for broken live link in org.apache.any23.cli.CrawlerTest
Lewis John McGibbney [Sat, 30 Dec 2017 22:59:25 +0000 (22:59 +0000)]
Fix for Tika and RDF4J upgrades
Lewis John McGibbney [Sat, 30 Dec 2017 18:59:29 +0000 (18:59 +0000)]
ANY23-140 Revise Any23 tests to remove fetching of web content
Lewis John McGibbney [Sat, 30 Dec 2017 17:21:57 +0000 (17:21 +0000)]
Move LICENSE.txt to LICENSE.md
Lewis John McGibbney [Sat, 30 Dec 2017 17:17:54 +0000 (17:17 +0000)]
Update README.md with badges
Lewis John McGibbney [Sat, 30 Dec 2017 17:11:45 +0000 (17:11 +0000)]
Merge branch 'ANY23-140'
Lewis John McGibbney [Sat, 30 Dec 2017 17:08:41 +0000 (17:08 +0000)]
ANY23-318 ExtractionException handling in BaseRDFExtractor.java kills entire extraction
Lewis John McGibbney [Sat, 30 Dec 2017 13:38:17 +0000 (13:38 +0000)]
Merge branch 'master' into ANY23-318
Lewis John McGibbney [Sat, 30 Dec 2017 02:13:14 +0000 (02:13 +0000)]
ANY23-319 Upgrade jsonld-java dependency to 0.11.1
Lewis John McGibbney [Wed, 27 Dec 2017 21:41:39 +0000 (21:41 +0000)]
ANY23-140 - Revise Any23 tests to remove fetching of web content
Lewis John McGibbney [Wed, 27 Dec 2017 20:06:08 +0000 (20:06 +0000)]
ANY23-318 ExtractionException handling in BaseRDFExtractor.java kills entire extraction
Jacek Grzebyta [Tue, 26 Dec 2017 00:37:13 +0000 (00:37 +0000)]
Solved ANY23-316
- create value for null values: yaml:Null
- update null-aware unit test
- modify example for unit test: 'null' value is an item of a list
Signed-off-by: Jacek Grzebyta <grzebyta.dev@gmail.com>
Lewis John McGibbney [Tue, 19 Dec 2017 12:46:40 +0000 (04:46 -0800)]
Merge branch 'ANY23-314'
Lewis John McGibbney [Tue, 19 Dec 2017 12:45:59 +0000 (04:45 -0800)]
Merge branch 'ANY23-298'
Jacek Grzebyta [Thu, 14 Dec 2017 12:16:13 +0000 (12:16 +0000)]
Merge remote-tracking branch 'origin/ANY23-312-b'
Lewis John McGibbney [Wed, 13 Dec 2017 18:31:06 +0000 (10:31 -0800)]
ANY23-314 Service fails to return extraction in case of extraction error
Lewis John McGibbney [Wed, 13 Dec 2017 18:26:04 +0000 (10:26 -0800)]
ANY23-298 Revisit the OGP.java vocabulary and update it
Lewis John McGibbney [Tue, 12 Dec 2017 21:51:48 +0000 (13:51 -0800)]
ANY23-314 Service fails to return extraction in case of extraction error
Jacek Grzebyta [Tue, 12 Dec 2017 16:48:11 +0000 (16:48 +0000)]
Solved issues in javadoc description.
- replaces Map.Entry by dedicated small getters bean.
- other small changes
Signed-off-by:Jacek Grzebyta <grzebyta.dev@gmail.com>
Jacek Grzebyta [Thu, 7 Dec 2017 17:02:42 +0000 (17:02 +0000)]
Fixed javadoc error
Signed-off-by:Jacek Grzebyta <grzebyta.dev@gmail.com>
Jacek Grzebyta [Thu, 7 Dec 2017 12:24:26 +0000 (12:24 +0000)]
Merge branch 'ANY23-312'
- Fixed yaml parser
- add more unit tests
Jacek Grzebyta [Thu, 7 Dec 2017 12:19:42 +0000 (12:19 +0000)]
Clean code
- remove some tabs in empty rows
Signed-off-by:Jacek Grzebyta <grzebyta.dev@gmail.com>
Jacek Grzebyta [Mon, 20 Nov 2017 00:41:12 +0000 (00:41 +0000)]
Fixed problem with list parser
- in raw rdf nodes are not reversed
Signed-off-by:Jacek Grzebyta <grzebyta.dev@gmail.com>
Jacek Grzebyta [Sun, 19 Nov 2017 00:11:56 +0000 (00:11 +0000)]
Add unit test for extracting lists.
Signed-off-by:Jacek Grzebyta <grzebyta.dev@gmail.com>
Jacek Grzebyta [Sat, 18 Nov 2017 23:20:05 +0000 (23:20 +0000)]
Update license information in file headers.
Signed-off-by:Jacek Grzebyta <grzebyta.dev@gmail.com>
Lewis John McGibbney [Thu, 16 Nov 2017 18:54:17 +0000 (10:54 -0800)]
Add CONTRIBUTING guide
Lewis John McGibbney [Wed, 15 Nov 2017 05:02:52 +0000 (21:02 -0800)]
Merge branch 'master' of https://github.com/imduffy15/any23
Lewis John McGibbney [Wed, 15 Nov 2017 04:57:41 +0000 (20:57 -0800)]
Merge branch 'patch-1' of https://github.com/The-Alchemist/any23
Lewis John McGibbney [Wed, 15 Nov 2017 04:57:25 +0000 (20:57 -0800)]
Website updates as published to production
Ian Duffy [Wed, 8 Nov 2017 13:59:42 +0000 (13:59 +0000)]
Support attribute content on all fields.
`<sometag content="something" />` should be considered, regardless of whether `content` is a valid
attribute of `sometag`.
The specification for microdata[1] details that an element's `content` attribute should be considered
before text content.
Any23 doesn't currently do this; it only considers `content` for `meta` tags, which is the only
HTML tag that is supposed to have a `content` attribute, but not all sites follow the HTML specifications.
This updates the microdata parser to get `content` from any element, should it exist.
[1] https://www.w3.org/TR/microdata/#values
Signed-off-by: Ian Duffy <ian.duffy@zalando.ie>
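The lookup order described in the commit message above can be sketched as follows. This is only an illustration, not the code from the commit: the class `ContentFirstValue`, the method `propertyValue`, and the use of a plain attribute map instead of a DOM node are all assumptions made to keep the example self-contained.

```java
import java.util.Collections;
import java.util.Map;

// Hypothetical helper, not part of Any23: picks a microdata property value
// from an element's attributes and text, preferring the `content` attribute.
final class ContentFirstValue {

    static String propertyValue(Map<String, String> attributes, String textContent) {
        // Prefer `content` on any element, not only on <meta>.
        String content = attributes.get("content");
        if (content != null) {
            return content;
        }
        // Otherwise fall back to the element's text content.
        return textContent == null ? "" : textContent.trim();
    }

    public static void main(String[] args) {
        Map<String, String> withContent = Collections.singletonMap("content", "something");
        Map<String, String> withoutContent = Collections.singletonMap("itemprop", "name");
        System.out.println(propertyValue(withContent, "ignored text"));
        System.out.println(propertyValue(withoutContent, "  Any23  "));
    }
}
```

With this ordering, a non-`meta` element that carries `content` yields the attribute value, and an element without it yields its trimmed text.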
Jacek Grzebyta [Mon, 6 Nov 2017 12:35:00 +0000 (12:35 +0000)]
Fix ANY23-312
- solved blank nodes creation problem for maps
- add another unit test example typical for configuration files
Signed-off-by: Jacek Grzebyta <grzebyta.dev@gmail.com>
Jacek Grzebyta [Fri, 3 Nov 2017 22:25:29 +0000 (22:25 +0000)]
Update ElementProcessor
- instantiate a map's root node
- update unit tests
Signed-off-by:Jacek Grzebyta <grzebyta.dev@gmail.com>
Jacek Grzebyta [Fri, 27 Oct 2017 19:03:18 +0000 (20:03 +0100)]
Fix ANY23-312
- fix problem with making wrong IRI if docIRI ends with # character
Signed-off-by:Jacek Grzebyta <grzebyta.dev@gmail.com>
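One way to read the fix above: when a local name is appended to a document IRI that already ends with `#`, a second `#` must not be added. The sketch below illustrates that guard only; the class `NamespaceJoin` and its methods are hypothetical and are not taken from the Any23 sources.

```java
// Hypothetical helper, not part of Any23: joins a document IRI and a local
// name with exactly one '#' separator.
final class NamespaceJoin {

    // Returns the document IRI with a trailing '#', adding one only if missing.
    static String withHash(String docIri) {
        return docIri.endsWith("#") ? docIri : docIri + "#";
    }

    // Builds an IRI for a local name inside the document's namespace.
    static String localIri(String docIri, String localName) {
        return withHash(docIri) + localName;
    }

    public static void main(String[] args) {
        System.out.println(localIri("http://example.org/doc", "name"));
        System.out.println(localIri("http://example.org/doc#", "name"));
    }
}
```

Both calls in `main` produce the same IRI, which is the property the guard is meant to ensure.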
Jacek Grzebyta [Fri, 27 Oct 2017 16:17:10 +0000 (17:17 +0100)]
Fix ANY23-312
- fix unit test for literals
Signed-off-by:Jacek Grzebyta <grzebyta.dev@gmail.com>
The Alchemist [Thu, 26 Oct 2017 20:36:04 +0000 (16:36 -0400)]
Formatting was off
Jacek Grzebyta [Mon, 23 Oct 2017 13:55:55 +0000 (14:55 +0100)]
Ref ANY23-312
- ElementProcessor: fix issue in map validation
returns always empty model if parsed to literal
- YamlExtractor: fix issue with yaml containing only literals
- update unit tests
Signed-off-by:Jacek Grzebyta <grzebyta.dev@gmail.com>
Jacek Grzebyta [Mon, 23 Oct 2017 11:40:21 +0000 (12:40 +0100)]
Update YamlExtractor
- update unit tests
- add test to simple text file
Signed-off-by:Jacek Grzebyta <grzebyta.dev@gmail.com>
Jacek Grzebyta [Mon, 23 Oct 2017 11:36:04 +0000 (12:36 +0100)]
Update ElementProcessor - RDF-izer
- make processor singleton
- update unit tests
Signed-off-by:Jacek Grzebyta <grzebyta.dev@gmail.com>
Jacek Grzebyta [Sat, 21 Oct 2017 09:27:06 +0000 (10:27 +0100)]
Create ElementProcessor - RDF-izer
- Add RDF-iser based on types
- add unit tests
Signed-off-by: Jacek Grzebyta <grzebyta.dev@gmail.com>
Jacek Grzebyta [Fri, 20 Oct 2017 16:41:40 +0000 (17:41 +0100)]
Create ElementProcessor
- add unit test
Signed-off-by: Jacek Grzebyta <grzebyta.dev@gmail.com>
Jacek Grzebyta [Wed, 18 Oct 2017 17:50:30 +0000 (18:50 +0100)]
Update RDFUtils
- move some converters to util class
Signed-off-by:Jacek Grzebyta <grzebyta.dev@gmail.com>
Jacek Grzebyta [Tue, 17 Oct 2017 16:22:53 +0000 (17:22 +0100)]
Ref ANY23-312
- buildNode method returns Optional wrapper
Signed-off-by:Jacek Grzebyta <grzebyta.dev@gmail.com>
Jacek Grzebyta [Tue, 17 Oct 2017 16:06:44 +0000 (17:06 +0100)]
Ref ANY23-312
- update example file
- update unit test
Signed-off-by:Jacek Grzebyta <grzebyta.dev@gmail.com>
Lewis John McGibbney [Fri, 15 Sep 2017 07:14:07 +0000 (00:14 -0700)]
[maven-release-plugin] prepare for next development iteration
Lewis John McGibbney [Fri, 15 Sep 2017 07:14:01 +0000 (00:14 -0700)]
[maven-release-plugin] prepare release any23-2.1
Lewis John McGibbney [Fri, 15 Sep 2017 06:19:19 +0000 (23:19 -0700)]
[maven-release-plugin] prepare for next development iteration
Lewis John McGibbney [Fri, 15 Sep 2017 06:19:13 +0000 (23:19 -0700)]
[maven-release-plugin] prepare branch 2.1.0
Lewis John McGibbney [Fri, 15 Sep 2017 03:17:46 +0000 (20:17 -0700)]
Preparing files for Any23 2.1 RC#1
Jacek Grzebyta [Wed, 13 Sep 2017 14:15:20 +0000 (15:15 +0100)]
Merge branch 'ANY23-311-2'
- fixed javadoc issues
Jacek Grzebyta [Wed, 13 Sep 2017 14:14:21 +0000 (15:14 +0100)]
Fixed ANY23-311
- solved javadoc problems
Signed-off-by:Jacek Grzebyta <grzebyta.dev@gmail.com>
Jacek Grzebyta [Wed, 13 Sep 2017 10:46:46 +0000 (11:46 +0100)]
Merge branch 'ANY23-311' of https://github.com/jgrzebyta/any23
Jacek Grzebyta [Sat, 9 Sep 2017 20:13:37 +0000 (21:13 +0100)]
Fix testing issue
- add documentation to RDFUtils class
Signed-off-by:Jacek Grzebyta <grzebyta.dev@gmail.com>
Jacek Grzebyta [Tue, 29 Aug 2017 11:41:16 +0000 (12:41 +0100)]
Merge branch 'master' into ANY23-311
- Resolve conflict in YAMLExtractor.java
Signed-off-by:Jacek Grzebyta <grzebyta.dev@gmail.com>
Lewis John McGibbney [Wed, 23 Aug 2017 20:26:23 +0000 (13:26 -0700)]
ANY23-304 skip tests in openie module
Lewis John McGibbney [Wed, 23 Aug 2017 19:15:56 +0000 (12:15 -0700)]
ANY23-304 Add extractor for OpenIE
Lewis John McGibbney [Thu, 27 Jul 2017 19:16:29 +0000 (12:16 -0700)]
ANY23-304 implement temporary file reader within test logic
Lewis John McGibbney [Wed, 26 Jul 2017 21:19:37 +0000 (14:19 -0700)]
ANY23-304 increase number of extractors found
Lewis John McGibbney [Wed, 26 Jul 2017 21:10:34 +0000 (14:10 -0700)]
ANY23-304 merge with master branch
Jacek Grzebyta [Wed, 26 Jul 2017 21:09:41 +0000 (22:09 +0100)]
Merge branch 'master' into ANY23-311
Signed-off-by: Jacek Grzebyta <grzebyta.dev@gmail.com>
Lewis John McGibbney [Wed, 26 Jul 2017 20:56:53 +0000 (13:56 -0700)]
Fix failing tests regarding ordering of entries in prefixes.properties
Lewis John McGibbney [Wed, 26 Jul 2017 20:44:10 +0000 (13:44 -0700)]
Merge branch 'master' into ANY23-282
Lewis John McGibbney [Wed, 26 Jul 2017 20:34:32 +0000 (13:34 -0700)]
Merge branch 'ANY23-310' of https://github.com/jgrzebyta/any23 into ANY23-310
Jacek Grzebyta [Tue, 25 Jul 2017 23:35:30 +0000 (00:35 +0100)]
Ref ANY23-310
- add unit test with options without logging file
Signed-off-by:Jacek Grzebyta <grzebyta.dev@gmail.com>
Lewis John McGibbney [Mon, 24 Jul 2017 22:21:32 +0000 (15:21 -0700)]
Merge branch 'ANY23-310' of https://github.com/jgrzebyta/any23 into ANY23-310
Jacek Grzebyta [Wed, 19 Jul 2017 15:29:02 +0000 (16:29 +0100)]
Fixed error message in Rover: No suitable extractors found for source
Signed-off-by:Jacek Grzebyta <grzebyta.dev@gmail.com>
William L. Anderson [Fri, 14 Jul 2017 16:07:44 +0000 (11:07 -0500)]
Merge branch 'ANY23-309'
Jacek Grzebyta [Fri, 14 Jul 2017 15:42:46 +0000 (16:42 +0100)]
Add new line to the file end
Signed-off-by:Jacek Grzebyta <grzebyta.dev@gmail.com>
Jacek Grzebyta [Fri, 14 Jul 2017 15:41:38 +0000 (16:41 +0100)]
Clean SimpleRoverTest outcome
Signed-off-by:Jacek Grzebyta <grzebyta.dev@gmail.com>
Jacek Grzebyta [Fri, 14 Jul 2017 15:27:16 +0000 (16:27 +0100)]
Fixed issue ANY23-310
- pipe proper logger in Rover.performExtraction
- make the content length counted
- use StringUtils#join for merging with delimiter
Signed-off-by:Jacek Grzebyta <grzebyta.dev@gmail.com>
Lewis John McGibbney [Fri, 14 Jul 2017 02:50:30 +0000 (19:50 -0700)]
ANY23-282 Replacement for all Sindice namespaces and URIs
Lewis John McGibbney [Fri, 14 Jul 2017 02:40:21 +0000 (19:40 -0700)]
Merge branch 'master' into ANY23-282
Jacek Grzebyta [Thu, 13 Jul 2017 19:19:02 +0000 (20:19 +0100)]
Update values in RDFSchemaUtilsTest.java
Signed-off-by:Jacek Grzebyta <grzebyta.dev@gmail.com>
William L. Anderson [Thu, 29 Jun 2017 19:10:52 +0000 (14:10 -0500)]
ANY23-306 (corrected URLs for "WAR package with dependencies")
William L. Anderson [Thu, 29 Jun 2017 18:50:08 +0000 (13:50 -0500)]
Close ANY23-309 (correct misspelled word)
Jacek Grzebyta [Thu, 13 Jul 2017 18:08:56 +0000 (19:08 +0100)]
Ref ANY23-310
- created unit test
Signed-off-by: Jacek Grzebyta <grzebyta.dev@gmail.com>
William L. Anderson [Thu, 13 Jul 2017 17:58:58 +0000 (12:58 -0500)]
Merge remote-tracking branch 'any23wip/master'
Jacek Grzebyta [Thu, 13 Jul 2017 17:09:39 +0000 (18:09 +0100)]
Simplify rdf graph structure
- if the yaml document contains a map then the document becomes the map
- if a String is a valid IRI then convert it to an IRI
- add additional unit test for parsing tree
Signed-off-by:Jacek Grzebyta <grzebyta.dev@gmail.com>
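The "valid IRI" rule above could be approximated as in the sketch below, assuming that "valid" means the string parses as an absolute URI. The class `IriOrLiteral` and the use of `java.net.URI` are assumptions for illustration; the commit itself may use a different check.

```java
import java.net.URI;
import java.net.URISyntaxException;

// Hypothetical helper, not part of Any23: decides whether a YAML string
// should become an IRI node or stay a plain literal.
final class IriOrLiteral {

    // True when the value parses as a URI and has a scheme (is absolute).
    static boolean looksLikeAbsoluteIri(String value) {
        try {
            return new URI(value).isAbsolute();
        } catch (URISyntaxException e) {
            // Unparseable strings (for example, ones containing spaces) stay literals.
            return false;
        }
    }

    public static void main(String[] args) {
        System.out.println(looksLikeAbsoluteIri("http://example.org/a"));
        System.out.println(looksLikeAbsoluteIri("plain text"));
        System.out.println(looksLikeAbsoluteIri("hello"));
    }
}
```

Note that this check is permissive: any string of the form `scheme:rest`, such as `key:value`, also counts as absolute, so a stricter implementation might restrict the accepted schemes.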
Peter Ansell [Thu, 13 Jul 2017 05:50:45 +0000 (15:50 +1000)]
Merge branch 'ANY23-308-pr'
Signed-off-by: Peter Ansell <p_ansell@yahoo.com>
Peter Ansell [Thu, 13 Jul 2017 05:50:09 +0000 (15:50 +1000)]
Fix compile and test errors to merge in
Signed-off-by: Peter Ansell <p_ansell@yahoo.com>
William L. Anderson [Wed, 12 Jul 2017 19:31:35 +0000 (14:31 -0500)]
Merge remote-tracking branch 'upstream/master'
Jacek Grzebyta [Wed, 12 Jul 2017 17:52:43 +0000 (18:52 +0100)]
Ref ANY23-308
- restore csvutils
- detect yaml based on the file name
- remove utils module
Signed-off-by:Jacek Grzebyta <grzebyta.dev@gmail.com>
Jacek Grzebyta [Wed, 12 Jul 2017 16:06:50 +0000 (17:06 +0100)]
Detect MIME type based on the file URI rather than on the base namespace.
- add file path to meta
- add documentation to unit test
Signed-off-by: Jacek Grzebyta <grzebyta.dev@gmail.com>