Lewis John McGibbney [Thu, 1 Mar 2018 22:58:37 +0000 (14:58 -0800)]
[maven-release-plugin] prepare release any23-2.2
Lewis John McGibbney [Wed, 28 Feb 2018 06:07:12 +0000 (22:07 -0800)]
ANY23-321 fix integration build
Lewis John McGibbney [Wed, 28 Feb 2018 04:43:05 +0000 (20:43 -0800)]
Merge branch 'master' of https://git-wip-us.apache.org/repos/asf/any23
Lewis John McGibbney [Tue, 27 Feb 2018 18:11:57 +0000 (10:11 -0800)]
ANY23-321 Add openie toggle functionality to service
Hans [Mon, 26 Feb 2018 22:47:26 +0000 (16:47 -0600)]
updated pom.xml with Hans Brende developer information
Lewis John McGibbney [Sat, 24 Feb 2018 01:56:20 +0000 (17:56 -0800)]
ANY23-321 Add openie toggle functionality to service
Lewis John McGibbney [Fri, 23 Feb 2018 17:58:54 +0000 (09:58 -0800)]
ANY23-321 Add openie toggle functionality to service
Lewis John McGibbney [Fri, 23 Feb 2018 17:23:10 +0000 (09:23 -0800)]
Merge into master
Lewis John McGibbney [Fri, 23 Feb 2018 16:58:14 +0000 (08:58 -0800)]
Merge branch 'ANY23-328' of https://github.com/HansBrende/any23
Lewis John McGibbney [Tue, 13 Feb 2018 20:26:56 +0000 (12:26 -0800)]
Revert to 2.2
Hans [Sun, 11 Feb 2018 18:11:32 +0000 (12:11 -0600)]
ANY23-328 Strip comments from json-ld content to make parsing more lenient
Lewis John McGibbney [Fri, 9 Feb 2018 22:17:14 +0000 (14:17 -0800)]
Revert Any23 RC#1
Lewis John McGibbney [Fri, 9 Feb 2018 20:46:07 +0000 (12:46 -0800)]
Remove external repositories from pom.xml
Lewis John McGibbney [Fri, 9 Feb 2018 20:35:10 +0000 (12:35 -0800)]
Update poweredby page to account for Nutch usage
Hans [Fri, 9 Feb 2018 05:27:30 +0000 (23:27 -0600)]
ANY23-264 Upgrade to use public commons-csv instead of custom SNAPSHOT
Jacek Grzebyta [Sat, 3 Feb 2018 14:49:38 +0000 (14:49 +0000)]
Merge remote-tracking branch 'HansBrende/ANY23-327'
Signed-off-by: Jacek Grzebyta <grzebyta.dev@gmail.com>
Lewis John McGibbney [Sat, 3 Feb 2018 05:55:12 +0000 (21:55 -0800)]
ANY23-321 Add openie toggle functionality to service
Hans [Mon, 29 Jan 2018 05:30:54 +0000 (23:30 -0600)]
ANY23-327 Change log level to debug for RDFUtils.isAbsoluteIRI()
Lewis John McGibbney [Thu, 25 Jan 2018 18:58:13 +0000 (10:58 -0800)]
[maven-release-plugin] prepare for next development iteration
Lewis John McGibbney [Thu, 25 Jan 2018 18:58:04 +0000 (10:58 -0800)]
[maven-release-plugin] prepare release any23-2.2
Lewis John McGibbney [Thu, 25 Jan 2018 18:44:01 +0000 (10:44 -0800)]
[maven-release-plugin] prepare for next development iteration
Lewis John McGibbney [Thu, 25 Jan 2018 18:43:56 +0000 (10:43 -0800)]
[maven-release-plugin] prepare branch 2.2
Lewis John McGibbney [Thu, 25 Jan 2018 18:43:09 +0000 (10:43 -0800)]
Prepare for Any23 2.2 RC#1
Lewis John McGibbney [Thu, 25 Jan 2018 18:32:21 +0000 (10:32 -0800)]
ANY23-210 Address 1.0 Release Review Discrepancies
Hans [Thu, 25 Jan 2018 05:15:41 +0000 (23:15 -0600)]
ANY23-227, ANY23-268, ANY23-317, ANY23-271, ANY23-273, ANY23-326, ANY23-267 Wrote tests to ensure that all of these issues were fixed by PR #59.
Lewis John McGibbney [Thu, 25 Jan 2018 04:54:23 +0000 (20:54 -0800)]
Merge branch 'ANY23-291' of https://github.com/HansBrende/any23
Hans [Thu, 25 Jan 2018 01:58:25 +0000 (19:58 -0600)]
ANY23-291 Allow JSONLD scripts to be located anywhere in document
Hans [Wed, 24 Jan 2018 12:26:40 +0000 (06:26 -0600)]
ANY23-326 fixed rdfa issue with unclosed input & meta tags
Hans [Tue, 23 Jan 2018 18:18:18 +0000 (12:18 -0600)]
ANY23-324 Added license to TagSoupParsingConfiguration
Hans [Thu, 18 Jan 2018 21:08:27 +0000 (15:08 -0600)]
ANY23-324 Changed default html parser from NekoHTML to Jsoup. This also indirectly fixes ANY23-317, ANY23-273, ANY23-267, and ANY23-326.
Lewis John McGibbney [Mon, 8 Jan 2018 14:42:26 +0000 (09:42 -0500)]
Merge branch 'master' into ANy23-321
Lewis John McGibbney [Mon, 8 Jan 2018 14:35:24 +0000 (09:35 -0500)]
ANY23-309 'Scraper' misspelled as 'Scarper' on Downloads webpage
Lewis John McGibbney [Mon, 8 Jan 2018 14:26:05 +0000 (09:26 -0500)]
Merge branch 'master' into ANY23-321
Lewis John McGibbney [Mon, 8 Jan 2018 13:16:00 +0000 (08:16 -0500)]
Merge branch 'ANY23-320'
Frankie Robertson [Mon, 8 Jan 2018 07:09:22 +0000 (09:09 +0200)]
Fix HTTP repository link
Lewis John McGibbney [Wed, 3 Jan 2018 00:19:05 +0000 (00:19 +0000)]
Resolve merge conflict between master and ANY23-320
Lewis John McGibbney [Wed, 3 Jan 2018 00:16:07 +0000 (00:16 +0000)]
Merge branch 'master' into ANY23-321
Lewis John McGibbney [Wed, 3 Jan 2018 00:05:39 +0000 (00:05 +0000)]
ANY23-321 Add openie toggle functionality to service
Jacek Grzebyta [Tue, 2 Jan 2018 19:27:55 +0000 (19:27 +0000)]
Ref ANY23-316
- update number of cases in the schema test after update yaml schema
Signed-off-by:Jacek Grzebyta <grzebyta.dev@gmail.com>
Jacek Grzebyta [Tue, 2 Jan 2018 18:30:17 +0000 (18:30 +0000)]
Merge branch 'ANY23-316'
Jacek Grzebyta [Mon, 1 Jan 2018 14:16:53 +0000 (14:16 +0000)]
Solved ANY23-316
- remove commented line in test
- RDFUtils: fixed isRelative method
Signed-off-by:Jacek Grzebyta <grzebyta.dev@gmail.com>
Jacek Grzebyta [Mon, 1 Jan 2018 12:56:37 +0000 (12:56 +0000)]
Merge branch 'master' into ANY23-316
Lewis John McGibbney [Mon, 1 Jan 2018 02:58:36 +0000 (02:58 +0000)]
ANY23-320 Address @Ignore tests in Any23 and ANY23-131 Nested Microdata are not extracted
Lewis John McGibbney [Sat, 30 Dec 2017 23:26:15 +0000 (23:26 +0000)]
Fix for broken live link in org.apache.any23.cli.CrawlerTest
Lewis John McGibbney [Sat, 30 Dec 2017 22:59:25 +0000 (22:59 +0000)]
Fix for Tika and RDF4J upgrades
Lewis John McGibbney [Sat, 30 Dec 2017 18:59:29 +0000 (18:59 +0000)]
ANY23-140 Revise Any23 tests to remove fetching of web content
Lewis John McGibbney [Sat, 30 Dec 2017 17:21:57 +0000 (17:21 +0000)]
Move LICENSE.txt to LICENSE.md
Lewis John McGibbney [Sat, 30 Dec 2017 17:17:54 +0000 (17:17 +0000)]
Update README.md with badges
Lewis John McGibbney [Sat, 30 Dec 2017 17:11:45 +0000 (17:11 +0000)]
Merge branch 'ANY23-140'
Lewis John McGibbney [Sat, 30 Dec 2017 17:08:41 +0000 (17:08 +0000)]
ANY23-318 ExtractionException handling in BaseRDFExtractor.java kills entire extraction
Lewis John McGibbney [Sat, 30 Dec 2017 13:38:17 +0000 (13:38 +0000)]
Merge branch 'master' into ANY23-318
Lewis John McGibbney [Sat, 30 Dec 2017 02:13:14 +0000 (02:13 +0000)]
ANY23-319 Upgrade jsonld-java dependency to 0.11.1
Lewis John McGibbney [Wed, 27 Dec 2017 21:41:39 +0000 (21:41 +0000)]
ANY23-140 - Revise Any23 tests to remove fetching of web content
Lewis John McGibbney [Wed, 27 Dec 2017 20:06:08 +0000 (20:06 +0000)]
ANY23-318 ExtractionException handling in BaseRDFExtractor.java kills entire extraction
Jacek Grzebyta [Tue, 26 Dec 2017 00:37:13 +0000 (00:37 +0000)]
Solved ANY23-316
- create value for null values: yaml:Null
- update null-aware unit test
- modify example for unit test: 'null' value is an item of a list
Signed-off-by: Jacek Grzebyta <grzebyta.dev@gmail.com>
Lewis John McGibbney [Tue, 19 Dec 2017 12:46:40 +0000 (04:46 -0800)]
Merge branch 'ANY23-314'
Lewis John McGibbney [Tue, 19 Dec 2017 12:45:59 +0000 (04:45 -0800)]
Merge branch 'ANY23-298'
Jacek Grzebyta [Thu, 14 Dec 2017 12:16:13 +0000 (12:16 +0000)]
Merge remote-tracking branch 'origin/ANY23-312-b'
Lewis John McGibbney [Wed, 13 Dec 2017 18:31:06 +0000 (10:31 -0800)]
ANY23-314 Service fails to return extraction in case of extraction error
Lewis John McGibbney [Wed, 13 Dec 2017 18:26:04 +0000 (10:26 -0800)]
ANY23-298 Revisit the OGP.java vocabulary and update it
Lewis John McGibbney [Tue, 12 Dec 2017 21:51:48 +0000 (13:51 -0800)]
ANY23-314 Service fails to return extraction in case of extraction error
Jacek Grzebyta [Tue, 12 Dec 2017 16:48:11 +0000 (16:48 +0000)]
Solved issues in javadoc description.
- replaces Map.Entry by dedicated small getters bean.
- other small changes
Signed-off-by:Jacek Grzebyta <grzebyta.dev@gmail.com>
Jacek Grzebyta [Thu, 7 Dec 2017 17:02:42 +0000 (17:02 +0000)]
Fixed javadoc error
Signed-off-by:Jacek Grzebyta <grzebyta.dev@gmail.com>
Jacek Grzebyta [Thu, 7 Dec 2017 12:24:26 +0000 (12:24 +0000)]
Merge branch 'ANY23-312'
- Fixed yaml parser
- add more unit tests
Jacek Grzebyta [Thu, 7 Dec 2017 12:19:42 +0000 (12:19 +0000)]
Clean code
- remove some tabs in empty rows
Signed-off-by:Jacek Grzebyta <grzebyta.dev@gmail.com>
Jacek Grzebyta [Mon, 20 Nov 2017 00:41:12 +0000 (00:41 +0000)]
Fixed problem with list parser
- in raw rdf nodes are not reversed
Signed-off-by:Jacek Grzebyta <grzebyta.dev@gmail.com>
Jacek Grzebyta [Sun, 19 Nov 2017 00:11:56 +0000 (00:11 +0000)]
Add unit test for extracting lists.
Signed-off-by:Jacek Grzebyta <grzebyta.dev@gmail.com>
Jacek Grzebyta [Sat, 18 Nov 2017 23:20:05 +0000 (23:20 +0000)]
Update License information if files heads.
Signed-off-by:Jacek Grzebyta <grzebyta.dev@gmail.com>
Lewis John McGibbney [Thu, 16 Nov 2017 18:54:17 +0000 (10:54 -0800)]
Add CONTRIBUTING guide
Lewis John McGibbney [Wed, 15 Nov 2017 05:02:52 +0000 (21:02 -0800)]
Merge branch 'master' of https://github.com/imduffy15/any23
Lewis John McGibbney [Wed, 15 Nov 2017 04:57:41 +0000 (20:57 -0800)]
Merge branch 'patch-1' of https://github.com/The-Alchemist/any23
Lewis John McGibbney [Wed, 15 Nov 2017 04:57:25 +0000 (20:57 -0800)]
Website updates as published to production
Ian Duffy [Wed, 8 Nov 2017 13:59:42 +0000 (13:59 +0000)]
Support attribute content on all fields.
<sometag content="something" /> should be considered, regardless if `content` is not a valid
attribute of `sometag`.
The specification for microdata[1] details that an elements content attribute should be considered
before text content.
Any23 doesn't currently do this, it only considers `content` for `meta` tags which is the only
HTML tag which is suppose to have a `content` but not all sites follow HTML specifications.
Updating the microdata parser to be able to get `content` from any element should it exist.
[1] https://www.w3.org/TR/microdata/#values
Signed-off-by: Ian Duffy <ian.duffy@zalando.ie>
Jacek Grzebyta [Mon, 6 Nov 2017 12:35:00 +0000 (12:35 +0000)]
Fix ANY23-312
- solved blank nodes creation problem for maps
- add another unit test example typical for configuration files
Signed-off-by: Jacek Grzebyta <grzebyta.dev@gmail.com>
Jacek Grzebyta [Fri, 3 Nov 2017 22:25:29 +0000 (22:25 +0000)]
Update ElementProcessor
- instantiate a map's root node
- update unit tests
Signed-off-by:Jacek Grzebyta <grzebyta.dev@gmail.com>
Jacek Grzebyta [Fri, 27 Oct 2017 19:03:18 +0000 (20:03 +0100)]
Fix ANY23-312
- fix problem with making wrong IRI if docIRI ends with # character
Signed-off-by:Jacek Grzebyta <grzebyta.dev@gmail.com>
Jacek Grzebyta [Fri, 27 Oct 2017 16:17:10 +0000 (17:17 +0100)]
Fix ANY23-312
- fix unit test for literals
Signed-off-by:Jacek Grzebyta <grzebyta.dev@gmail.com>
The Alchemist [Thu, 26 Oct 2017 20:36:04 +0000 (16:36 -0400)]
Formatting was off
Jacek Grzebyta [Mon, 23 Oct 2017 13:55:55 +0000 (14:55 +0100)]
Ref ANY23-312
- ElementProcessor: fix issue in map validation
returns always empty model if parsed to literal
- YamlExtractor: fix issue with yaml containing only literals
- update unit tests
Signed-off-by:Jacek Grzebyta <grzebyta.dev@gmail.com>
Jacek Grzebyta [Mon, 23 Oct 2017 11:40:21 +0000 (12:40 +0100)]
Update YamlExtractor
- update unit tests
- add test to simple text file
Signed-off-by:Jacek Grzebyta <grzebyta.dev@gmail.com>
Jacek Grzebyta [Mon, 23 Oct 2017 11:36:04 +0000 (12:36 +0100)]
Update ElementProcessor - RDF-izer
- make processor singleton
- update unit tests
Signed-off-by:Jacek Grzebyta <grzebyta.dev@gmail.com>
Jacek Grzebyta [Sat, 21 Oct 2017 09:27:06 +0000 (10:27 +0100)]
Create ElementProcessor - RDF-izer
- Add RDF-iser based on types
- add unit tests
Signed-off-by: Jacek Grzebyta <grzebyta.dev@gmail.com>
Jacek Grzebyta [Fri, 20 Oct 2017 16:41:40 +0000 (17:41 +0100)]
Create ElementProcessor
- add unit test
Signed-off-by: Jacek Grzebyta <grzebyta.dev@gmail.com>
Jacek Grzebyta [Wed, 18 Oct 2017 17:50:30 +0000 (18:50 +0100)]
Update RDFUtils
- remove some converters to util class
Signed-off-by:Jacek Grzebyta <grzebyta.dev@gmail.com>
Jacek Grzebyta [Tue, 17 Oct 2017 16:22:53 +0000 (17:22 +0100)]
Ref ANY23-312
- buildNode method returns Optional wrapper
Signed-off-by:Jacek Grzebyta <grzebyta.dev@gmail.com>
Jacek Grzebyta [Tue, 17 Oct 2017 16:06:44 +0000 (17:06 +0100)]
Ref ANY23-312
- update example file
- update unit test
Signed-off-by:Jacek Grzebyta <grzebyta.dev@gmail.com>
Lewis John McGibbney [Fri, 15 Sep 2017 07:14:07 +0000 (00:14 -0700)]
[maven-release-plugin] prepare for next development iteration
Lewis John McGibbney [Fri, 15 Sep 2017 07:14:01 +0000 (00:14 -0700)]
[maven-release-plugin] prepare release any23-2.1
Lewis John McGibbney [Fri, 15 Sep 2017 06:19:19 +0000 (23:19 -0700)]
[maven-release-plugin] prepare for next development iteration
Lewis John McGibbney [Fri, 15 Sep 2017 06:19:13 +0000 (23:19 -0700)]
[maven-release-plugin] prepare branch 2.1.0
Lewis John McGibbney [Fri, 15 Sep 2017 03:17:46 +0000 (20:17 -0700)]
Preparing files for Any23 2.1 RC#1
Jacek Grzebyta [Wed, 13 Sep 2017 14:15:20 +0000 (15:15 +0100)]
Merge branch 'ANY23-311-2'
- fixed javadoc issues
Jacek Grzebyta [Wed, 13 Sep 2017 14:14:21 +0000 (15:14 +0100)]
Fixed ANY23-311
- solved javadoc problems
Signed-off-by:Jacek Grzebyta <grzebyta.dev@gmail.com>
Jacek Grzebyta [Wed, 13 Sep 2017 10:46:46 +0000 (11:46 +0100)]
Merge branch 'ANY23-311' of https://github.com/jgrzebyta/any23
Jacek Grzebyta [Sat, 9 Sep 2017 20:13:37 +0000 (21:13 +0100)]
Fix testing issue
- add documentation to RDFUtils class
Signed-off-by:Jacek Grzebyta <grzebyta.dev@gmail.com>
Jacek Grzebyta [Tue, 29 Aug 2017 11:41:16 +0000 (12:41 +0100)]
Merge branch 'master' into ANY23-311
- Resolve conflict in YAMLExtractor.java
Signed-off-by:Jacek Grzebyta <grzebyta.dev@gmail.com>
Lewis John McGibbney [Wed, 23 Aug 2017 20:26:23 +0000 (13:26 -0700)]
ANY23-304 skip tests in openie module
Lewis John McGibbney [Wed, 23 Aug 2017 19:15:56 +0000 (12:15 -0700)]
ANY23-304 Add extractor for OpenIE
Lewis John McGibbney [Thu, 27 Jul 2017 19:16:29 +0000 (12:16 -0700)]
ANY23-304 implement temporary file reader within test logic
Lewis John McGibbney [Wed, 26 Jul 2017 21:19:37 +0000 (14:19 -0700)]
ANY23-304 increase number of extractors found