[TXTREC-9] Aggregate all the info of a page Created: 26/Mar/19 Updated: 19/Aug/19 Resolved: 23/Jul/19 |
|
| Status: | Closed |
| Project: | Text Classification |
| Component/s: | None |
| Affects Version/s: | None |
| Fix Version/s: | 1.0 |
| Type: | Story | Priority: | Neutral |
| Reporter: | Laura Delnevo | Assignee: | Le Hai Thanh |
| Resolution: | Fixed | Votes: | 0 |
| Labels: | None | ||
| Remaining Estimate: | 7.5h | ||
| Time Spent: | 2d 0.5h | ||
| Original Estimate: | Not Specified | ||
| Issue Links: |
|
||||||||
| Template: |
|
||||||||
| Acceptance criteria: |
Empty
|
||||||||
| Task DoD: |
[ ]*
Doc/release notes changes? Comment present?
[ ]*
Downstream builds green?
[ ]*
Solution information and context easily available?
[ ]*
Tests
[ ]*
FixVersion filled and not yet released
[ ] 
Architecture Decision Record (ADR)
|
||||||||
| Documentation update required: |
Yes
|
||||||||
| Date of First Response: | |||||||||
| Epic Link: | Txt Classification integration | ||||||||
| Sprint: | Add-Ons 15, Add-Ons 16 | ||||||||
| Story Points: | 5 | ||||||||
| Description |
dev notes Content to be tagged: ideally aggregate content of all the page (but we have a limit from Amazon of 5,000 characters) |
| Comments |
| Comment by Le Hai Thanh [ 16/Jul/19 ] |
|
Solution: Introduce a configuration `text-classification -> aggregateDefinition -> properties`. Only aggregate properties which are presented in AggregateDefinition, is a String and it not empty. All properties will be aggregated into one document.
aggregateDefinition: properties: [title, keywords, description, text]
1. Aggregate properties which are defined in AggregateDefinition.properties for a Node. 2. Aggregate Component's properties (by recursion) which are defined in AggregateDefinition.properties for a Node. 3. Aggregate Area's properties (by recursion) which are defined in AggregateDefinition.properties for a Node. Reason:
Cons: Need to define which properties will be aggregated. |