The document splitting feature is very useful, but it is currently restricted to only 50 segments. We are dealing with huge documents, so this restriction prevents us from using the feature.
Why is it useful?
Who would benefit from this IDEA? | This enhancement would let customers use the document splitting feature on very large corpora and avoid building custom solutions |
How should it work?
This limit has been increased to 250, and we may be able to increase it further. Let us know if 250 is still not enough.
Thanks Phil, much appreciated.
Per Deepak's comments on https://ibm-watson.ideas.aha.io/ideas/WDS-I-136, our use cases require splits of up to 15000 segments in order to get to a document worth returning as an answer. These are very long PDF and Word documents - happy to provide examples.
We have built a custom WEX pipeline to split documents into paragraphs. It does a pretty good job of splitting large, complex PDFs and docs. We would like to have a similar or better capability in WDS so that our client can move from WEX to WDS.
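To make the request concrete, here is a minimal sketch of paragraph-level splitting with a segment cap. It is not the WEX pipeline or the WDS implementation. It assumes plain text in which blank lines separate paragraphs, and both the function name `split_paragraphs` and the `max_segments` parameter are hypothetical. Merging the overflow into the last segment is also only an assumption about how a cap might behave.

```python
import re


def split_paragraphs(text, max_segments=None):
    """Split plain text into paragraphs on blank lines.

    max_segments is a hypothetical cap on the number of segments.
    When it is set and exceeded, the overflow paragraphs are merged
    into the final segment (an assumed behavior, for illustration only).
    """
    if max_segments is not None and max_segments < 1:
        raise ValueError("max_segments must be at least 1")

    # One or more blank (or whitespace-only) lines mark a paragraph break.
    paragraphs = [p.strip() for p in re.split(r"\n\s*\n", text) if p.strip()]

    if max_segments is not None and len(paragraphs) > max_segments:
        head = paragraphs[:max_segments - 1]
        tail = "\n\n".join(paragraphs[max_segments - 1:])
        paragraphs = head + [tail]
    return paragraphs
```

With a low cap, a long document collapses most of its content into one oversized final segment, which is why a limit of 50 or 250 is hard to work with when a document has thousands of paragraphs.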
Drew/Deepak - can you please create a new Idea for "splitting on paragraphs" if that is the requirement? That's a slightly different feature than splitting on sections.