IBM Watson™ Discovery Service Ideas

We've moved...

You'll be redirected shortly, we've moved to our new idea portal: https://ibm-watson.ideas.aha.io

Remove restriction of 50 segments in document splitting feature of Discovery Service

Document splitting feature is very useful but currently it restricts to only 50 segments. We are dealing with huge documents so this restriction does not allow us to use this feature.

  • Guest
  • Nov 16 2017
  • Shipped
  • Attach files
  • Admin
    Phil Anderson commented
    March 16, 2018 12:54

    This limit has been increased to 250, and may be able to increase further - let us know if 250 is still not enough

  • Jaysen Ollerenshaw commented
    March 18, 2018 22:46

    Thanks Phil, much appreciated.

  • Drew JOHNSON commented
    May 15, 2018 08:35

    Per Deepak's comments on https://ibm-watson.ideas.aha.io/ideas/WDS-I-136, our use cases require splits of up to 15000 segments in order to get to a document worth returning as an answer. These are very long PDF and Word documents - happy to provide examples.

  • Deepak Sekar commented
    May 16, 2018 04:58

    We have built a WEX custom pipeline to split documents into paragraphs. It does a pretty good job in splitting complex & large pdfs and docs. Would like to have similar or better capability in WDS for our client to move from WEX to WDS

  • Admin
    Phil Anderson commented
    June 14, 2018 12:46

    Drew/Deepak - can you please create a new Idea for "splitting on paragraphs" if that is the requirement?  That's a slightly different feature then splitting on section.