IBM Watson™ Discovery Service Ideas

We've moved...

You'll be redirected shortly, we've moved to our new idea portal: https://ibm-watson.ideas.aha.io

Ability to Split Documents At Ingest Time

It would be useful to be able to provide Discovery logic that would allow it to split one ingested document into multiple indexed documents.  For example, make every paragraph or page a single Discovery document.

  • Phil Anderson
  • Jun 5 2017
  • Shipped
Why is it useful?
Who would benefit from this IDEA? As Deb, I want to create answers from our FAQ for questions people are asking my chat bot
How should it work?
Idea Priority Medium
Priority Justification
Customer Name
Submitting Organization
Submitter Tags
  • Attach files
  • Senthil B commented
    June 30, 2017 07:01

    +1

  • Admin
    Phil Anderson commented
    June 30, 2017 12:29

    Hi Senthil, no need to type +1, just ensure you click the vote button, which actually gives this a plus one :)

  • Lalit Agarwalla commented
    July 20, 2017 15:31

    Along with splitting the document, there is also a need to have HTML version of the text in another field (can be made optional). When we split using Document Conversion as answer units, everything becomes plain text. So even if there is a table or list, it all becomes mixed up.

    Idea is to having a "html" field along with "text" field in the json, just like it is having when we upload html file.

     

     

  • Percy Shi commented
    July 25, 2017 04:46

    have we got some update on this requirement?

    thanks!

  • Admin
    Phil Anderson commented
    October 03, 2017 16:35

    This is now in Production (in beta)

  • Percy Shi commented
    October 03, 2017 16:41

    @James Anderson

    Do we have some document/info about this feature?

     

    thanks!

  • Admin
    Phil Anderson commented
    October 03, 2017 16:48

    Yes, you can read the docs here: https://console.bluemix.net/docs/services/discovery/building.html#doc-segmentation and the announcement here https://apps.na.collabserv.com/blogs/152f58a2-3bb3-4992-86a7-c56ad4bbd21c/entry/Document_Splitting_answer_units_Beta_Released?lang=en_us

  • Percy Shi commented
    October 03, 2017 16:53

    thanks, @James Anderson !

  • Guest commented
    December 12, 2017 05:37

    Is there an expectation to add configuring of splitting into the Discovery Tool?  So setting up splitting would happen in the UI rather than using the API...?