Text mining for corporate information retrieval

Funding Details
Natural Sciences and Engineering Research Council of Canada
  • Grant type: Engage Plus Grants Program
  • Year: 2014/15
  • Total Funding: $12,083
Keywords
Principal Investigator(s)
Collaborator(s)

No researchers found.

Partners

Project Summary

Authoring technical documentation for products is a time-consuming process for industry. Furthermore, products are typically not designed in isolation but belong to product lines, in which different products share features and operating instructions. Authoring technical documentation should therefore be able to build on existing documentation components that are retrieved and adapted to different, but similar, products. Innovatia is an industry leader in helping companies re-use existing documentation materials and deploy them in electronic form. The proposed project will continue to investigate the application of text mining and text visualization techniques to the problem of managing and re-using technical documentation. Different text similarity methods will be applied, aiming to improve the ability of Innovatia's authoring system to identify and cluster together similar document components, even if they use different wording to express similar concepts. In the proposed phase of the project, authors from Innovatia will use the implemented extensions to the company's Content Miner system to evaluate their effectiveness in the authoring task. Additional problems that will be addressed in this phase include the automatic evaluation of language non-uniformity and the application of visual text analytics techniques to deriving insight from help-desk tickets.
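
The summary does not say which text similarity methods the project uses. As a rough illustration of the general idea of grouping similar document components, the following minimal sketch assumes a plain TF-IDF weighting with cosine similarity and a greedy threshold-based grouping. The function names, the threshold value, and the sample sentences are all hypothetical and are not taken from Innovatia's Content Miner system.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase a text and split it into alphanumeric word tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def tfidf_vectors(docs):
    """Return one sparse TF-IDF vector (dict: term -> weight) per document."""
    token_lists = [tokenize(d) for d in docs]
    n_docs = len(docs)
    doc_freq = Counter()
    for tokens in token_lists:
        doc_freq.update(set(tokens))
    vectors = []
    for tokens in token_lists:
        counts = Counter(tokens)
        # Smoothed IDF, so terms found in every document keep a small weight.
        vectors.append({
            term: count * (math.log((1 + n_docs) / (1 + doc_freq[term])) + 1.0)
            for term, count in counts.items()
        })
    return vectors


def cosine(u, v):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(w * v.get(t, 0.0) for t, w in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def cluster(docs, threshold=0.5):
    """Greedy single-pass grouping: a document joins the first cluster
    whose seed (first member) is at least `threshold` similar to it."""
    vectors = tfidf_vectors(docs)
    clusters = []  # lists of document indices; index 0 of each list is the seed
    for i, vec in enumerate(vectors):
        for members in clusters:
            if cosine(vec, vectors[members[0]]) >= threshold:
                members.append(i)
                break
        else:
            clusters.append([i])
    return clusters


# Hypothetical documentation components for two similar products.
components = [
    "Press the power button to turn on the router.",
    "Press the power button to turn on the switch.",
    "Contact support to request a replacement cable.",
]
print(cluster(components))
```

A purely lexical baseline like this only groups components that share vocabulary. Matching components that express similar concepts in different wording, as the summary describes, would need a semantic similarity measure in place of the `cosine` function over TF-IDF vectors.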

Related Projects

The NSERC Business Intelligence Network will create an innovation platform to enhance the collaboration of the top Canadian knowledge and information management researchers and the top Canadian companies in business intelligence, an area that is cent...
We propose an industrial stream, advanced, collaborative graduate Training program in Big Text Data (TRIBE). On the one hand, Big Text represents additional challenges with respect to Big Data, as the information is unstructured and presented in (oft...