Visualization and text mining of patent and non-patent data
Upcoming SlideShare
Loading in...5
×

Like this? Share it with your network

Share

Visualization and text mining of patent and non-patent data

  • 2,033 views
Uploaded on

 

  • Full Name Full Name Comment goes here.
    Are you sure you want to
    Your message goes here
    Be the first to comment
No Downloads

Views

Total Views
2,033
On Slideshare
2,031
From Embeds
2
Number of Embeds
1

Actions

Shares
Downloads
54
Comments
0
Likes
1

Embeds 2

http://www.slideshare.net 2

Report content

Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate.

Cancel
    No notes for slide

Transcript

  • 1. Introduction Applications Patent Analytics Medline Analytics Conclusions and future development Visualization and text mining of patent and non-patent data Anton Heijs Treparel Information Solutions Delft, The Netherlands http://www.treparel.com/ ICIC conference , Nice , France , 2008 Visualization and text mining Treparel
  • 2. Introduction Applications Patent Analytics Medline Analytics Conclusions and future development Outline Introduction Applications on patent and non-patent literature Patent mining and visualization Mining and visualization Medline Conclusions and future development Visualization and text mining Treparel
  • 3. Introduction Applications Patent Analytics Medline Analytics Conclusions and future development Introduction Mining + visualization = visual analytics More data - less insight Overview needed Text mining approaches controlling over/under-fit Rule based : difficult to control under fit Natural language processing : difficult to control over fit Machine learning : controls over-fit and under-fit Visualization and text mining Treparel
  • 4. Introduction Applications Patent Analytics Medline Analytics Conclusions and future development Document clustering : finding patterns unsupervised Unsupervised text mining : clustering Input is a vector representation of all documents Separate the data in its natural groups based on a similarity metric Output is a similarity matrix Visualization and text mining Treparel
  • 5. Introduction Applications Patent Analytics Medline Analytics Conclusions and future development Document classification : finding patterns supervised Supervised text mining : classification Input is a vector representation of all documents Determine a hyper plane which separates all the data Binary classification : classify to 1 class Multi class classification : classify to multiple classes Compound classification : combine multiple binary classifiers Output is a matrix with classification scores Visualization and text mining Treparel
  • 6. Introduction Applications Patent Analytics Medline Analytics Conclusions and future development Text analytics Text analytics is text mining and visualization Combined use of mining and visualization for analysis Uses text mining to determine trends in the data Use visualization to make complex data understandable Visualization and text mining Treparel
  • 7. Introduction Applications Patent Analytics Medline Analytics Conclusions and future development Applications on patent and non-patent literature Text mining and visualization steps management and preprocessing data text mining (filtering the data in the pipeline) mapping filtered data to geometry rendering and interaction Visualization and text mining Treparel
  • 8. Introduction Applications Patent Analytics Medline Analytics Conclusions and future development Patent analytics applications Patent search using classification and clustering algorithms Patent landscaping, spotting new opportunities Patent ranking, utilization optimization and valuation Visualization and text mining Treparel
  • 9. Introduction Applications Patent Analytics Medline Analytics Conclusions and future development Search by querie by class Visualization and text mining Treparel
  • 10. Introduction Applications Patent Analytics Medline Analytics Conclusions and future development Tree map visualization of the patent classification Visualization and text mining Treparel
  • 11. Introduction Applications Patent Analytics Medline Analytics Conclusions and future development Clustering of patents of 7 classes in Optical Recording Visualization and text mining Treparel
  • 12. Introduction Applications Patent Analytics Medline Analytics Conclusions and future development Clustering of large patent sets Visualization and text mining Treparel
  • 13. Introduction Applications Patent Analytics Medline Analytics Conclusions and future development Building a classifier Visualization and text mining Treparel
  • 14. Introduction Applications Patent Analytics Medline Analytics Conclusions and future development Testing a classifier Visualization and text mining Treparel
  • 15. Introduction Applications Patent Analytics Medline Analytics Conclusions and future development Visualization of the classifier performance Visualization and text mining Treparel
  • 16. Introduction Applications Patent Analytics Medline Analytics Conclusions and future development Patent analytics application combining mining and visualization Visualization and text mining Treparel
  • 17. Introduction Applications Patent Analytics Medline Analytics Conclusions and future development Correlation between patents Visualization and text mining Treparel
  • 18. Introduction Applications Patent Analytics Medline Analytics Conclusions and future development Patents over time Visualization and text mining Treparel
  • 19. Introduction Applications Patent Analytics Medline Analytics Conclusions and future development Patents visualization hierarchically, supervised, unsupervised and over time Visualization and text mining Treparel
  • 20. Introduction Applications Patent Analytics Medline Analytics Conclusions and future development Text mining and visualizations applications on Medline Search for facts and relationships (using classification and clustering) Disambiguation (using classification) Named entity recognition (using classification) Concept and relationships discovery Knowledge discovery by integrating text mining and visualization with Data mining : mining table data from experiments Image mining : micro array analysis Graph mining : path way analysis Visualization and text mining Treparel
  • 21. Introduction Applications Patent Analytics Medline Analytics Conclusions and future development Mining and visualization Medline Visualization and text mining Treparel
  • 22. Introduction Applications Patent Analytics Medline Analytics Conclusions and future development Search Medline by querie Visualization and text mining Treparel
  • 23. Introduction Applications Patent Analytics Medline Analytics Conclusions and future development Classification of Medline documents to multiple topics Visualization and text mining Treparel
  • 24. Introduction Applications Patent Analytics Medline Analytics Conclusions and future development Visualization of trends in Medline documents over time Visualization and text mining Treparel
  • 25. Introduction Applications Patent Analytics Medline Analytics Conclusions and future development Conclusions Combination of text mining and visualization algorithms enables more application solutions Multiple application solutions are a combination of a small set of text mining algorithms Multiple application solutions require multiple coupled view visualizations to provide complete Visualization and text mining Treparel
  • 26. Introduction Applications Patent Analytics Medline Analytics Conclusions and future development Future development Patent analytics will include patent ranking and valuation Text mining will integrate with data mining , for instance for patent portfolio management Stand alone applications and applications with work flow support will advance Visualization and text mining Treparel
  • 27. Introduction Applications Patent Analytics Medline Analytics Conclusions and future development Trends Patterns Relationships http://www.treparel.com/ Enabling you to learn more and see more Visualization and text mining Treparel