• Save
Effective Strategies for Searching Oracle UCM
Upcoming SlideShare
Loading in...5
×

Like this? Share it with your network

Share

Effective Strategies for Searching Oracle UCM

  • 6,143 views
Uploaded on

 

More in: Technology
  • Full Name Full Name Comment goes here.
    Are you sure you want to
    Your message goes here
No Downloads

Views

Total Views
6,143
On Slideshare
6,076
From Embeds
67
Number of Embeds
4

Actions

Shares
Downloads
0
Comments
1
Likes
4

Embeds 67

http://cfour.fishbowlsolutions.com 63
http://www.slideshare.net 2
http://webcache.googleusercontent.com 1
http://www.thiswayup.de 1

Report content

Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate.

Cancel
    No notes for slide
  • From the 1st Java release of Intra.doc! until the release of Oracle Content Server 10gR3, Verity’s VDK search integration was the default search solution for Content Servers though alternatives were offered at different timesWith the Verity integration unavailable except to customers already using it and the ability to support it waning, existing customers need to consider moving to another solution and new clients
  • Boolean operatorsPhrase matchingProximity searchingRange searching (dates or numeric data)Attribute search or Zone searchingFuzzy SearchWildcard support (Substring search)Word forms (stemming)Regular ExpressionCase Sensitivity

Transcript

  • 1. Effective Strategies for Searching Oracle UCM
    Fishbowl Solutions
    Oracle Universal Content Management
    Experts Since 1999
  • 2. Where’s My CONTENT?
    One of the key features of a CMS is the ability to search and FIND needed content.
    A Forrester study in 2009 found that
    “Of resondents planning to increase ECM use, 61% said content sharing was the most important driver, followed by compliance, 51%; improved search, 45%; and cost-effective automation, 44%.”
  • 3. Survey
    What version of Content Server are you running?
    10g, 7.5, earlier?
    What’s your search index?
    Verity?
    Metadata only?
    Oracle DB or MS SQL Server Full Text?
    Oracle SES, Google, or FAST?
    Do you have any problems with searching your repository?
  • 4. Background
    Verity VDK
    Integrated Verity VDK search solution previously default indexing solution
    Verity VDK integration no longer available
    In the past the search solution was pretty well defined, but now there are a number of options.
    Both new and existing clients need to decide how they will power indexing of their UCM content
  • 5. What’s options are available?
    Database Metadata only indexing
    Database Full Text
    Oracle 9i, 10g, and 11g
    MS SQL Server
    External Indexes
    Oracle Secure Enterprise Search (SES)
    Fast
    Verity integration
    Google Mini 3rd Party integration
    Others?
  • 6. Which to use???
    Is full text necessary?
    Metadata only indexing creates an overall simpler, lighter weight architecture
    Do you want to search for things not managed in Content Server?
    Enterprise search (custom or 3rd party product required)
    Does your database support full text search? (With the features you need?)
  • 7. Search Features
    Search Vocabulary (Boolean, proximity, zone, stemming, case sensitivity, range searching?)
    Spell checking
    Results pagination
    Ability to limit size of search results (MaxResults)
    Ability to return large hit count
  • 8. Search Features (cont’d)
    Sorting on different types or multiple fields at once (score, metadata, number or date fields)
    Relevance or scoring ability and quality
    Snippet or summary returned with results
    Keyword match highlighting (PDF Highlighting)
  • 9. Indexing Features
    Stop words
    Stemming
    Dictionaries available (multi-language support and support of multiple languages in a single collection)
    File formats supported
    Parametric search or faceted search
  • 10. Indexing Features (cont’d)
    Performance
    Index size (relative to indexed content)
    Page limit (how much content can be indexed?)
    Indexing Latency
    Ability to index large amounts of numeric data
    Metadata indexing (lots of metadata?)
    Rebuild required on metadata additions?
    Indexing SDK return value confirms success
  • 11. Other Considerations
    Scalability
    Platform support (Hardware/OS)
    Support Availability
    Documentation available
    Cost of indexing solution and hardware
  • 12. Verity
    Verity was purchased by Autonomy is focusing their development efforts on their IDOL platform
    Oracle stopped distributing it in June 2008 to new customers
    As of February 28, 2009 the Verity components are no longer available through any distribution channels, including media request or download
    Only solution to support PDF Highlighting
  • 13. Database Search Options
    DisableTotalItemsSearchQuery
    Improves search performance when = false
    Substring searches may perform poorly
    Should not be default search operator
    SearchSortOptions component
    Can be used to optimize search performance
    Recommend indexing commonly searched fields for improved performance!
    CaseInsensitiveSearch component
    For Oracle databases
  • 14. Metadata Only Indexing
    All databases supported by Content Server can be used to support metadata only indexing
    Good for Records Management or scanning solutions where content access isn’t as important and full text indexing is not necessary
    Good for DAM instances where content is not indexable.
  • 15. Database Full Text Indexing
    Oracle Text 11g database is recommended
    Score based sorting
    Performance improvements
    Faceted search available
    Oracle Text 10g is functional (9i not recommended)
    Microsoft SQL Server also supported
    No Score based sorting
  • 16. Secure Enterprise Search
    Oracle intends to provide a limited license with UCM to index UCM content
    Available when 11g is released (most likely)
    Recommended where Content Server database does not support full text indexing
    Similar indexing features to Oracle Text 11g
    Can be used as enterprise search solution with additional licensing
  • 17. Other external indexes
    Fast
    Now owned by Microsoft
    Fast but heavy hardware requirements
    10g integration supported, but not currently available
    Few customers are using this option
    Google Mini integration
    Available from 3rd party includes lease of Google search appliance
    Custom integration
    Component architecture would allow for a Lucene or other search index integration
  • 18. Oracle Text 11g Faceted Search
    3 fields are configured for drill down OTB: and Security Group, and Account. 1 additional could be added.
    They allow you to filter your search by any of these fields.
    This faceted search feature could be added to other search results templates to add value to an interface
  • 19. Oracle Text 11g Faceted Search (cont’d)
  • 20. Content Rating
    Allows users to rate content.
    Content rating is included w/ search results
  • 21. Predictive Search Component
    Google type feature to suggest search terms as users types query
    Captures user search terms and stores those terms that return results
    Fishbowl Solutions component offered to compliment any search index solution
  • 22. Conclusions
    Many customers will need to select a new search option in the not too distant future
    The 1st question is if metadata only is sufficient, if enterprise search is required, and/or if a full text index is required
    There are a lot of options, but not a whole lot of objective information available regarding how to compare the options (some of this is expected to be forthcoming with the 11g release)
    A custom integration may add significant value
  • 23. Fishbowl Solutions, Inc.www.fishbowlsolutions.com
    Questions?
    Oracle Universal Content Management Experts Since 1999