php - Building a fast semantic MySQL search engine for private articles from scratch
I am working on a project that involves full-text and semantic searches of articles within a site (if it's not possible to combine them, the user can select either option). These articles are subscription-based and can only be searched after logging in; they are not accessible to external search engines or APIs.
I have read that Sphinx handles full-text keyword searches (and I intend to implement that aspect), but I am not sure how to go about building a semantic search engine out of this. E.g. searching "U.S. president" should list articles that contain references to the actual names of U.S. presidents, e.g. George Washington or Bill Clinton (or William Jefferson Clinton).
I have an idea that maybe some sort of tagging system can be used to relate various keywords, e.g. relate "president" to George Washington and "president" to Bill Clinton, but since the data is huge and many such relations exist, I don't know how to take this idea further.
Please advise me on how to go about building a semantic search engine from scratch (I guess Sphinx can handle the full-text keyword search). Otherwise, please point me to Internet-based resources, or tell me if there is existing software in any language that I can integrate into my application.
P.S. My database of choice is MySQL (please advise if another database system is more suitable for the task), and I prefer to program in PHP, but if I need to learn Python or another language that is more effective for the task, I am willing.
I have searched at answers.semanticweb.com.
I use Apache Solr. I think it's more flexible than Sphinx. Solr supports full-text search, and I believe it has add-ons for semantic support (like SIREn). Solr is a serverized version of Lucene.
Solr supports a SynonymFilter: http://wiki.apache.org/solr/analyzerstokenizerstokenfilters#synonymfilter
This post discusses strategies for optimizing content retrieval: http://www.lucidimagination.com/devzone/technical-articles/optimizing-findability-lucene-and-solr
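To make the synonym idea concrete, here is a minimal sketch of query-time expansion in plain Python. It is not how Solr implements its filter; it only illustrates the principle that a concept such as "U.S. president" gets expanded into the names it should match. The `CONCEPT_SYNONYMS` map, the `expand_query` and `search` helpers, and the sample articles are all hypothetical names made up for this illustration. In a real setup the mapping would live in the search engine's synonym configuration rather than in application code.

```python
# Hypothetical concept-to-names map (an assumption for illustration only).
# In a real deployment this data would be maintained in a synonyms file.
CONCEPT_SYNONYMS = {
    "u.s. president": [
        "george washington",
        "bill clinton",
        "william jefferson clinton",
    ],
}


def expand_query(query, synonyms=CONCEPT_SYNONYMS):
    """Return the normalized query followed by any names mapped to it."""
    # Lowercase and collapse whitespace so lookups are consistent.
    key = " ".join(query.lower().split())
    return [key] + synonyms.get(key, [])


def search(articles, query):
    """Return ids of articles whose text contains any expanded term."""
    terms = expand_query(query)
    return [
        article_id
        for article_id, text in articles.items()
        if any(term in text.lower() for term in terms)
    ]


# Toy corpus standing in for the subscription articles.
ARTICLES = {
    1: "George Washington crossed the Delaware in 1776.",
    2: "A review of MySQL indexing strategies.",
    3: "William Jefferson Clinton took office in 1993.",
}

print(search(ARTICLES, "U.S. President"))  # prints [1, 3]
```

The substring scan is only for demonstration; with a large corpus the expanded terms would be handed to the full-text index instead. The hard part remains building and maintaining the mapping itself, which is what the question is really about.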