Comparison shopping search of similar design and implementation of key words - goods than one hundred (

This period has been busy comparison shopping search goods than one hundred ( SEO work, after the restructuring website, keyword placement and outside the chain of construction, Baidu has been stable for the site included, webmaster tool to query: "Baidu in last days included a website ( 1010 pages, "contains the total number: 72400. (2011-09-20)

Search applications in the realization of shopping time, often encounter similar Taobao search results page, "you are not looking for:", etao search results page under the search box "hot search" and the like demand, that is the key to the current search words, similar to the query keywords.

Seo and website from the existing structure into account, a separate page with a popular show keywords, URL design, such as:

And Achieve results similar words such as:

Comparison shopping search of similar design and implementation of key words - goods than one hundred (

That under the implementation details, you first need a keyword database, the user can search for keywords statistics, you can also collect mall popular keywords. Wrote a simple program, statistics Taobao, Taobao Mall popular keywords more than 100,000 as a keyword library.

Similar to the query, I must mention the TF-IDF. TF-IDF (term frequency-inverse document frequency) is used for information retrieval and information exploration of the common weighting techniques. The main idea is: if a word or phrase in an article in the frequency of TF high and rarely in other articles, is that the word or phrase has a good ability to distinguish between the categories, suitable for classification. TFIDF fact: TF * IDF, TF word frequency (Term Frequency), IDF anti-document frequency (Inverse Document Frequency).

Lucene also provides a similar comparison of the interface, MoreLikeThis interface. Do not talk nonsense, and directly on the code.

Popular Keywords Recommended code:

public List<Hotkey> searchMoreLikeThis(String wd, int top) throws IOException, Exception { MoreLikeThisQuery query = new MoreLikeThisQuery(wd, new String[] { "wd" }, analyzerUtil.ikAnalyzer); TopDocs topDocs = getSearcher().search(query, top); int totalHits = topDocs.totalHits; ScoreDoc scoreDocs[] = topDocs.scoreDocs; top = top < totalHits ? top : totalHits; List<Hotkey> list = new ArrayList<Hotkey>(); for (int i = 0; i < top; i++) { Document doc = getSearcher().doc(scoreDocs[i].doc); Hotkey hotkey = doc2Object(doc); int freq = taobaoItemSearcher.docFreq(hotkey.getWd()); //... Here temporarily invisible ^_^ list.add(doc2Object(doc)); } return list; }

Interested friends look at the site, mention recommendations. Technology, Web site operators, SEO and other interested friends, a lot of exchanges.


You can also add QQ: 909546261, doubt phase analysis.

分类:Internet 时间:2011-09-21 人气:145
blog comments powered by Disqus


  • Android --- floating search box (SearchManager) 2010-11-29

    The use of floating search box is not difficult, but its configuration is very complicated, its use is mainly for developers to easily search for programs in business, better design UI SearchManager specific use the following steps: (1) configuration

  • [Netease Wealth 10-year programming Qualifying first] second question Wealth search box 2010-06-01

    Description The search box in the proper way, when entering one or more characters, the search box will appear a certain number of prompt, as shown below: Now give your word and a number N a, please Shuchu tips Results , to simplify the problem, is p

  • The core support information retrieval technology 2011-04-14

    (Information Retrieval), usually refers to text information retrieval, including information storage, organization, performance, query, access and other aspects, the core of text indexing and retrieval. Historically, information retrieval through a m

  • Introduction to Information Retrieval (translated): Chapter Boolean Search (1) 2010-05-01

    Information retrieval is very broad meaning of the word. Remove the card only from the wallet, then enter the credit card number also belongs to the scope of information retrieval. However, from an academic point of view, information retrieval is def

  • Baidu search box sent directly to the sentence issued search microblogging microblogging 2011-04-07

    Microblogging fire ah, today found that Baidu search box hair microblogging features, Baidu to search for that function well and microblogging linked together, and improve user interaction, a sentence, then the search engine is difficult to understan

  • js - Baidu search box prompts imitation 2011-07-27

    About the content to achieve This is a very simple example, the server is using a jsp page, the return type is xml. Talk about next is how it is in the browser, through ajax request, sending a string of letters, the server side by comparing the retur

  • [Information retrieval] Lucene (1): the basic principle of Search 2010-03-04

    The full text: Study: forfuture1978 (from CSDN) We all know, Lucene is an open source text search engine tool kit. Search in the end then what? This should start with the data in our

  • Personalization: the key to the successful implementation of CRM 2010-09-18

    As the tide of information arrival, as well as changes in business strategy thinking, all kinds of information management systems competing to emerge, some of the advanced enterprise management focus from product-centric to customer-centric changes.

  • The implementation details ConcurrentHashMap 2010-11-04

    ConcurrentHashMap is Java 5, support high-concurrency, high-throughput thread-safe HashMap implementation. Before that I only have a superficial understanding of ConcurrentHashMap only know that it uses a number of locks, probably enough. But in an i

iOS 开发

Android 开发

Python 开发



PHP 开发

Ruby 开发






Javascript 开发

.NET 开发



Copyright (C), All Rights Reserved. 版权所有 黔ICP备15002463号-1

processed in 0.278 (s). 12 q(s)