日期:2014-05-20  浏览次数:21014 次

如何获取网页被哪些蜘蛛爬过?
最近突然想到一个问题,怎么样通过程序来获取哪些网页被哪些蜘蛛来爬过?
不知道是否可以得到相应的信息?


------解决方案--------------------
request.getHeader
------解决方案--------------------
呵呵,这个贴还没结。。。。
给你答案:


先在web.xml中加这么几条
<filter>
<filter-name> FileFilter </filter-name>
<filter-class> com.abc.FileFilter </filter-class>
</filter>
<filter-mapping>
<filter-name> FileFilter </filter-name>
<url-pattern> /*.jsp </url-pattern>
</filter-mapping>
这是把所有的访问xxx.jsp的都先通过com.abc.FileFilter这个类,这样就好记录了
com.abc.FileFilter:

import org.apache.commons.logging.Log;
import org.apache.commons.logging.LogFactory;

import javax.servlet.*;
import javax.servlet.http.HttpServletRequest;
import java.util.Date;


public class FileFilter
implements Filter {
public void init(FilterConfig config) throws javax.servlet.ServletException {
}

private static Log log = LogFactory.getLog(FileFilter.class);

private StringBuffer sb;

public void doFilter(ServletRequest request, ServletResponse response,
FilterChain chain) throws java.io.IOException,
javax.servlet.ServletException {
HttpServletRequest hrequest = (HttpServletRequest) request;
sb = new StringBuffer();
String uri = hrequest.getServletPath();
uri = uri.toLowerCase();
String referer = request.getHeader( "referer ");
log.debug(referer);//=================这里记录访问记录等信息
if (uri.startsWith( "/list- ")) {//================这里是返回到你的页面,具体改一下
hrequest.getRequestDispatcher(sb.append( "/search.jsp?action=singer&keyword= ").append(uri.substring(8, uri.length()).replaceAll( ".jsp ", " ")).toString()).forward(request, response);
sb = null;
}
}

public void destroy() {
}
}


分不是问题,这只是个帐号,只是看你对这个问题的重视程度而已,所以上面我才说“才20分”。。。