?
我想将WEB服务器日志通过管道处理然后写入数据库
?
$ cat match.d import std.regex; import std.stdio; import std.string; import std.array; void main() { auto r = regex(`^(\S+) (\S+) (\S+) \[(.+)\] "([^"]+)" ([0-9]{3}) ([0-9]+) "([^"]+)" "([^"]+)" "([^"]+)"`); foreach(line; stdin.byLine) { foreach(m; match(line, r)){ auto c = m.captures; c.popFront(); auto value = join(c, "\",\""); auto sql = format("insert into log(remote_addr,unknow,remote_user,time_local,request,status,body_bytes_sent,http_referer,http_user_agent,http_x_forwarded_for) value(\"%s\");", value ); writeln(sql); } } }
?编译
$ dmd match.d 编译优化(减肥) $ strip match
测试
$ cat access.log | ./match 输出类似 insert into log(remote_addr,unknow,remote_user,time_local,request,status,body_bytes_sent,http_referer,http_user_agent,http_x_forwarded_for) value("192.168.2.76","-","-","19/Mar/2013:15:24:20 +0800","GET /favicon.ico HTTP/1.1","404","193","-","Mozilla/5.0 (Windows NT 6.1; WOW64) AppleWebKit/535.11 (KHTML, like Gecko) Chrome/17.0.963.84 Safari/535.11 SE 2.X MetaSr 1.0","-");
??
用法
?
$ cat access.log | match | mysql -hlocalhost -ulog -p123456 logging
?
高级用法, 实时处理日志
- 首先创建一个管道
- 寻该日志文件写入管道中
- cat ?管道名?| match | mysql -hlocalhost -ulog -p123456 logging ?这样就可以实现实时日志插入。
上面程序稍加修改即可实现Hbase,?Hypertable?插入
?
?
?
?
?
?