日期:2014-05-16 浏览次数:20614 次
查询和删除表中重复数据
文章分类:数据库?
若想将姓名、身份证号、住址这三个字段完全相同的记录查询出来?
select ? p1.* ? from ? persons ? p1,persons ? p2 ? where ? p1.id<>p2.id ? and ? p1.cardid ? = ? p2.cardid ? and ? p1.pname ? = ? p2.pname ? and ? p1.address ? = ? p2.address?
?
可以实现上述效果.?
?
几个删除重复记录的SQL语句?
?
1.用rowid方法?
?
2.用group by方法?
?
3.用distinct方法?
?
1。用rowid方法?
?
据据oracle带的rowid属性,进行判断,是否存在重复,语句如下:?
?
查数据:?
?
?? ? select * from table1 a where rowid !=(select ? max(rowid)?
?? ? from table1 b where a.name1=b.name1 and a.name2=b.name2......)?
?
删数据:?
?
?? ?delete ? from table1 a where rowid !=(select ? max(rowid)?
?? ? from table1 b where a.name1=b.name1 and a.name2=b.name2......)?
?
2.group by方法?
?
查数据:?
?
select count(num), max(name) from student --列出重复的记录数,并列出他的name属性?
group by num?
having count(num) >1 --按num分组后找出表中num列重复,即出现次数大于一次?
?
删数据:?
?
delete from student?
group by num?
having count(num) >1?
这样的话就把所有重复的都删除了。?
?
3.用distinct方法 -对于小的表比较有用?
?
?
create table table_new as ? select distinct * ? from table1 minux?
truncate table table1;?
insert into table1 select * from table_new;?
?
查询及删除重复记录的方法大全?
?
1、查找表中多余的重复记录,重复记录是根据单个字段(peopleId)来判断?
select * from people?
where peopleId in (select peopleId from people group by peopleId having count(peopleId) > 1)?
?
?
2、删除表中多余的重复记录,重复记录是根据单个字段(peopleId)来判断,只留有rowid最小的记录?
delete from people?
where peopleId in (select peopleId from people group by peopleId ? having count(peopleId) > 1)?
and rowid not in (select min(rowid) from people group by peopleId having count(peopleId )>1)?
?
sql server不支持rowid?
我们可以变通实现同样的效果?
eg:?
delete from t_serviceitem?
where servid in (select servid from t_serviceitem group by servid having count(servid)>1)?
and gid not in (select min(gid) from t_serviceitem group by servid having count(servid )>1)?
(gid是该表PK)?
?
3、查找表中多余的重复记录(多个字段)?
select * from vitae a?
where (a.peopleId,a.seq) in (select peopleId,seq from vitae group by peopleId,seq having count(*) > 1)?
?
?
4、删除表中多余的重复记录(多个字段),只留有rowid最小的记录?
delete from vitae a?
where (a.peopleId,a.seq) in (select peopleId,seq from vitae group by peopleId,seq having count(*) > 1)?
and rowid not in (select min(rowid) from vitae group by peopleId,seq having count(*)>1)?
?
?
5、查找表中多余的重复记录(多个字段),不包含rowid最小的记录?
select * from vitae a?
where (a.peopleId,a.seq) in (select peopleId,seq from vitae group by peopleId,seq having count(*) > 1)?
and rowid not in (select min(rowid) from vitae group by peopleId,seq having count(*)>1)?
?
?
(二)?
比方说?
在A表中存在一个字段“name”,?
而且不同记录之间的“name”值有可能会相同,?
现在就是需要查询出在该表中的各记录之间,“name”值存在重复的项;?
Select Name,Count(*) From A Group By Name Having Count(*) > 1?
如果还查性别也相同大则如下:?
Select Name,sex,Count(*) From A Group By Name,sex Having Count(*) > 1?
?
(三)?
方法一?
declare @max integer,@id integer?
declare cur_rows cursor local for select 主字段,count(*) from 表名 group by 主字段 having count(*) >; 1?
open cur_rows?
fetch cur_rows into @id,@max?
while @@fetch_status=0?
begin?
select @max = @max -1?
set rowcount @max?
delete from 表名 where 主字段 = @id?
fetch cur_rows into @id,@max?
end?
close cur_rows?
set rowcount 0?
?
?
方法二?
"重复记录"有两个意义上的重复记录,一是完全重复的记录,也即所有字段均重复的记录,二是部分关键字段重复的记录,比如Name