Removing duplicate rows from a DataTable based on a list of key fields

Last updated: 2023-09-27 18:07:33

I'm using the following code to remove duplicate rows from a DataTable based on the value of a single field (keyField):

IEnumerable<DataRow> uniqueContacts = dt.AsEnumerable()
                    .GroupBy(x =>  x[keyField].ToString())
                    .Select(g => g.First());
DataTable dtOut = uniqueContacts.CopyToDataTable();

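How can I extend this code so that my LINQ query removes duplicates based on the values of a list of fields? For example, remove all rows that share the same 'firstname' and 'lastname'?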


You can group by an anonymous type:

IEnumerable<DataRow> uniqueContacts = dt.AsEnumerable()
                    .GroupBy(row =>  new { 
                        FirstName = row.Field<string>("FirstName"),
                        LastName  = row.Field<string>("LastName")
                    })
                    .Select(g => g.First());
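This works because C# anonymous types override Equals and GetHashCode to compare property values, so two rows with the same FirstName and LastName produce equal grouping keys. A minimal sketch of that behaviour follows; the sample rows, the Email column, and the class name AnonymousKeyDemo are made up for illustration and are not part of the question:

```csharp
using System;
using System.Data;
using System.Linq;

public static class AnonymousKeyDemo
{
    // Builds a small sample table; the rows are invented for illustration.
    public static DataTable BuildSample()
    {
        var dt = new DataTable();
        dt.Columns.Add("FirstName", typeof(string));
        dt.Columns.Add("LastName", typeof(string));
        dt.Columns.Add("Email", typeof(string));
        dt.Rows.Add("Ann", "Lee", "ann@example.com");
        dt.Rows.Add("Ann", "Lee", "a.lee@example.com"); // same first and last name
        dt.Rows.Add("Bob", "Lee", "bob@example.com");
        return dt;
    }

    // Keeps the first row of each (FirstName, LastName) group.
    public static DataTable Deduplicate(DataTable dt)
    {
        return dt.AsEnumerable()
            .GroupBy(row => new
            {
                FirstName = row.Field<string>("FirstName"),
                LastName = row.Field<string>("LastName")
            })
            .Select(g => g.First())
            .CopyToDataTable();
    }
}
```

Calling AnonymousKeyDemo.Deduplicate(AnonymousKeyDemo.BuildSample()) should leave two rows, one per distinct name pair.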

Since you want a solution that works with a List<string> whose contents are not known at compile time, you can use this class:

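using System;
using System.Collections.Generic;
using System.Data;
using System.Linq;

// Wraps the values of the selected fields so they can serve as a grouping key.
// Two instances are equal when their value sequences are equal, element by element.
public class MultiFieldComparer : IEquatable<IEnumerable<object>>, IEqualityComparer<IEnumerable<object>>
{
    private IEnumerable<object> objects;
    public MultiFieldComparer(IEnumerable<object> objects)
    {
        // Materialize once so a lazy query is not re-run on every hash/equality check.
        this.objects = objects.ToArray();
    }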
    public bool Equals(IEnumerable<object> x, IEnumerable<object> y)
    {
        return x.SequenceEqual(y);
    }
    public int GetHashCode(IEnumerable<object> objects)
    {
        unchecked
        {
            int hash = 17;
            foreach (object obj in objects)
                hash = hash * 23 + (obj == null ? 0 : obj.GetHashCode());
            return hash;
        }
    }
    public override int GetHashCode()
    {
        return GetHashCode(this.objects);
    }
    public override bool Equals(object obj)
    {
        MultiFieldComparer other = obj as MultiFieldComparer;
        if (other == null) return false;
        return this.Equals(this.objects, other.objects);
    }
    public bool Equals(IEnumerable<object> other)
    {
        return this.Equals(this.objects, other);
    }
}

and this extension method, which uses the class:

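// Extension methods must live in a static class; the name DataRowExtensions is arbitrary.
public static class DataRowExtensions
{
    // Keeps the first row of each group of rows that agree on all the given fields.
    public static IEnumerable<DataRow> RemoveDuplicates(this IEnumerable<DataRow> rows, IEnumerable<string> fields)
    {
        return rows
            .GroupBy(row => new MultiFieldComparer(fields.Select(f => row[f])))
            .Select(g => g.First());
    }
}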

Then it is as simple as this:

List<string> columns = new List<string> { "FirstName", "LastName" };
var uniqueContacts = dt.AsEnumerable().RemoveDuplicates(columns).CopyToDataTable();
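
One thing to watch for: CopyToDataTable throws an InvalidOperationException when the sequence contains no rows, so an empty input table needs a guard. As a possible alternative to grouping, here is a rough sketch that uses Distinct with an IEqualityComparer<DataRow> and handles the empty case. This swaps in a different technique from the one above, and the names ColumnListRowComparer, DistinctRowsDemo and DistinctRows are hypothetical:

```csharp
using System;
using System.Collections.Generic;
using System.Data;
using System.Linq;

// Hypothetical alternative: compares two rows on a runtime list of columns.
public sealed class ColumnListRowComparer : IEqualityComparer<DataRow>
{
    private readonly string[] columns;

    public ColumnListRowComparer(IEnumerable<string> columns)
    {
        this.columns = columns.ToArray(); // snapshot the column names once
    }

    public bool Equals(DataRow x, DataRow y)
    {
        if (ReferenceEquals(x, y)) return true;
        if (x == null || y == null) return false;
        // object.Equals copes with boxed values and DBNull.Value.
        return columns.All(c => object.Equals(x[c], y[c]));
    }

    public int GetHashCode(DataRow row)
    {
        unchecked
        {
            int hash = 17;
            foreach (string c in columns)
            {
                object value = row[c];
                hash = hash * 23 + (value == null ? 0 : value.GetHashCode());
            }
            return hash;
        }
    }
}

public static class DistinctRowsDemo
{
    // Returns a new table with one row per distinct combination of the given columns.
    public static DataTable DistinctRows(DataTable dt, IEnumerable<string> columns)
    {
        List<DataRow> rows = dt.AsEnumerable()
            .Distinct(new ColumnListRowComparer(columns))
            .ToList();
        // CopyToDataTable throws on an empty sequence, so fall back to
        // an empty table with the same schema.
        return rows.Count == 0 ? dt.Clone() : rows.CopyToDataTable();
    }
}
```

Usage would look like DistinctRowsDemo.DistinctRows(dt, columns). Whether this reads better than the GroupBy version is a matter of taste; the main practical difference here is the explicit handling of an empty table.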