How to select rows based on distinct values of a single column only

151,239

Solution 1

Judging by your expected output, the following query should work. Give it a try:

SELECT * FROM tablename
WHERE id IN
(SELECT MIN(id) FROM tablename GROUP BY EmailAddress)

This selects exactly one row for each distinct email address: the row with the minimum id, which is what your expected result shows.
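To check the query against the sample data from the question, here is a minimal sketch using Python's built-in sqlite3 module rather than SQL Server. The email addresses are masked in the question, so the `@example.com` values below are hypothetical placeholders chosen so that the duplicates line up with the expected result.

```python
import sqlite3

# Hypothetical addresses: the real ones are masked in the question.
rows = [
    (1, 1, "a@example.com", "Mr. A"),
    (2, 1, "b@example.com", "Mr. B"),
    (3, 1, "c@example.com", "Mr. C"),
    (4, 1, "d@example.com", "Mr. D"),
    (5, 1, "a@example.com", "Mr. A"),
    (6, 2, "e@example.com", "Mr. E"),
    (7, 2, "a@example.com", "Mr. A"),
    (8, 3, "f@example.com", "Mr. F"),
    (9, 4, "d@example.com", "Mr. D"),
    (10, 5, "f@example.com", "Mr. F"),
    (11, 6, "d@example.com", "Mr. D"),
]

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE tablename "
    "(Id INTEGER PRIMARY KEY, MailId INTEGER, EmailAddress TEXT, Name TEXT)"
)
conn.executemany("INSERT INTO tablename VALUES (?, ?, ?, ?)", rows)

# Same query as above, with an ORDER BY added so the output order is stable.
result = conn.execute(
    """
    SELECT * FROM tablename
    WHERE Id IN (SELECT MIN(Id) FROM tablename GROUP BY EmailAddress)
    ORDER BY Id
    """
).fetchall()

kept_ids = [row[0] for row in result]
print(kept_ids)  # [1, 2, 3, 4, 6, 8]
```

The GROUP BY sits only inside the subquery, so the outer SELECT can still return every column without aggregating anything.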

Solution 2

Try this - you need a CTE (Common Table Expression) that partitions (groups) your data by distinct e-mail address, and sorts each group by ID - smallest first. Then you just select the first entry for each group - that should give you what you're looking for:

;WITH DistinctMails AS
(
    SELECT ID, MailID, EMailAddress, NAME,
        ROW_NUMBER() OVER(PARTITION BY EMailAddress ORDER BY ID) AS 'RowNum'
    FROM dbo.YourMailTable
)
SELECT *
FROM DistinctMails
WHERE RowNum = 1

This works on SQL Server 2005 and newer (you didn't mention what version you're using...)
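To see what ROW_NUMBER() assigns before the filter is applied, here is a small sketch run through Python's sqlite3 module instead of SQL Server. It assumes the bundled SQLite is version 3.25 or newer (the first release with window functions), drops the `dbo.` schema prefix, and uses a shortened data set with hypothetical `@example.com` addresses.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE YourMailTable "
    "(ID INTEGER PRIMARY KEY, MailID INTEGER, EMailAddress TEXT, NAME TEXT)"
)
# Hypothetical sample: a@example.com appears three times.
conn.executemany(
    "INSERT INTO YourMailTable VALUES (?, ?, ?, ?)",
    [
        (1, 1, "a@example.com", "Mr. A"),
        (2, 1, "b@example.com", "Mr. B"),
        (5, 1, "a@example.com", "Mr. A"),
        (6, 2, "e@example.com", "Mr. E"),
        (7, 2, "a@example.com", "Mr. A"),
    ],
)

# Number the rows within each e-mail address, smallest ID first.
ranked = conn.execute(
    """
    SELECT ID, EMailAddress,
           ROW_NUMBER() OVER (PARTITION BY EMailAddress ORDER BY ID) AS RowNum
    FROM YourMailTable
    ORDER BY ID
    """
).fetchall()

for row in ranked:
    print(row)

# Keeping only RowNum = 1 leaves one row per address.
first_ids = [row[0] for row in ranked if row[2] == 1]
print(first_ids)  # [1, 2, 6]
```

Changing the window's ORDER BY to ID DESC would keep the row with the largest ID per address instead.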

Solution 3

Use this (assuming your table is named emails):

select a.* from emails as a
inner join
(select EmailAddress, min(Id) as id from emails
group by EmailAddress) as b
on a.EmailAddress = b.EmailAddress
and a.Id = b.id

Hope this helps.
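One detail of the join form: selecting * from the whole join also returns the derived table's columns, while a.* keeps the output to the table's own columns. A minimal sketch of the difference, using Python's sqlite3 module and a three-row hypothetical sample (the `@example.com` addresses are placeholders, not the question's data):

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE emails "
    "(Id INTEGER PRIMARY KEY, MailId INTEGER, EmailAddress TEXT, Name TEXT)"
)
conn.executemany(
    "INSERT INTO emails VALUES (?, ?, ?, ?)",
    [
        (1, 1, "a@example.com", "Mr. A"),
        (2, 1, "b@example.com", "Mr. B"),
        (3, 1, "a@example.com", "Mr. A"),
    ],
)

# Shared FROM/JOIN part: each row is matched to the minimum Id of its address.
join = """
    FROM emails AS a
    INNER JOIN (SELECT EmailAddress, MIN(Id) AS id
                FROM emails
                GROUP BY EmailAddress) AS b
        ON a.EmailAddress = b.EmailAddress AND a.Id = b.id
    ORDER BY a.Id
"""

# SELECT * returns the columns of both a and b.
star = conn.execute("SELECT * " + join)
star_cols = [d[0] for d in star.description]

# SELECT a.* returns only the columns of the emails table.
a_only = conn.execute("SELECT a.* " + join)
a_cols = [d[0] for d in a_only.description]
a_rows = a_only.fetchall()

print(len(star_cols), len(a_cols))
print(a_rows)
```

Either way the same rows come back; the choice only affects how many columns each row carries.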

Author by

user576510

Updated on May 08, 2020

Comments

  • user576510
    user576510 about 4 years

    I need to query a table in order to return rows, but I am not able to query the table correctly. Here is my table view:

    Id                MailId          EmailAddress          Name
    1                 1               [email protected]               Mr. A
    2                 1               [email protected]               Mr. B
    3                 1               [email protected]               Mr. C
    4                 1               [email protected]               Mr. D
    5                 1               [email protected]               Mr. A
    6                 2               [email protected]               Mr. E
    7                 2               [email protected]               Mr. A
    8                 3               [email protected]               Mr. F
    9                 4               [email protected]               Mr. D  
    10                5               [email protected]               Mr. F
    11                6               [email protected]               Mr. D
    

    The result set should return:

    Id                MailId          EmailAddress          Name
    1                 1               [email protected]               Mr. A
    2                 1               [email protected]               Mr. B
    3                 1               [email protected]               Mr. C
    4                 1               [email protected]               Mr. D
    6                 2               [email protected]               Mr. E
    8                 3               [email protected]               Mr. F
    

    In other words: first, I want to select distinct e-mail addresses, and then return rows containing distinct e-mail addresses.

    Note: Just using the "Distinct" keyword will not work here, as it will select distinct rows. My requirement is to select distinct email addresses, and then to select rows containing those addresses.

    Edit: I cannot use the "Group By" keyword either, because for this I will also have to Group By with Id (which is the PK) and doing this will return two rows with the same EmailAddress values but with different Ids.

  • blejzz
    blejzz over 12 years
    See my comment; my answer returns the results you wanted.
  • user576510
    user576510 over 12 years
    It returns an error: Msg 8120, Level 16, State 1, Line 1 Column 'EmailContact.RowId' is invalid in the select list because it is not contained in either an aggregate function or the GROUP BY clause.
  • Daria Dragomir
    Daria Dragomir over 7 years
    I think you need to include the name field, not just the email address, although the question doesn't include an example of that.