Flag to allow for deduplicating authors with author-config.csv #2524

@yizhong187

Problem with existing implementation

Right now, the only way to deduplicate authors with multiple Git usernames is to list every unique author (along with their duplicate Git usernames) in the author-config.csv file. This is a problem for a large repo: the user has to fill in a massive author-config.csv file covering all authors, even if only one author has multiple usernames.

What feature(s) would you like to see in RepoSense

There should be a flag, used alongside --config, that also displays all authors not specified in author-config.csv. Deduplication would then only require specifying the problematic authors and the usernames that need to be collated and treated as a single author.

Example

[Image: screenshot of a repo with multiple "Letian" author records]
For a repo like the one in the screenshot above, with the flag set, a single line in the author-config.csv file should be enough to deduplicate the multiple "Letian" records.
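
To make the requested behaviour concrete, here is a minimal Python sketch of the collation logic. It is not RepoSense code: the function `collate_authors`, the `keep_unlisted` parameter (standing in for the proposed flag), and the usernames other than "Letian" are all hypothetical, since the screenshot's actual usernames are not reproduced here. It assumes that, as described above, authors missing from the config are currently not displayed.

```python
from collections import Counter


def collate_authors(commit_authors, alias_to_display, keep_unlisted):
    """Count commits per author after merging configured aliases.

    commit_authors:   Git author names, one per commit.
    alias_to_display: maps each configured Git username to a display name.
    keep_unlisted:    hypothetical stand-in for the proposed flag; when
                      True, authors missing from the config are kept
                      under their own Git name instead of being dropped.
    """
    counts = Counter()
    for name in commit_authors:
        if name in alias_to_display:
            # Configured alias: collate under the single display name.
            counts[alias_to_display[name]] += 1
        elif keep_unlisted:
            # Unlisted author: keep as-is only when the flag is set.
            counts[name] += 1
    return dict(counts)


# Hypothetical data: one problematic author with two Git usernames.
alias_to_display = {"Letian": "Letian", "letian-work": "Letian"}
commits = ["Letian", "letian-work", "alice", "bob", "alice"]

without_flag = collate_authors(commits, alias_to_display, keep_unlisted=False)
with_flag = collate_authors(commits, alias_to_display, keep_unlisted=True)
print(without_flag)
print(with_flag)
```

With the flag off, only the configured author survives; with it on, the two "Letian" usernames are merged and every other author is still reported, which is the one-line-config workflow this issue asks for.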
