Removing nearly identical strings (noun plurals) from a list

Problem description

I am trying to identify and remove the plural forms of nouns in a list. Essentially, I want the following test to pass:

@Test
public void testRemoveNounPlurals(){
    List<String> listWithDups = List.of("friend", "friends", "dog", "dogs", "serious");

    List<String> filteredList = removeDuplicates(listWithDups); // testing this method

    org.assertj.core.api.Assertions.assertThat(filteredList)
            .hasSize(3)
            .containsOnly("friend", "dog", "serious");
}

My implementation, which is not quite there yet:

public static List<String> removeDuplicates(List<String> list) {

    List<String> maybePlurals = list.stream()
            .filter(s -> s.endsWith("s"))
            .collect(toList()); // friends, dogs, serious

    return list.stream()
             // correctly removes friends and dogs, should keep 'serious'
            .filter( word -> maybePlurals.contains(word.concat("s"))) 
            .collect(toList());
}

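Tags: java

Solution

This solution appends an "s" to each word and checks whether the resulting string exists in the list; if it does, that string is removed as a plural.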

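import java.util.ArrayList;
import java.util.List;

public static List<String> removeDuplicates(List<String> list) {
    // Work on a mutable copy so the input list is left untouched
    List<String> result = new ArrayList<>(list);

    for (String word : list) {
        // If the word with an "s" appended is present, remove it as a plural
        String plural = word + "s";
        result.remove(plural);
    }
    return result;
}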

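I suppose a small optimization is possible by adding

if (word.endsWith("s")) {
     continue;
}

as the first statement in the for loop, so that words already ending in "s" are skipped.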
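The loop above calls List.remove once per word, and each call scans the list, so the work grows quadratically with the list size. For larger inputs, one possible variant is sketched below. It puts the words in a HashSet and drops a word only when it ends in "s" and the same word without that "s" is also present. The class name PluralFilter and the method name removePlurals are made up for this illustration and are not part of the original answer. Unlike List.remove, which removes only the first match, this version drops every occurrence of a detected plural.

```java
import java.util.HashSet;
import java.util.List;
import java.util.Set;
import java.util.stream.Collectors;

public class PluralFilter {

    // Hypothetical helper: same naive "append s" rule, but with set lookups.
    public static List<String> removePlurals(List<String> list) {
        // Hash-based lookup makes each membership check O(1) on average
        Set<String> words = new HashSet<>(list);

        return list.stream()
                // Drop a word only if it ends in "s" and the word without
                // that trailing "s" also appears in the list
                .filter(word -> !(word.endsWith("s")
                        && words.contains(word.substring(0, word.length() - 1))))
                .collect(Collectors.toList());
    }

    public static void main(String[] args) {
        List<String> input = List.of("friend", "friends", "dog", "dogs", "serious");
        System.out.println(removePlurals(input)); // [friend, dog, serious]
    }
}
```

"serious" is kept because "seriou" is not in the list. Like the original answer, this only handles plurals formed by adding a plain "s"; forms such as "boxes" or "children" are outside its scope.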

