首页 > 解决方案 > 如何在我的数据框中创建一个新变量来表达现有框架中的一些数据

问题描述

所以我有一个数据框来表达年份、工资、学位、性别和年龄之间的关系。年份有 2015 年和 1996 年。如何创建仅表示收入属于 2015 年的新变量?

请帮助我,我现在很沮丧

数据集看起来像

标签: rdataframevariablesstat

解决方案


使用工资数据 和dplyr,这将创建一个名为 New 的新列,并给出 2006 年的工资值。

Wage %>% mutate(New = ifelse(year==2006,wage,NA))

输出:

year age           maritl     race          education             region       jobclass         health health_ins  logwage      wage       New
 2006  18 1. Never Married 1. White       1. < HS Grad 2. Middle Atlantic  1. Industrial      1. <=Good      2. No 4.318063  75.04315  75.04315
 2004  24 1. Never Married 1. White    4. College Grad 2. Middle Atlantic 2. Information 2. >=Very Good      2. No 4.255273  70.47602        NA
 2003  45       2. Married 1. White    3. Some College 2. Middle Atlantic  1. Industrial      1. <=Good     1. Yes 4.875061 130.98218        NA
 2003  43       2. Married 3. Asian    4. College Grad 2. Middle Atlantic 2. Information 2. >=Very Good     1. Yes 5.041393 154.68529        NA
 2005  50      4. Divorced 1. White         2. HS Grad 2. Middle Atlantic 2. Information      1. <=Good     1. Yes 4.318063  75.04315        NA
 2008  54       2. Married 1. White    4. College Grad 2. Middle Atlantic 2. Information 2. >=Very Good     1. Yes 4.845098 127.11574        NA
 2009  44       2. Married 4. Other    3. Some College 2. Middle Atlantic  1. Industrial 2. >=Very Good     1. Yes 5.133021 169.52854        NA
 2008  30 1. Never Married 3. Asian    3. Some College 2. Middle Atlantic 2. Information      1. <=Good     1. Yes 4.716003 111.72085        NA
 2006  41 1. Never Married 2. Black    3. Some College 2. Middle Atlantic 2. Information 2. >=Very Good     1. Yes 4.778151 118.88436 118.88436

推荐阅读