Below is the SAS code:
LIBNAME mydata "/courses/d1406ae5ba27fe300" access=readonly;
DATA new2; set mydata.addhealth_pds;
LABEL
H1TO31="During your life, how many times have you used marijuana?"
H1TO35="During your life, how many times have you used cocaine?"
H1TO41="During your life, how many times have you used any other illegal drug such as LSD, PCP, ecstasy, mushrooms, speed, ice, heroin or pills without a doctor’s prescription?"
MGRP= "Group of How many times Adolescent used Marijuana"
CGRP= "Group of How many times Adolescent used Cocaine"
OGRP= "Group of How many times Adolescent used Other Drugs"
DRUGS="Use drugs either marijuana or cocaine or other drugs";
Recall from the previous step that the data are filtered to include only adolescents who seriously thought about suicide.
IF H1SU1=1; /*Seriously thinking about committing suicide*/
IF H1SU8 >= 2 and H1SU8 <=4 ;/*Somewhat Honest to Completely Honest */
These three variables share the same codes for missing data: 996 (refused), 998 (don't know) and 999 (not applicable). These codes are therefore set to missing. Here is the additional SAS script:
/*Remove 996: refused, 998: Don't know and 999: Not Applicable*/
IF H1TO31 = 996 or H1TO31 = 998 or H1TO31 = 999 then H1TO31 = .;
IF H1TO35 = 996 or H1TO35 = 998 or H1TO35 = 999 then H1TO35 = .;
IF H1TO41 = 996 or H1TO41 = 998 or H1TO41 = 999 then H1TO41 = .;
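As an aside, the three recoding statements above follow the same pattern, so they could also be written with an array. The following is only a minimal sketch, not part of the original program: the dataset name demo, the array name drugvars, the index i and the toy data rows are all made up for illustration.

```sas
/* Standalone sketch: recode 996, 998 and 999 to missing with an array. */
/* demo, drugvars, i and the data rows are illustrative assumptions.    */
DATA demo;
  INPUT H1TO31 H1TO35 H1TO41;
  ARRAY drugvars{3} H1TO31 H1TO35 H1TO41;
  DO i = 1 TO 3;
    /* IN tests the value against the whole list of missing-data codes */
    IF drugvars{i} IN (996, 998, 999) THEN drugvars{i} = .;
  END;
  DROP i; /* the loop index is not needed in the output dataset */
  DATALINES;
3 997 996
998 10 999
;
RUN;

PROC PRINT DATA=demo;
RUN;
```

The array version does the same recoding as the three IF statements; it mainly helps when more variables with the same codes are added later.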
Note: The tables above are truncated to save space.
2. Group Data
The values of these variables range from 1 to 700 or 900. To simplify them, the values are binned. Six groups are created: the five ranges below, plus a separate group 9 for the code 997.
Group 1: 1 - 5 times
Group 2: 6 - 10 times
Group 3: 11 - 20 times
Group 4: 21 - 50 times
Group 5: > 50 times
/*Create bins for each variable*/
IF H1TO31 LE 5 THEN MGRP = 1;
ELSE IF H1TO31 LE 10 then MGRP = 2;
ELSE IF H1TO31 LE 20 then MGRP = 3;
ELSE IF H1TO31 LE 50 then MGRP = 4;
ELSE IF H1TO31 LE 900 then MGRP = 5;
ELSE IF H1TO31 = 997 then MGRP = 9;
IF H1TO35 LE 5 then CGRP = 1;
ELSE IF H1TO35 LE 10 then CGRP = 2;
ELSE IF H1TO35 LE 20 then CGRP = 3;
ELSE IF H1TO35 LE 50 then CGRP = 4;
ELSE IF H1TO35 LE 900 then CGRP = 5;
ELSE IF H1TO35 = 997 then CGRP = 9;
IF H1TO41 LE 5 then OGRP = 1;
ELSE IF H1TO41 LE 10 then OGRP = 2;
ELSE IF H1TO41 LE 20 then OGRP = 3;
ELSE IF H1TO41 LE 50 then OGRP = 4;
ELSE IF H1TO41 LE 900 then OGRP = 5;
ELSE IF H1TO41 = 997 then OGRP = 9;
Based on the table above, among adolescents who thought about suicide, 48.11% used marijuana, 10.45% used cocaine and 20.28% used other drugs.
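One thing to keep in mind with the binning statements in this step: SAS treats a missing numeric value as smaller than any number, so a comparison such as H1TO31 LE 5 is also true when H1TO31 was set to missing earlier. The sketch below shows one possible guard, shown for marijuana only. It is a standalone illustration under assumptions: the dataset name demo_bins and the data rows are made up, and whether the original results were affected by this is not known.

```sas
/* Standalone sketch: keep missing values out of group 1.   */
/* demo_bins and the data rows are illustrative assumptions. */
DATA demo_bins;
  INPUT H1TO31;
  /* Check for missing first, because . LE 5 evaluates to true in SAS */
  IF MISSING(H1TO31) THEN MGRP = .;
  ELSE IF H1TO31 LE 5 THEN MGRP = 1;
  ELSE IF H1TO31 LE 10 THEN MGRP = 2;
  ELSE IF H1TO31 LE 20 THEN MGRP = 3;
  ELSE IF H1TO31 LE 50 THEN MGRP = 4;
  ELSE IF H1TO31 LE 900 THEN MGRP = 5;
  ELSE IF H1TO31 = 997 THEN MGRP = 9;
  DATALINES;
.
3
75
997
;
RUN;

PROC PRINT DATA=demo_bins;
RUN;
```

With the guard in place, the first toy row keeps a missing MGRP instead of falling into group 1.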
3. Create New Variable
One new variable is created to indicate whether an adolescent who thought about suicide used one or more drugs.
/*New Variables DRUGS, 1: use one or more drugs, 0: doesn't use drugs*/
IF H1TO31 < 997 or H1TO35 < 997 or H1TO41 < 997 THEN DRUGS=1;
ELSE DRUGS=0;
In total, 400 adolescents, or about 50.4% of those who thought about suicide, also used marijuana, cocaine or other drugs. For the other 49.62%, the reason for the suicidal thoughts is not clear, and further investigation is needed.
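The same point about missing values applies to the DRUGS condition: a comparison such as H1TO31 < 997 is also true for a missing value. A possible variant, sketched here under assumptions, tests for the valid range of 1 to 900 instead. The dataset name demo_drugs and the data rows are made up for illustration, and a respondent with all three answers missing is still coded 0 in this sketch, which may or may not be the desired treatment.

```sas
/* Standalone sketch: flag drug use only for valid counts (1 to 900). */
/* demo_drugs and the data rows are illustrative assumptions.         */
DATA demo_drugs;
  INPUT H1TO31 H1TO35 H1TO41;
  /* A missing value fails the 1 <= x test, so it no longer sets DRUGS=1 */
  IF (1 <= H1TO31 <= 900) OR (1 <= H1TO35 <= 900) OR (1 <= H1TO41 <= 900)
    THEN DRUGS = 1;
  ELSE DRUGS = 0;
  DATALINES;
. 997 997
12 997 997
997 997 997
;
RUN;

PROC PRINT DATA=demo_drugs;
RUN;
```

In the toy data, only the second row has a valid count, so only that row is flagged as using drugs.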
Here is the rest of the code.
PROC SORT; by AID;
PROC FREQ; TABLES H1TO31 H1TO35 H1TO41 MGRP CGRP OGRP DRUGS;
RUN;