Home

pavank

Hi Experts, Goodmorning, Here i get different output for left join with Datastep vs Proc sql how works datastep joins and proc sql joins please explain data EXP1; Input ID Name$ Marks $ ; cards; 101 ravi 70% 102 sanjay 80% 103 suraj 88% 105 kirit 75% 104 anshu 87% ; run; data EXP2; Input ID Name$ Sex $; cards; 101 ravi M 102 sanjay M 103 suraj M 104 monit M 105 mukesh M ; run; proc sort data=exp1; by id; run; proc sort data=exp2; by id; run; title 'LEFT JOIN IN DATASTEP'; data leftjoin; merge exp1 (in=a) exp2 (in=b); by id; if a; proc print ; run; title 'LEFT JOIN IN PROC SQL'; proc sql number; select * from exp1 a left join exp2 b on a.id=b.id; quit;

shrikrishna

I need a program to loop through dataset1, which will select 10 usernames from dataset1 filter on those usernames in dataset2 and export that data to the .xlsx file, then select next set of 10 usernames. I have set of 10000 usernames. I am new to SAS, will really appreciate all the help.

thistleandtweed

Hello, I'm dealing with unstructured text data, and I need to conduct unsupervised multi-class classification of it. I managed to create a term-by-document matrix of my corpus by using PROC TEXTMINE using single-value decomposition (SVD). My approach to classifying this data is to conduct K-Means clustering and then analyse the clusters to segregate the text into pre-defined topics automatically. However, after viewing the score table of PROC FASTCLUS, I am a bit lost as to how to continue with my evaluation of the results. For reference, this is how my summary table for PROC FASTCLUS looks like right now. Do let me know if I have to provide more information about my table. Sorry if its a basic question, and thanks in advance!

scobaroy

for example, here is my data about people who use a drug X. ID start_date end_date 001 2020-01-10 2020-02-08 001 2020-02-09 2020-03-09 001 2020-03-01 2020-03-30 001 2021-05-07 2021-06-06 002 2020-07-01 2020-07-30 002 2020-07-26 2020-08-24 002 2020-08-15 2020-09-13 002 2021-05-07 2021-06-06 Assume the supply for this drug is 30 days . It is clear that there are days overlapped, in this case, I would like to add the overlapped days to the last end date for each time interval. e.g. For 001, there are 9 days overlapped (2020-03-01 to 2020-03-09); e.g. For 002, there are 5 days and 10 days overlapped; I wish I could have a simple time interval including start to end, and plus overlapped days. What I want to see is a dataset like this: ID start_date end_date 001 2020-01-10 2020-04-07 (2020-03-30 +9 days) 001 2021-05-07 2021-06-06 002 2020-07-01 2020-09-27 (2020-09-13 +5+10 days) 002 2021-05-07 2021-06-06 Hope I could get some help

Arteem

Hi, I need help for Table of content creation while combining RTF. When I created individual Tables RTF output I have used footnote statement instead line statement under proc report. so now I have empty space between end of table and footnote (due to footnote at end of page). When I write below code ,TOC text is printing in between empty space of table end and footnote. so it is giving incorrect page number for Table of contents. ods rtf text = "^S={outputwidth=100% just=l font_face='Tahoma' font_size=3.0}{\tc\fs0\cf8 &&ttl&i.}"; As an example . Table1 TOC text is printing on previous page which is Table of content page so it is giving page 1 but Table is starting from page 2. same as Table 2 TOC text is printing between table 1 end and footnote so on TOC page for Table 2 it has page number 2 but it should be page 3. if I change footnote with line statement then footnote is printing directly under end of table and thus no empty space so each TOC text printing at top of each table and it is giving correct page number since TOC text at top of each table. in this case for Table 1 page number 2 and table 2 page number 3 and 1st page is TOC page. Can someone let me know if there is any way to correct TOC page number when footnote statement was used for footnote instead line statement.