Hi, All,

I have 6 variables in CSV file. One is rainfall (dependent, at y-axis) and 
others are predictors (at x). I want to do multiple regression and create a 
correlation matrix between rainfall (y) and predictors (x; n1=5). Thus I want 
to read rainfall as a separate variable and others in separate columns, so I 
can apply the algo. However, I am not able to make a proper matrix for them. 

Here are my data and codes?
Please suggest me for the same.
I am new to Python.

RF      P1      P2      P3      P4      P5
120.235 0.234   -0.012  0.145   21.023  0.233
200.14  0.512   -0.021  0.214   22.21   0.332
185.362 0.147   -0.32   0.136   24.65   0.423
201.895 0.002   -0.12   0.217   30.25   0.325
165.235 0.256   0.001   0.22    31.245  0.552
198.236 0.012   -0.362  0.215   32.25   0.333
350.263 0.98    -0.85   0.321   38.412  0.411
145.25  0.046   -0.36   0.147   39.256  0.872
198.654 0.65    -0.45   0.224   40.235  0.652
245.214 0.47    -0.325  0.311   26.356  0.632
214.02  0.18    -0.012  0.242   22.01   0.745
147.256 0.652   -0.785  0.311   18.256  0.924

import numpy as np
import statsmodels as sm
import statsmodels.formula as smf
import csv

with open("pcp1.csv", "r") as csvfile:
    readCSV=csv.reader(csvfile)
    
    rainfall = []
    csvFileList = [] 
    
    for row in readCSV:
        Rain = row[0]
        rainfall.append(Rain)

        if len (row) !=0:
            csvFileList = csvFileList + [row]       
        
print(csvFileList)
print(rainfall)

Please suggest me guys
Thanks

-- 
https://mail.python.org/mailman/listinfo/python-list

Reply via email to