This commit is contained in:
fenago
2021-02-09 03:33:04 +05:00
parent 39139f2b0e
commit eda8fb9fc0
2 changed files with 5 additions and 13 deletions
+2 -6
View File
@@ -435,14 +435,10 @@ X_hd = pd.DataFrame(pd.np.tile(adData, (1, 500)))
```
From the output, you can see that the session crashes because all the
From the output, you can see that the session might crash because all the
RAM provided by Jupyter has been used. The session might restart, and you
will lose all your variables. Hence, it is always good to be mindful of
the resources you are provided with, along with the dataset. As a data
scientist, if you feel that a dataset is huge with many features but the
resources to process that dataset are limited, you need to get in touch
with the organization and get the required resources or build an
appropriate strategy to address these high-dimensional datasets.
the resources you are provided with, along with the dataset.
Strategies for Addressing High-Dimensional Datasets
+3 -7
View File
@@ -196,10 +196,8 @@ This generates the following output:
Caption: The temps\_ndarray vector
Note that the output contains single square brackets, `[` and
`]`, and the numbers are separated by spaces. This is
different from the output from a Python list, which you can obtain using
the following code snippet:
```
print(temps_list)
@@ -232,9 +230,7 @@ Caption: Shape of the temps\_ndarray vector
### Matrices
There are times when you need to convert between vectors and matrices.
Let\'s revisit `temps_ndarray`. You may recall that it has
five elements because the shape was `(5,)`. To convert it into
To convert `temps_ndarray` into
a matrix with five rows and one column, you would use the following
snippet: