mirror of
https://github.com/fenago/data-science.git
synced 2026-05-03 16:10:41 +00:00
added
This commit is contained in:
@@ -435,14 +435,10 @@ X_hd = pd.DataFrame(pd.np.tile(adData, (1, 500)))
|
||||
```
|
||||
|
||||
|
||||
From the output, you can see that the session crashes because all the
|
||||
From the output, you can see that the session might crash because all the
|
||||
RAM provided by Jupyter has been used. The session might restart, and you
|
||||
will lose all your variables. Hence, it is always good to be mindful of
|
||||
the resources you are provided with, along with the dataset. As a data
|
||||
scientist, if you feel that a dataset is huge with many features but the
|
||||
resources to process that dataset are limited, you need to get in touch
|
||||
with the organization and get the required resources or build an
|
||||
appropriate strategy to address these high-dimensional datasets.
|
||||
the resources you are provided with, along with the dataset.
|
||||
|
||||
|
||||
Strategies for Addressing High-Dimensional Datasets
|
||||
|
||||
+3
-7
@@ -196,10 +196,8 @@ This generates the following output:
|
||||
|
||||
Caption: The temps\_ndarray vector
|
||||
|
||||
Note that the output contains single square brackets, `[` and
|
||||
`]`, and the numbers are separated by spaces. This is
|
||||
different from the output from a Python list, which you can obtain using
|
||||
the following code snippet:
|
||||
|
||||
|
||||
|
||||
```
|
||||
print(temps_list)
|
||||
@@ -232,9 +230,7 @@ Caption: Shape of the temps\_ndarray vector
|
||||
### Matrices
|
||||
|
||||
|
||||
There are times when you need to convert between vectors and matrices.
|
||||
Let\'s revisit `temps_ndarray`. You may recall that it has
|
||||
five elements because the shape was `(5,)`. To convert it into
|
||||
To convert `temps_ndarray` into
|
||||
a matrix with five rows and one column, you would use the following
|
||||
snippet:
|
||||
|
||||
|
||||
Reference in New Issue
Block a user