NumPy – Writing Data to Files
In data science and machine learning, we often need to save processed data for later use.
NumPy provides simple and powerful functions to write arrays into files.
Why Writing Data is Important?
Saving data helps in:
- Reusing processed datasets
- Sharing results
- Training machine learning models
- Storing simulations
Import NumPy
import numpy as np
1. Save Array as Binary File (np.save)
import numpy as np
data = np.array([10, 20, 30, 40, 50])
np.save("data.npy", data)
Explanation:
-
Saves data in
.npyformat - Fast and efficient storage
2. Load Saved File (np.load)
import numpy as np
data = np.load("data.npy")
print(data)
3. Save Multiple Arrays (np.savez)
import numpy as np
a = np.array([1, 2, 3])
b = np.array([4, 5, 6])
np.savez("dataset.npz", arr1=a, arr2=b)
Meaning:
- Stores multiple arrays in one file
- Useful for ML datasets
4. Save as Text File (np.savetxt)
import numpy as np
data = np.array([[1, 2, 3],
[4, 5, 6]])
np.savetxt("data.txt", data)
Explanation:
- Saves data in readable format
- Easy to open in Excel or Notepad
5. Save CSV File Format
import numpy as np
data = np.array([[10, 20], [30, 40]])
np.savetxt("data.csv", data, delimiter=",")
Meaning:
- Creates CSV file
- Common format for datasets
6. Save with Formatting
import numpy as np
data = np.array([1.12345, 2.67891, 3.14159])
np.savetxt("formatted.txt", data, fmt="%.2f")
Output:
1.12
2.68
3.14
7. Append Data (Using File Handling)
import numpy as np
data = np.array([100, 200, 300])
with open("data.txt", "ab") as f:
np.savetxt(f, data)
Real-World Applications
1. Data Science
- Save cleaned datasets
- Store transformed data
2. Machine Learning
- Save training datasets
- Export model inputs
3. Simulation
- Store experiment results
- Log computed values
4. Analytics
- Export reports
- Share processed data
Why Use NumPy Writing Functions?
Using NumPy provides:
- Fast binary storage
- Easy dataset export
- Efficient array handling
- Seamless integration with workflows
Combined with Python, it becomes essential for data engineering and machine learning pipelines.
Summary
NumPy provides multiple ways to write data:
np.save()
np.savez()
np.savetxt()
These functions help store data efficiently.
Conclusion
Writing data to files is a key step in any data science workflow. NumPy makes it fast, flexible, and easy to manage datasets for future use.


0 Comments