mirror of
https://github.com/czroth/philosophy-of-data-science.git
synced 2025-12-06 00:18:46 +00:00
Add comment on removing data no longer needed to *Archiving* section.
This commit is contained in:
@@ -106,10 +106,13 @@ An appreciation for neighbouring departments will help align the Data Scientist'
|
||||
|
||||
#### Archiving
|
||||
|
||||
At the end of the data's [life cycle](#data-life-cycle) an evaluation should take place to determine if the data should be archived for possible future analysis.
|
||||
At the end of the data's [life cycle](#data-life-cycle) an evaluation should take place to determine if the data should be archived for possible future analysis or data auditing.
|
||||
Future analysis tools will likely yield greater and more accurate results that present tools.
|
||||
If the results of a future, better analysis results could have practical or historical value (and the archival costs are not prohibitive) archiving the data should be considered.
|
||||
|
||||
If the result of the evaluation is that the data no longer has value relative to its storage cost, the space should be freed up.
|
||||
Practically, this means scheduling data reviews to make and follow through on these determinations. Not holding to this discipline results in a larger data footprint than required.
|
||||
|
||||
#### Privacy
|
||||
|
||||
It is the responsibility of the Data Scientist (and IT Security) to prevent the leakage of private data.
|
||||
|
||||
Reference in New Issue
Block a user