Sunday, 23 November 2014

Managing Data Growth & Data Archival

Recently I have faced one scenario around data archival concerns which made me think and look little deep into data archival solutions & need for the same.

Problem Statement
Business now tracks almost every piece of data that are created as part of any business interactions. Now large terabyte online transaction processing database systems are becoming common. The rate at which data is growing has increased rapidly. Now it takes months rather than years for the data in a typical transactional database to grow from terabytes to petabytes to exabytes. It has resulted in greater challenges in terms of data management.  It has a negative impact on the performance of the application as data operations takes more time to complete. (We could alleviate the problem to a certain extent using caching solutions,  splitting data based on geographies/data range/other conditions or other means.)
Huge data growth also increased the operational cost in terms of provisioning of additional costly storage space; the associated cost involved in physical space/cooling cost/other things. It also brings in additional burden to database management  operations like data movements, database upgrades plus other regular administrative tasks. At the same time, keeping old sensitive information like customer credit card information/healthcare reports in a large production database systems beyond certain period will also pose serious security risk if it is not managed properly.

Solution Considerations
One of the straightforward approaches to the above problem is to remove the unwanted data on a periodic basis. But it will not address the concern completely. As per one study, majority of the data operation involves only data that is created within two-three years. The access to the data that is three-six years older is very minimal, and one more than six years is rare. So  scalable approach is the adoption of tiered storage approach where active data is placed in the costly SAN storage  tier  but inactive one in low-cost storage tier. Let's look at the solution consideration  little deeper for building full fledged data archival solution.


  • One of the primary consideration for data archival solution is to understand the data entities & its relationship in the source system.  Without well-defined meta-data management approach, data archiving exercise is bound to fail.
  • Next is the data retention consideration. Data retention requirements should be looked from the business requirements angle  for instance in terms of compliance requirements ( like PCI-DSS, SOX plus others) plus other considerations. It will determine data retention considerations like how much data needs to reside in primary data storage space, when it needs to be moved to secondary low-cost inactive storage space, then to tape and finally purged.
  • Data retrieval- Even if data moves to secondary storage space, data needs to be retrieved for processing even though it may be for limited scenarios. If applications that may need this data is primarily java based ones, then support of jdbc based retrieval may need to be supported. So concerns around the retrieval mechanism need to be  considered while looking at the overall solution.
  • Data Store & Data Storage Format - There are multiple options here. One of the default option is to have similar data store as the production one. For example, if the production data store is oracle DB, it may be better to use the oracle based one for inactive part also. But a better option would be to leverage Hadoop Stack so that commodity hardware could be leveraged for storage and also could scale easily. And at the same time, real-time access to data would not be required for the majority of the cases, so obvious time delay associated with batch mode data retrieval is acceptable. 
In addition to above considerations, data governance of inactive data should be given same kind of priority as the active one. Otherwise, there is always a tendency to ignore the same for inactive data which may lead to unnecessary cost escalation later on due to things like security breaches, higher cost for data retrieval, non adherence to compliance requirements.

114 comments:

  1. Learning new technology would give oneself a true confidence in the current emerging Information Technology domain. With the knowledge of big data the most magnificent cloud computing technology one can go the peek of data processing. As there is a drastic improvement in this field everyone are showing much interest in pursuing this technology. Your content tells the same about evolving technology. Thanks for sharing this.

    Hadoop Training in Chennai

    ReplyDelete
  2. Processing data was tough long back without the invention of big data. Under to incredible methodology any data can be processed at maximum speed at minimal time. You are maintaining a wonderful blog, and thanks for sharing this information in here.

    Best hadoop training institute in chennai

    ReplyDelete
  3. Data loss prevention products are valuable tools to organizations that want to effectively monitor and control sensitive information such as financial data, personally identifiable information of customers and employees, medical records, intellectual property and other types of important company data.
    virtual data room review

    ReplyDelete
  4. Well Said, you have furnished the right information that will be useful to anyone at all time. Thanks for sharing your Ideas.
    Salesforce training in Chennai | salesforce course in Chennai

    ReplyDelete
  5. Im no expert, but I believe you just made an excellent You certainly understand what youre speaking about, and I can truly get behind that.
    Salesforce Training in Chennai|Salesforce Training|Salesforce Training institutes in Chennai

    ReplyDelete
  6. Cloud computing is a common term for the delivery of hosted services over the internet.Thanks for sharing this webpage.
    Regards,
    Cloud computing Training in chennai | Cloud Training in chennai | Cloud computing course in chennai

    ReplyDelete
  7. I feel satisfied with your blog, you have been delivering useful & unique information to our vision even you have explained the concept as deep clean without having any uncertainty, keep blogging.

    cloud computing training in chennai|cloud computing training*

    ReplyDelete
  8. I really enjoyed while reading your article, the information you have delivered in this post was damn good. Keep sharing your post with efficient news.
    Regards,
    Salesforce Training|Salesforce Training institutes in Chennai

    ReplyDelete
  9. The expansion of internet and intelligence in the business process lead the way to a huge volume of data.
    Selenium Training in Chennai
    It is important to maintain and process these data to be efficient in data handling.
    Selenium Training
    Selenium Training Center in Sholinganallur

    ReplyDelete
  10. Thank you for this valuable information. We are the best erp software solutions in chennai. Contact us on info@bravetechnologies.in.ERP software in Chennai

    ReplyDelete
  11. Thank you for this valuable information.
    erp in chennai

    ReplyDelete
  12. The rationale behind online data storage is simple. By saving your data on remote servers, the risk of catastrophic data loss as a result natural disasters, theft, technical failure, or other disaster is virtually eliminated. Self Storage

    ReplyDelete
  13. t is a one of the great discussion which is very essential for me as well. I must follow the handy discussion and sure that the content will be very useful to me as well. Keep it up. 
    Six Sigma Certification Training in Chennai | Linux Certification Training in Chennai | Microsoft Certification Training in Chennai

    ReplyDelete
  14. Thanks for the useful post. Keep posting more like this.
    Webdesign Lüdenscheid

    ReplyDelete
  15. Nice blog. Thank you for sharing. The information you shared is very effective for learners I have got some important suggestions from it.
    Java Training Center in Chennai | Best J2EE Training Center in Chennai | No.1 Java Training Institution in Velachery | Core Java Training in Chennai

    ReplyDelete
  16. Pretty article! I found some useful information in your blog, it was awesome to read, thanks for sharing this great content to my vision.
    IEEE Project Center in Chennai | Final Year Project Center in Chennai | Diploma Project Center in Chennai

    ReplyDelete
  17. My rather long internet look up has at the end of the day been
    compensated with pleasant insight to talk about with my family and
    friends."Devops Training in Chennai"

    ReplyDelete
  18. I wish to show thanks to you just for bailing me out of this particular trouble.As a result of checking through the net and meeting techniques that were not productive, I thought my life was done.
    "Oracle Training in Chennai"

    ReplyDelete
  19. Thank you for sharing this helpful post with us..keep updating such an awesome article...
    No.1 Electrical Project Center in Chennai | Electrical Project Center in Velachery

    ReplyDelete
  20. Needed to compose you a very little word to thank you yet again regarding the nice suggestions you’ve contributed here.

    mobile website builder

    ReplyDelete
  21. I believe that there would be great opportunities for those who are coming around this area.
    Best Online Software Training Institute | Big Data Analytics Training

    ReplyDelete
  22. It was very great, this is the worthy concept and good efforts. Really helpful for me and I will share this post with my friends. I like more updates from your blog...
    TOEFL Coaching in Chennai
    TOEFL Training in Chennai
    Jmeter Training in Chennai
    IELTS Coaching in Chennai
    TOEFL Coaching in TNagar
    TOEFL Coaching in Velachery
    TOEFL Coaching in Anna Nagar

    ReplyDelete
  23. I have read your blog its very attractive and impressive. I like it your blog.Thank you so much for sharing this pretty post, it was so good to read and useful to improve my knowledge.

    VMware Certification Training in Chennai | VMware Certification Exam Center in Chennai | VMware Exams Center in Taramani | VMware Certification Exams in Chennai

    ReplyDelete
  24. Wow...What an excellent informative blog, really helpful. Thank you so much for sharing such a wonderful post with us.keep updating..
    AWS Certifications in Chennai | AWS Exam Centers in Chennai | AWS Certification Exams in Velachery | AWS Exams in Velachery | AWS Online Exam Center in Velachery

    ReplyDelete
  25. Your Blog is really amazing with useful and helpful content for us.Thanks for sharing.keep updating more information.
    Embedded System Training Institute in Chennai | Embedded Training in Velachery | Embedded System Training in Guindy

    ReplyDelete
  26. Excellent Post! Thank you so much for sharing this pretty post, it was so good to read and useful to improve my knowledge.
    Java Training Institute in Chennai | Java Certification Training in Velachery

    ReplyDelete
  27. Excellent Post! Thank you so much for sharing this pretty post, it was so good to read and useful to improve my knowledge.
    Embedded System Training in Chennai | Embedded Training in Velachery | Embedded Courses in Pallikaranai

    ReplyDelete
  28. This is useful post for me. I learn lot of new information from your article. keep sharing. thank you for share us.
    MCSE Training Institute in Chennai | MCSE Training in Velachery | MCSE Training Center in Chrompet

    ReplyDelete
  29. This is useful post for me. I learn lot of new information from your article. keep sharing. thank you for share us.
    MCSE Training Institute in Chennai | MCSE Training in Velachery | MCSE Training Center in Chrompet

    ReplyDelete
  30. It is amazing blog and good information... I was improve my knowledge... Thanks for sharing such a informative and wonderful post...
    Java Training Institute in Chennai | Java Training Center in Velachery | Java Certification Training in Taramani

    ReplyDelete
  31. Thanks for your informative article. Your post helped me to understand the future and career prospects. Keep on updating your blog with such awesome article.
    PCB Designing Training Institute in Chennai | PCB Training in Velachery

    ReplyDelete
  32. Really nice post. Thank you for sharing your amazing information and informative article,its really useful for us.keep updating such a wonderful blog..
    Embedded Training Institute in Chennai | Embedded Training Center in Velachery

    ReplyDelete
  33. Really nice post. Thank you for sharing your amazing information and informative article,its really useful for us.keep updating such a wonderful blog..
    Embedded Training Institute in Chennai | Embedded Training Center in Velachery

    ReplyDelete
  34. Very informative and interesting blog, it was so good to read and useful to improve my knowledge as updated one,keep updating..This Concepts is very nice Thanks for sharing..
    Selenium Training Institute in Chennai | Selenium Training Center in Velachery | Selenium Courses in T.Nagar

    ReplyDelete
  35. Very informative and interesting blog, it was so good to read and useful to improve my knowledge as updated one,keep updating..This Concepts is very nice Thanks for sharing..
    Selenium Training Institute in Chennai | Selenium Training Center in Velachery | Selenium Courses in T.Nagar

    ReplyDelete
  36. Thanks for sharing your great information..Its really very impressive and informative content.keep updating...
    Linux Certification Training Institute in Chennai | Linux Training in Velachery | Online Linux Training in Madipakkam

    ReplyDelete
  37. Amazing blog. Thank you for sharing. The information you shared is very effective for learners I have got some important suggestions from it..
    Blue Prism Training Institute in Chennai | Blue prism Certification Training in Velachery | Blue Prism Training Center in Adyar

    ReplyDelete
  38. Amazing blog. Thank you for sharing. The information you shared is very effective for learners I have got some important suggestions from it..
    Blue Prism Training Institute in Chennai | Blue prism Certification Training in Velachery | Blue Prism Training Center in Adyar

    ReplyDelete
  39. Pretty article! I found some useful information in your blog, it was amazing to read, thanks for sharing this great content to my vision...
    Embedded Training Institute in Chennai | Embedded Training in Velachery | Embedded Certification Training in Velachery

    ReplyDelete
  40. Pretty article! I found some useful information in your blog, it was amazing to read, thanks for sharing this great content to my vision...
    Embedded Training Institute in Chennai | Embedded Training in Velachery | Embedded Certification Training in Velachery

    ReplyDelete
  41. Thanks for sharing this great article! That is very interesting I love reading and I am always searching for informative articles like this..
    Cisco Certification Training in Chennai | Cisco Certification Courses in OMR | Cisco Certification Exams in Velachery

    ReplyDelete
  42. Wow!!..What an excellent informative post, its really useful.Thank you so much for sharing such a awesome article with us.keep updating..
    VMware Certification Training in Chennai | VMware Training Institute in Velachery | VMware Certification Courses in Medavakkam

    ReplyDelete
  43. Great post.Thanks for one marvelous posting! I enjoyed reading it;The information was very useful.Keep the good work going on!!
    Tally Training Institute in Chennai | Tally Training in Velachery | Best Tally Courses in Guindy | Tally Training Center in Pallikaranai

    ReplyDelete
  44. Great post.Thanks for one marvelous posting! I enjoyed reading it;The information was very useful.Keep the good work going on!!
    Tally Training Institute in Chennai | Tally Training in Velachery | Best Tally Courses in Guindy | Tally Training Center in Pallikaranai

    ReplyDelete
  45. Awesome post.. Really you are done a wonderful job.thank for sharing such a wonderful information with us..please keep on updating..
    PCB Designing Training Institute in Chennai | PCB Training Center in Velachery | PCB Design Courses in Thiruvanmiyur

    ReplyDelete
  46. Your article is really an wonderful with useful content, thank you so much for sharing such an informative information. keep updating.
    MultiMedia Training Center in Chennai | MultiMedia Training Courses in Velachery | MultiMedia Training Institutes in OMR

    ReplyDelete
  47. Pretty blog, so many ideas in a single site, thanks for the informative article, keep updating more article.
    Software Testing Training Institute in Chennai | Software Testing Training Institutes in Velachery

    ReplyDelete
  48. Pretty blog, so many ideas in a single site, thanks for the informative article, keep updating more article.
    Software Testing Training Institute in Chennai | Software Testing Training Institutes in Velachery

    ReplyDelete
  49. Your blog is really useful for me, and I gathered some information from this blog.Thanks a lot for sharing this amazing article..
    CCNP Training Institute in Chennai | CCNP Training Center in Velachery | CCNP Training Courses in Pallikaranai | CCNP Training in Taramani | CCNP Courses in Medavakkam

    ReplyDelete
  50. Pretty blog, so many ideas in a single site, thanks for the informative article, keep updating more article.
    Oracle Training Institute in Chennai | Oracle Certification Training in Velachery | Oracle Courses in Pallikaranai

    ReplyDelete
  51. Very interesting article.Helps to gain knowledge about lot of information. Thanks for posting information in this blog...
    Java Training Institute in Chennai | Java Training Center in Velachery | Advanced java Courses in Porur

    ReplyDelete
  52. Don’t focus on having a great blog. Focus on producing a blog that’s great for your readers.
    MATLAB Training in Chennai | MATLAB Training in Velachery | MATLAB Training in Nanganallur

    ReplyDelete
  53. Very interesting blog which helps me to get the in depth knowledge about the technology, Thanks for sharing such a nice blog..
    Java Project Center in Chennai | Java Project Center in Velachery | Java Projecs in Perungudi

    ReplyDelete
  54. I wanted to thank you for thisblog fantastic read!! I definitely enjoyed every bit of it. I have you saved as a favorite to check out new stuff you post…

    ReplyDelete
  55. "It is amazing and wonderful to visit your site.Thanks for sharing this information,this is useful to me...
    data science courses"

    ReplyDelete
  56. Free software has two unique major problems that have influenced my design decisions, because often they are avoidable and can make software less robust, less usable, and harder to maintain. data science course in india

    ReplyDelete
  57. In industries, machine learning using python has become popular. This is because it has standard libraries which are used for scientific and numerical calculations. best coding course

    ReplyDelete
  58. It is truly a well-researched content and excellent wording. I got so engaged in this material that I couldn’t wait to read. I am impressed with your work and skill. Thanks for sharing. Oracle integration cloud service training

    ReplyDelete
  59. A debt of gratitude is in order for the blog entry amigo! Keep them coming...
    data scientist training and placement

    ReplyDelete
  60. Informative blog, big thumbs up for sharing this blog with us.
    Data Science Online Training

    ReplyDelete
  61. Really impressed! Everything is very open and very clear clarification of issues. It contains true facts. Your website is very valuable. Thanks for sharing.
    data analytics training in hyderabad

    ReplyDelete
  62. Thank you so much for doing the impressive job here, everyone will surely like your post.
    full stack web development course malaysia

    ReplyDelete
  63. You really make it look so natural with your exhibition however I see this issue as really something which I figure I could never understand. It appears to be excessively entangled and incredibly expansive for me.
    business analytics training in hyderabad

    ReplyDelete
  64. Crack Free Download. Notezilla lets you quickly take notes on PostIt-Esq desktop sticky notes and place them on websites, documents, folders .ELOoffice Crack

    ReplyDelete
  65. Risk management software promotes a structured and systematic approach to risk management, improving an organization's ability to identify, assess, mitigate, and monitor risks effectively. It enhances risk visibility, facilitates informed decision-making, and helps organizations proactively manage uncertainties, thereby reducing potential losses and enhancing operational resilience.

    ReplyDelete
  66. Digital marketing encompasses a variety of online strategies and tactics used by businesses to reach and engage with their target audience.

    ReplyDelete
  67. I appreciate the step-by-step guidance in this post. Your clarity and insight make a real difference in how easily I can understand and apply the concepts.
    Data science courses in Noida

    ReplyDelete
  68. Managing data growth and implementing effective data archival strategies are crucial for maintaining system performance and ensuring data accessibility. Organizations should regularly assess their data storage needs and employ solutions such as tiered storage, automated archiving, and data lifecycle management to optimize resources. By doing so, they can enhance data retrieval efficiency while minimizing costs.

    Data Science Courses in Kolkata

    ReplyDelete

  69. This analysis of managing data growth and archival needs highlights critical challenges faced by organizations as they grapple with rapidly increasing data volumes. The emphasis on understanding data entities and relationships, coupled with compliance requirements, is essential for developing effective data retention strategies. The proposed tiered storage approach makes sense, as it optimizes costs while ensuring that active data remains readily accessible.

    Using a Hadoop-based solution for inactive data is a forward-thinking choice, particularly for organizations that do not require real-time access to all datasets. This approach can significantly reduce operational costs while providing the scalability needed for massive datasets.

    Moreover, the reminder that data governance must extend to archived data is crucial. Neglecting governance in inactive data can lead to compliance risks and unnecessary costs, ultimately undermining the benefits of a well-structured archival system. This commentary serves as a valuable guide for businesses looking to implement a robust data management strategy. Data science courses in Gurgaon

    ReplyDelete
  70. This comment has been removed by the author.

    ReplyDelete
  71. This is an insightful article on the crucial aspects of managing data growth and data archival. Data science courses in Visakhapatnam

    ReplyDelete
  72. Managing data growth and implementing data archival strategies are crucial for maintaining performance and ensuring long-term data accessibility. By regularly archiving outdated data and using scalable storage solutions, businesses can optimize their storage and improve overall system efficiency.

    Data science courses in Pune

    ReplyDelete
  73. Well-written and very informative! I appreciate how you covered not only the technical aspects but also the strategic benefits of effective data growth management. Your insights are timely and relevant for organizations of all sizes. Keep up the great work.
    Data science Courses in Sydney

    ReplyDelete
  74. "Really enjoyed this post! It’s clear you’ve put a lot of effort into making it both informative and engaging."
    Data science Courses in Canada

    ReplyDelete
  75. "I’m constantly amazed at how well you explain everything. Great job once again
    Data science Courses in London

    ReplyDelete
  76. "Excellent article on managing data growth and archival! With the increasing volume of data, this topic has become even more crucial for organizations. I especially liked your point about balancing between accessibility and storage costs. Your tips on creating an efficient data lifecycle strategy are very practical. Keep up the great work!"
    Data science courses in Glasgow

    ReplyDelete
  77. Your insights on managing data growth and archival processes are very well articulated. A valuable resource for data professionals.

    Data science courses in France

    ReplyDelete
  78. Great job on this piece! The way you explained the topic was clear, concise, and engaging. I learned so much—thank you!
    technical writing course

    ReplyDelete
  79. Effective data growth management involves implementing scalable storage solutions, data compression, and tiered storage strategies. Regularly archive older, infrequently accessed data to cost-efficient storage while ensuring compliance with retention policies. Use automated archival processes and maintain robust metadata for quick retrieval. Employ data lifecycle management tools to balance performance, cost, and accessibility as data volumes grow over time.
    Thank you
    Data science Courses in Berlin






    ReplyDelete
  80. This post effectively highlights the challenges of managing rapid data growth and the importance of a robust data archival strategy. It outlines key considerations, including retention policies, storage formats, and retrieval mechanisms, while emphasizing the need for proper governance to balance performance, cost, and security in a scalable manner. Nice Article!
    Data science Courses in Ireland

    ReplyDelete
  81. Your insights into managing data growth through archival processes are valuable. It’s a critical aspect of data management that’s often overlooked. Thanks for shedding light on this important topic!
    digital marketing course in chennai fees

    ReplyDelete
  82. Wonderful post! Data management is such a critical aspect in today’s world, and this article provides valuable tips on how to handle data growth. Archiving is definitely something many businesses overlook, so your insights are much needed. Thank you for sharing
    Top 10 Digital marketing courses in pune

    ReplyDelete
  83. I’m thrilled with the results I’ve seen after completing the Digital Marketing Course in Dwarka. The strategies I learned have made a huge difference in my campaigns.

    ReplyDelete
  84. I really like your writing style, wonderful information, thankyou for putting up : D.
    digital marketing course in varanasi

    ReplyDelete