Hi, I am using ZipArchive (System.IO.Compression) to collect multiple XML memory streams, combine them into a single zip memory stream, and send that to a remote server (where the zip file is stored).

My intention is not to store any files (XML or zip) on the local machine, and not to use any third-party DLL for compression.

The code below works and gives the result I expect, but I am still worried about performance (in the production environment we will have many more records to zip and store).

Kindly advise the best solution in terms of performance and memory leaks.

What I have tried:

Extension method to serialize a List&lt;T&gt; (or any object) into an XML memory stream:
/// <summary>
/// Serialize any object (typically a List&lt;T&gt;) into an XML memory stream.
/// </summary>
/// <typeparam name="T"></typeparam>
/// <param name="dataToSerialize"></param>
/// <returns>A MemoryStream positioned at the beginning of the XML document.</returns>
public static MemoryStream Serialize<T>(this T dataToSerialize)
{
    if (dataToSerialize == null) throw new ArgumentNullException(nameof(dataToSerialize));

    // XmlSerializer instances created via the Type-only constructor are cached by the
    // framework, so repeated calls do not keep generating new dynamic assemblies.
    var serializer = new XmlSerializer(dataToSerialize.GetType());

    var memStream = new MemoryStream();
    serializer.Serialize(memStream, dataToSerialize);

    // Rewind so the caller can read the stream without having to seek first.
    memStream.Seek(0, SeekOrigin.Begin);
    return memStream;
}
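For reference, a minimal usage sketch of that extension (Order is a placeholder type invented for illustration, not part of the original code):

// Hypothetical usage: Order is a placeholder type defined only for this sketch.
public class Order { public int Id { get; set; } }

var orders = new List<Order> { new Order { Id = 1 }, new Order { Id = 2 } };
using (MemoryStream xml = orders.Serialize())
{
    // xml now holds the serialized <ArrayOfOrder> document, positioned at the start.
}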


Extension method to build a zip memory stream from several lists of records.
The input is a dictionary keyed by XML file name, e.g. ["xmlfilename", List<object> of data for XML conversion]:

/// <summary>
/// Serialize each dictionary entry to XML and pack all of them into a single
/// zip archive held in a memory stream (entry name = dictionary key).
/// </summary>
/// <typeparam name="T"></typeparam>
/// <param name="dataToSerialize"></param>
/// <returns>A MemoryStream containing the zip archive, positioned at the beginning.</returns>
public static MemoryStream SerializeIntoZip<T>(this Dictionary<string, T> dataToSerialize)
{
    if (dataToSerialize == null) throw new ArgumentNullException(nameof(dataToSerialize));

    var outStream = new MemoryStream();

    // leaveOpen: true keeps outStream usable after the archive is disposed;
    // disposing the archive is what writes the zip central directory to the stream.
    using (var archive = new ZipArchive(outStream, ZipArchiveMode.Create, true))
    {
        foreach (var data in dataToSerialize)
        {
            var fileInArchive = archive.CreateEntry(data.Key, CompressionLevel.Optimal);
            using (var entryStream = fileInArchive.Open())
            using (var fileToCompressStream = data.Value.Serialize()) // XML memory stream from the extension above
            {
                fileToCompressStream.CopyTo(entryStream);
            }
        }
    }

    outStream.Seek(0, SeekOrigin.Begin);
    return outStream;
}


My main method
Dictionary<string, IList> listofobj =
                new Dictionary<string, IList>() {
                   {"fileName_XX.xml", GetXXList1()}, 
                   {"fileName_YY.xml", GetYYList()}};

var mem = listofobj.SerializeIntoZip();

//{
// code to upload zip memory stream to S3 bucket
//}
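For completeness, a minimal sketch of that upload step, assuming the AWS SDK for .NET (AWSSDK.S3 package); the bucket name and object key here are made up for illustration and are not part of the original code:

// Sketch only: client configuration, bucket name and key are assumptions.
using (var mem = listofobj.SerializeIntoZip())
using (var s3 = new Amazon.S3.AmazonS3Client())
{
    var request = new Amazon.S3.Model.PutObjectRequest
    {
        BucketName = "my-bucket",          // hypothetical bucket name
        Key = "exports/records.zip",       // hypothetical object key
        InputStream = mem                  // the zip stream is uploaded directly, nothing touches disk
    };

    // Or: await s3.PutObjectAsync(request) inside an async method.
    s3.PutObjectAsync(request).GetAwaiter().GetResult();
}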

1 solution

Normally RAM is not the problem on these machines; disk I/O is, so you should optimize the disk read and write operations. It would make sense to use one thread for file I/O and another for data compression, so that neither operation hurts the other's performance. The file I/O thread should work on only one file at a time: first write the pending output, then read the next input ahead. The memory stream is then handed to the compression thread while the I/O thread reads the next file. For the remote transfer some buffering is very useful, but keep an eye on the free memory.

Ideally, run some realistic tests to find out where to optimize. For instance, if the files are small, then read, compress and write a bunch of files to a temp location and transfer them later.
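As a rough illustration of that split applied to the in-memory scenario from the question, here is a minimal producer/consumer sketch: one task serializes the records to XML streams, another packs them into the zip archive, so serialization and compression overlap. The bounded queue size and the task layout are my assumptions, not code from the question.

// Sketch: requires System.Collections.Concurrent, System.Threading.Tasks,
// System.IO and System.IO.Compression. listofobj is the dictionary from the question.
// The bounded capacity limits how many serialized streams wait in memory at once.
var queue = new BlockingCollection<KeyValuePair<string, MemoryStream>>(boundedCapacity: 4);

var serializeTask = Task.Run(() =>
{
    foreach (var item in listofobj)
        queue.Add(new KeyValuePair<string, MemoryStream>(item.Key, item.Value.Serialize()));
    queue.CompleteAdding();
});

var zipStream = new MemoryStream();
var compressTask = Task.Run(() =>
{
    using (var archive = new ZipArchive(zipStream, ZipArchiveMode.Create, true))
    {
        foreach (var item in queue.GetConsumingEnumerable())
        {
            using (var xml = item.Value)
            using (var entry = archive.CreateEntry(item.Key, CompressionLevel.Optimal).Open())
            {
                xml.CopyTo(entry);
            }
        }
    }
});

Task.WaitAll(serializeTask, compressTask);
zipStream.Seek(0, SeekOrigin.Begin);
// zipStream can now be handed to the upload code.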
 