Large Files without the Trials

Aaron VanDerlip and Sally Kleinfeldt
Plone Symposium East 2010

Thursday, June 3, 2010

Acknowledgments
• Bioneers provides environmental education
and social connectivity through
conferences, radio and TV, books, and online
materials
• Bioneers engaged Jazkarta to build a file asset server
based on Plone to help them organize,
capture, and store multimedia and textual
content with files as large as 5 GB.


Acknowledgments

• Aaron VanDerlip - Project Manager
• Kapil Thangavelu - Developer


What is a Big File?

• Anything that makes you wait...


Plone Problems with
Big Files

1. Uploading/Downloading
2. Versioning


Uploading Big Files

• Both the user and a Zope thread are
waiting for the file transfer

Uploading Big Files

• Browser encodes file in multipart MIME
format
• Zope must undo this encoding
• CPU and memory intensive, and SLOW
• Zope thread is blocked during this process
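
To make the decoding cost concrete, here is a minimal sketch of what "undoing the encoding" involves. It uses Python's standard email parser as a stand-in; this is not Zope's own request parser, and the boundary string and helper names are invented for illustration. Note that the whole body sits in memory while it is parsed.

```python
from email.parser import BytesParser
from email.policy import HTTP

BOUNDARY = "example-boundary"  # hypothetical boundary string


def build_multipart(filename: str, payload: bytes) -> bytes:
    """Encode one file roughly the way a browser form upload would."""
    head = (
        f"--{BOUNDARY}\r\n"
        f'Content-Disposition: form-data; name="file"; filename="{filename}"\r\n'
        "Content-Type: application/octet-stream\r\n"
        "\r\n"
    ).encode("ascii")
    tail = f"\r\n--{BOUNDARY}--\r\n".encode("ascii")
    return head + payload + tail


def extract_file(body: bytes) -> tuple[str, bytes]:
    """Undo the encoding; the entire body is held in memory while parsing."""
    header = f"Content-Type: multipart/form-data; boundary={BOUNDARY}\r\n\r\n"
    message = BytesParser(policy=HTTP).parsebytes(header.encode("ascii") + body)
    for part in message.iter_parts():
        if part.get_filename():
            return part.get_filename(), part.get_payload(decode=True)
    raise ValueError("no file part found")
```

With a multi-gigabyte payload, this scan-and-copy work is what keeps an application thread busy for the length of the upload.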


Downloading Big Files

• ...the same thing happens in reverse


Learning from Rails
• Get file encoding/decoding and read/write
operations out of Plone
• Web servers are really good at this -
Apache, Nginx, and Lighttpd
• Our implementation uses Apache
• Apache file streaming is fast and threads
are cheap


Learning from Rails

• Uploads: Apache plus mod_porter
http://therailsway.com/tags/porter
• Downloads: Apache plus mod_xsendfile
http://john.guen.in/past/2007/4/17/send_files_faster_with_xsendfile/
• ...and of course ZODB Blob storage


Mod Porter
• Parses the multipart MIME data
• Writes the file to disk
• Changes the Request to contain a pointer
to the temp file on disk
• All done efficiently in C code inside your
Apache process


Mod Porter


Apache Config for
Mod Porter
LoadModule apreq_module /usr/lib/apache2/modules/mod_apreq2.so
LoadModule porter_module /usr/lib/apache2/modules/mod_porter.so

# Apache has a default read limit of 64MB, set it higher
APREQ2_ReadLimit 2G

...

Porter On

# Files below this size will not be handled by mod_porter
PorterMinSize 14M

# Where the uploaded files are stored
PorterDir /mnt/uploads-Apache


X-Sendfile

• HTTP header
• Set an X-Sendfile header and the path of a
file on your response
• Apache does the rest


Apache Config for
X-Sendfile
LoadModule xsendfile_module /usr/lib/apache2/modules/mod_xsendfile.so

...

EnableSendfile On
XSendFile on

# Config to send file resources directly from blob storage
XSendFilePath /mnt/bioneers/var/blobstorage


Using X-Sendfile
from Python
def download(self, response, file_path):
    # Apache replaces the response body with the file at file_path
    response.setHeader("X-Sendfile", file_path)
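
A slightly fuller sketch of the same idea, runnable on its own. FakeResponse is a stand-in for the Zope response object, and the check against an allowed root is an added assumption that mirrors the XSendFilePath directive; neither is taken from the ore.bigfile code.

```python
import os


class FakeResponse:
    """Stand-in for a Zope response; only setHeader is modelled."""

    def __init__(self):
        self.headers = {}
        self.body = b""

    def setHeader(self, name, value):
        self.headers[name] = value


# Assumed to match the XSendFilePath directive in the Apache config.
ALLOWED_ROOT = "/mnt/bioneers/var/blobstorage"


def send_via_apache(response, file_path, allowed_root=ALLOWED_ROOT):
    """Set X-Sendfile, but only for paths inside allowed_root."""
    root = os.path.realpath(allowed_root)
    target = os.path.realpath(file_path)
    if os.path.commonpath([root, target]) != root:
        raise ValueError("path outside the X-Sendfile root: %s" % file_path)
    # The body stays empty; the web server streams the file itself.
    response.setHeader("X-Sendfile", target)
    return response
```

The application only ever handles a path string, so the size of the file does not affect how long the Zope thread is busy.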


Blob Storage
• Uploads
    • Blob.consumeFile (ZODB/blob.py) moves the file from
      Apache’s temp area to blob storage
    • Uses os.rename, so the file never enters Plone
• Downloads
    • Served directly from blob storage
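
The rename step can be sketched as follows. This is an illustration of the os.rename approach only, not the ZODB implementation; consume_upload and its arguments are hypothetical names.

```python
import os


def consume_upload(temp_path, storage_dir, name):
    """Move an uploaded temp file into storage_dir without reading it."""
    os.makedirs(storage_dir, exist_ok=True)
    target = os.path.join(storage_dir, name)
    # os.rename only rewrites directory entries when both paths are on
    # the same filesystem; across filesystems it raises OSError.
    os.rename(temp_path, target)
    return target
```

Because no bytes are copied, the move takes about the same time for a 5 KB file as for a 5 GB one, provided the upload directory and the blob storage share a filesystem.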

Upload Process


What About Really
Really Big Files?
• Use FTP
• Supports continuation and batching
• Handles files too large for browser limits
• Content editors use FTP to transfer files to
an upload directory


Uploading with FTP


ore.bigfile
• Minimally intrusive, works with the grain of
Plone
• Provides Big File content type
• IFrontendFileServer interface defines two
methods that provide web server support
for upload and download
• Apache and Nginx implementations
provided
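
One way to picture a two-method interface with per-server implementations is sketched below. All class names, method names, and the "file_path" form key are hypothetical; the real IFrontendFileServer signatures are not shown in these slides. The headers themselves (X-Sendfile for Apache, X-Accel-Redirect for Nginx) are the standard ones for each server.

```python
import os
import posixpath
from abc import ABC, abstractmethod


class FrontendFileServer(ABC):
    """Hypothetical stand-in for IFrontendFileServer (names assumed)."""

    @abstractmethod
    def uploaded_file_path(self, form):
        """Return the path of the temp file the web server wrote."""

    @abstractmethod
    def download_headers(self, file_path):
        """Return headers that make the web server send the file."""


class ApacheFileServer(FrontendFileServer):
    def uploaded_file_path(self, form):
        # "file_path" is an invented form key, not mod_porter's real one.
        return form["file_path"]

    def download_headers(self, file_path):
        return {"X-Sendfile": file_path}


class NginxFileServer(FrontendFileServer):
    def __init__(self, storage_root, internal_prefix):
        self.storage_root = storage_root
        self.internal_prefix = internal_prefix

    def uploaded_file_path(self, form):
        return form["file_path"]

    def download_headers(self, file_path):
        # Nginx expects an internal URI rather than a filesystem path.
        relative = os.path.relpath(file_path, self.storage_root)
        uri = posixpath.join(self.internal_prefix, relative.replace(os.sep, "/"))
        return {"X-Accel-Redirect": uri}
```

Keeping the server-specific details behind two methods means the content type itself does not need to know which web server sits in front of Plone.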


ore.bigfile
Limitations

• Upload directory is hardcoded
• Possibility of error on very large images
which Mod Porter intercepts


Versioning Big Files


Solution
• Bypass CMFEditions - no file size limitation
• Create a new version only when file
changes (not metadata)
• Allow old versions to be purged
• Version information stored on Big File
object using annotations
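
The versioning rules above can be sketched with a plain dict standing in for the object's annotations. The annotation key, function names, and record layout are assumptions for illustration, not the ore.bigfile data model.

```python
VERSIONS_KEY = "example.bigfile.versions"  # hypothetical annotation key


def add_version(annotations, blob_path):
    """Record a version only when the file itself has changed."""
    versions = annotations.setdefault(VERSIONS_KEY, [])
    if versions and versions[-1]["path"] == blob_path:
        # Same file as before: a metadata-only edit, so no new version.
        return versions[-1]
    number = versions[-1]["number"] + 1 if versions else 1
    entry = {"number": number, "path": blob_path}
    versions.append(entry)
    return entry


def purge_versions(annotations, keep_last):
    """Drop all but the newest keep_last versions; return the dropped ones."""
    if keep_last < 1:
        raise ValueError("keep_last must be at least 1")
    versions = annotations.get(VERSIONS_KEY, [])
    removed = versions[:-keep_last]
    del versions[:len(removed)]
    return removed
```

The caller would still need to delete the purged files from storage; the sketch only manages the version records.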


Conclusion
• ore.bigfile solves the Big File problem for a
particular use case, not feature complete
• It does so by taking advantage of mature
web server technology
• The code is minimally intrusive
• It provides a strategy for implementation
we can learn from as we improve Plone’s
Big File story


http://svn.objectrealms.net/view/public/browser/ore.bigfile

Questions

