<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	xmlns:georss="http://www.georss.org/georss" xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#" xmlns:media="http://search.yahoo.com/mrss/"
	>

<channel>
	<title>Yewei&#039;s Blog</title>
	<atom:link href="http://yewei.wordpress.com/feed/" rel="self" type="application/rss+xml" />
	<link>http://yewei.wordpress.com</link>
	<description>on the way</description>
	<lastBuildDate>Wed, 12 May 2010 13:38:07 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.com/</generator>
<cloud domain='yewei.wordpress.com' port='80' path='/?rsscloud=notify' registerProcedure='' protocol='http-post' />
<image>
		<url>http://s2.wp.com/i/buttonw-com.png</url>
		<title>Yewei&#039;s Blog</title>
		<link>http://yewei.wordpress.com</link>
	</image>
	<atom:link rel="search" type="application/opensearchdescription+xml" href="http://yewei.wordpress.com/osd.xml" title="Yewei&#039;s Blog" />
	<atom:link rel='hub' href='http://yewei.wordpress.com/?pushpress=hub'/>
		<item>
		<title>Three tasks for this month</title>
		<link>http://yewei.wordpress.com/2010/05/12/three-tasks-for-this-month/</link>
		<comments>http://yewei.wordpress.com/2010/05/12/three-tasks-for-this-month/#comments</comments>
		<pubDate>Wed, 12 May 2010 13:34:21 +0000</pubDate>
		<dc:creator>yewei</dc:creator>
				<category><![CDATA[Drizzle]]></category>

		<guid isPermaLink="false">http://yewei.wordpress.com/?p=7</guid>
		<description><![CDATA[    These days I undertood drizzle better as a whole picture in additon to storage engine.Thanks to Jay&#8217;s slides. They are very good resources for newcomers to drizzle. Before adding multi-column record feature and index feature to cloud-based storage engine, &#8230; <a href="http://yewei.wordpress.com/2010/05/12/three-tasks-for-this-month/">Continue reading <span class="meta-nav">&#8594;</span></a><img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=yewei.wordpress.com&amp;blog=13590214&amp;post=7&amp;subd=yewei&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>    These days I undertood drizzle better as a whole picture in additon to storage engine.Thanks to Jay&#8217;s <a href="http://www.joinfu.com/slides/">slides</a>. They are very good resources for newcomers to drizzle. Before adding multi-column record feature and index feature to cloud-based storage engine, I will first finish following 3 tasks in this month.</p>
<p>  Task 1 <strong>Code clean up and refractoring<br />
</strong>  <br />
I had managed to port mysql-awss3 to drizzle when I applied this project. This work involved lots of &#8220;copy and paste&#8221;, which was more about &#8220;make it work&#8221;. I need go through all existing code to do some necessary refractoring, fix bugs and adjust code style to drizzle standard.</p>
<p>  Task 2 <strong>Pass cloud service information into storage engine<br />
</strong>  <br />
Authentication information of cloud service like S3 key id and key value is hard-coded for now. I need to figure out a way how to pass it into storage engine. I find that Zimin has begun to solve the same probem and a <a href="https://blueprints.launchpad.net/drizzle/+spec/key-value-pass-down-to-storage-engine">blueprint </a>has been registered.</p>
<p>  Task 3<strong> Support Rackspace Cloud File</strong></p>
<p>Cloud cursor now have some private member function to do S3 operations like getting bucket list and putting an object. To support Rackspace, I plan to create a new class as cloud service utility with all necessary virtual methods. Two sub classes for S3 and Rackspace are responsible for implementation for these methods. Cousor has a reference of this unitily which is instansiated according to engine option.</p>
<p>There will be a webinar for new drizzle developer on 15th May. The agenda presented by Jay pipes looks great. Most of my concerned problema will be addressed. Thanks Jay!</p>
<p>Looking forward to meeting you drizzle guys in webinar:)</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/yewei.wordpress.com/7/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/yewei.wordpress.com/7/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/yewei.wordpress.com/7/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/yewei.wordpress.com/7/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/yewei.wordpress.com/7/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/yewei.wordpress.com/7/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/yewei.wordpress.com/7/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/yewei.wordpress.com/7/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/yewei.wordpress.com/7/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/yewei.wordpress.com/7/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/yewei.wordpress.com/7/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/yewei.wordpress.com/7/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/yewei.wordpress.com/7/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/yewei.wordpress.com/7/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=yewei.wordpress.com&amp;blog=13590214&amp;post=7&amp;subd=yewei&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://yewei.wordpress.com/2010/05/12/three-tasks-for-this-month/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/5caaccb44cdbf57dbea4148298959c55?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">yewei</media:title>
		</media:content>
	</item>
		<item>
		<title>Project Details of Cloud Based Storage Engine</title>
		<link>http://yewei.wordpress.com/2010/05/10/project-details-of-cloud-based-storage-engine/</link>
		<comments>http://yewei.wordpress.com/2010/05/10/project-details-of-cloud-based-storage-engine/#comments</comments>
		<pubDate>Mon, 10 May 2010 03:32:32 +0000</pubDate>
		<dc:creator>yewei</dc:creator>
				<category><![CDATA[Drizzle]]></category>

		<guid isPermaLink="false">http://yewei.wordpress.com/?p=4</guid>
		<description><![CDATA[This project will provide a Clould-based Storage Engine for Drizzle &#8211; allowing storing and retrieving of data from Cloud Storage Services including both AWS S3 and Rackspace Cloud Files. In this project, I plan to implement a non-transactional cloud-based storage &#8230; <a href="http://yewei.wordpress.com/2010/05/10/project-details-of-cloud-based-storage-engine/">Continue reading <span class="meta-nav">&#8594;</span></a><img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=yewei.wordpress.com&amp;blog=13590214&amp;post=4&amp;subd=yewei&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>This project will provide a Clould-based Storage Engine for Drizzle &#8211; allowing storing and retrieving of data from Cloud Storage Services including both AWS S3 and Rackspace Cloud Files. In this project, I plan to implement a non-transactional cloud-based storage engine. This project is based on a mysql engine mysql-awss3.My work is mainly about porting this engine to dirzzle and adding some new features. New features include:</p>
<p><strong>1) multiple column format support</strong></p>
<p>mysql-awss3 seems only support (key,blob) record, while this engine will support more complex record. In this case, item content stored in cloud should be in some format. I will design and implement a row buffer format for mutiple-column record, with other engines(e.g., CSV engine, Heap engine) as reference.</p>
<p><strong>2) index support</strong></p>
<p>To support index, taking AWS S3 for example, we can use two buckets to store data, one for record (say bucket1) and the other for index (say bucket2).For bucket1, item key stores primary key and item content stores data of all columns. For bucket2, item key stores other keys of a table, and item content stores a set of item keys in bucket1, with each one separated by comma. Keys are in KeyTupleFormat. Since keys in buckets2 may not be unique, addition information should be added upon KeyTupleFormat.<br />
In this way, we can issue two REST requests to get one record data given the index. The first request gets item key of record data by searching index, and the second request gets record data by seaching item key.For full table scan, like mysql-awss3, we can fetch all records in sequence by setting &#8220;marker&#8221; field of each request header as the key of last record fetched.<br />
Or we can use only one bucket. We distinguish column data and index based on prefix of item key in one bucket. The advantage of two buckets is that we can do a full table scan more quickly. Besides, the implementation for the two features above also will be more low-coupling, so I can implement them one by one more easily.</p>
<p>With respect to code implementation, in this project I will</p>
<ol>
<li>Implement a StorageEngine</li>
<li>Implement a Cursor</li>
<li>Design data structures for table(and index) information like ArchiveShare and TinaShare</li>
<li>Add or Rewrite some uitility functions(e.g., HTTP request and response handlers, XML handlers) if necessary.</li>
</ol>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/yewei.wordpress.com/4/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/yewei.wordpress.com/4/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/yewei.wordpress.com/4/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/yewei.wordpress.com/4/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/yewei.wordpress.com/4/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/yewei.wordpress.com/4/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/yewei.wordpress.com/4/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/yewei.wordpress.com/4/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/yewei.wordpress.com/4/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/yewei.wordpress.com/4/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/yewei.wordpress.com/4/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/yewei.wordpress.com/4/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/yewei.wordpress.com/4/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/yewei.wordpress.com/4/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=yewei.wordpress.com&amp;blog=13590214&amp;post=4&amp;subd=yewei&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://yewei.wordpress.com/2010/05/10/project-details-of-cloud-based-storage-engine/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/5caaccb44cdbf57dbea4148298959c55?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">yewei</media:title>
		</media:content>
	</item>
	</channel>
</rss>
