
I think I independently came up with the same idea to solve the distributed gfs master problem: use a separate bigtable/gfs1 cluster to hold the master metadata for gfs2.

I'm glad that I'm not crazy :)



Also, does gfs1 still have a single master, so that gfs2 has multiple Bigtable tablets serving as its distributed masters? Is that the cause of "In fact, it just makes the bottleneck limitations of the system’s single-master design more apparent than would otherwise be the case.", as stated in the article?


gfs1 is still single-master, but the workload is much simpler in this case: it serves the gfs2 master bigtable cluster exclusively. Most of the documented gfs master failures are due to misbehaved map-reduce clients. Also, the gfs1 master can be down for an extended period of time without affecting master operations, due to the nature of the cluster (you're unlikely to create a million files per second, which would cause a lot of compactions and splits in the metadata tablets).

The quote you mentioned actually means that if you use Bigtable on top of gfs1, the single-master failure mode becomes more apparent because of the low-latency requirements of the applications that use the Bigtable.
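
For anyone trying to picture the layering, here's a rough, purely illustrative sketch in Python. All class and method names are hypothetical (this is not Google's actual internal API); it only shows the idea described above: gfs2 file metadata lives as rows in a Bigtable, that Bigtable persists its tablets as files on a small dedicated gfs1 cluster, and the gfs1 single master only matters when tablets get flushed or split.

    # Hypothetical sketch of the gfs2-metadata-on-Bigtable-on-gfs1 layering.

    class Gfs1Cluster:
        """Single-master GFS cluster used only to back the metadata Bigtable."""
        def __init__(self):
            self.files = {}  # path -> bytes, namespace managed by the single master

        def write(self, path, data):
            self.files[path] = data

        def read(self, path):
            return self.files.get(path)


    class MetadataBigtable:
        """Bigtable whose tablets are persisted as gfs1 files.
        Rows map gfs2 file paths to their chunk locations."""
        def __init__(self, backing_gfs1):
            self.gfs1 = backing_gfs1
            self.rows = {}  # in-memory memtable; written out to gfs1 on compaction

        def put(self, row_key, value):
            self.rows[row_key] = value

        def get(self, row_key):
            return self.rows.get(row_key)

        def compact(self):
            # Persist the memtable as an immutable tablet file on the gfs1 cluster.
            # Metadata mutations are comparatively rare, so compactions (and hence
            # the gfs1 master) are rarely on the critical path.
            tablet = repr(sorted(self.rows.items())).encode()
            self.gfs1.write("/tablets/metadata-000001.sst", tablet)


    class Gfs2Master:
        """'Distributed master': gfs2 file metadata is just rows in the Bigtable,
        so it can be served by many tablet servers instead of one process."""
        def __init__(self, metadata):
            self.metadata = metadata

        def create_file(self, path, chunk_locations):
            self.metadata.put(path, chunk_locations)

        def lookup(self, path):
            return self.metadata.get(path)


    if __name__ == "__main__":
        gfs1 = Gfs1Cluster()
        metadata = MetadataBigtable(gfs1)
        master = Gfs2Master(metadata)

        master.create_file("/logs/frontend-000042", ["chunkserver-3", "chunkserver-7"])
        print(master.lookup("/logs/frontend-000042"))
        metadata.compact()

The point of the toy example is just the dependency direction: client-visible gfs2 metadata operations hit Bigtable tablet servers, and only background tablet persistence touches the gfs1 single master.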


Is this vacaya related to the vacaya of hypertable? :-)


Aha, we had conceived of this same approach several months ago in China ...



