Back in the old days when we used CVS of all things for our version control we had a wonderful tool called bonsai to help query the repository for changes. You could list changes on a per directory basis if you needed which was great for keeping an eye on certain chunks of code. I recall there being a way of getting an RSS feed from it and I used it when I was the module owner of the extension manager to see what changes were landed that I hadn’t noticed in bugs.
Fast forward to today and we use mercurial instead. When we switched there was much talk of how we’d get tool parity with CVS but bonsai is something that has never been replaced fully. Oh hgweb is decent at looking at individual files and browsing the tree, but you can’t get that list of changes per directory from it. I believe you can use the command line to do it but who wants to do that? Lately I’ve been finding need of those directory RSS feeds more and more. We’re now periodically uplifting the Add-on SDK repository to mozilla-central, it’s really important to spot changes that have been made to that directory in mozilla-central so we can also land them in our git repository and not clobber them the next time we uplift. I’m also the module owner of toolkit, which is a pretty big sprawling set of files. It seems like everytime I look I find something that landed without me noticing. I don’t make for a good module owner if I’m not keeping an eye on things so I’d really like to see when new files are added there.
So I introduce the Hg Change Feed, the result of mostly just a few days of work. Every 10 minutes it pulls new changes from mozilla-central and mozilla-inbound. A mercurial hook looks over the changes and adds information about them to a MySQL database. Then a simple django app displays that information. As you browse through the directories in the tree it shows only changesets that affected files beneath that directory. For any directory you can also get an RSS feed of the same. Plug that into IFTTT and you have an automated system to notify you in pretty much any way you’d like about new changes you’d be interested in.
Some simple examples. For tracking changes to Add-on SDK I’m watching http://hgchanges.fractalbrew.com/mozilla-inbound/file/addon-sdk/source. For toolkit I’m looking at http://hgchanges.fractalbrew.com/mozilla-inbound/file/toolkit?types=added. Types takes a comma separated list of “added”, “removed” and “modified” to filter which changes you’re interested in. There’s no UI on the site for changing that right now, you’re welcome to add some!
One other neat trick that this does is mostly ignore merge changesets. Only if a merge actually makes a change not already present in either of the merge parents (mostly happens when resolving merge conflicts) will it show up in the list of changes, because really you don’t need to hear about changes twice.
So play with it, let me know if you find it useful or if you think things are missing. I can also add other mercurial repositories if people want. Some caveats:
- It only retains the last 2000 changesets from any repository in an effort to keep the DB size small and fast, it also only shows the last 200 changesets for each page, or just the last 20 in the feeds. These can be tweaked easily enough and I’ve done basically no benchmarking to say those are the right values.
- The site isn’t as fast as I’d like, particularly listing changes for the top level directory takes nearly 5 seconds. I’ve thrown some basic caching in place to help alleviate that for now. I bet someone who has more MySQL and django experience than me could tell me what I’m doing wrong.
- I’m off on vacation tomorrow so I guess I’m announcing this then running away, sorry if that means it takes me a while to respond to comments.
Want to help out and make it better? Go nuts with the source. There’s a readme that hopefully explains how to set up your own instance.
9 thoughts on “Get notifications about changes to any directory in mercurial”
There’s been many times I’ve wanted this, awesome! Any chance we can get it for comm-central too? 🙂
I should have anticipated that request and done the work earlier! I’ll start importing the repository and it will probably show up on the site pretty quickly, but it won’t have a full set of changesets listed for a while.
Well, since comm-central is much lower traffic I decided to only cache the most recent 1000 changesets, let me know if that is a problem. Because it is a much smaller repository the indexing went fast and it is all up there now.
Thanks, that should be perfect!
This is awesome!
Maybe long-term it’s something that could live inside or alongside DXR (http://dxr.mozilla.org/) ?
I made it standalone because I knew I could knock it out quickly and I wanted to play with mercurial hooks (awesome!) and refresh my django memories (still awesome!). But long term sure I’d love to see it more formally owned and supported.
IMO it should be part of hgweb. The Selenic bug is http://bz.selenic.com/show_bug.cgi?id=2034
I totally agree, but given that we’ve been waiting three years on that being fixed I’m not holding out a lot of hope.
Someone did a write-up—and a Planet post? My archives don’t go back that far—on devmo about getting this info using Yahoo! pipes: Filter a Mercurial Changelog feed by Pushlog directory paths. I can confirm that it did work at one time, because I set a couple up when it first appeared, but one of them was broken from the beginning, and I later removed the other when it broke (since I didn’t want to read the Pipes docs…). So for anyone wanting to give it a go, YMMV, etc.
Comments are closed.