Once of the difficulties in running the OASIS test suite was picking out real test fails from noise. Because I haven't yet implemented some XSLT elements and functions, there are many tests which will fail but which don't represent an actual bug. There are some other tests which have HTML output which I also don't do, want to use alternative text encodings (which I think is outside spec), where there's some implementor discretion, and there are a few where the test itself is wrong.
I've extended the XSLT test runner to read an list of expected fails, and adjust the test results accordingly. Individual tests can be marked as expected compile or runtime fails, and the summary output is annotated accordingly. It's a little thing, but it helps :)