Supplemental information

families.dat: A tab-separated table describing cases of paralogous co-occurence in operons. Columns are organisms designated by NCBI tax_ids; rows are InterPro domains. Each cell is a comma-separated list of operons containing that InterPro domain; entries or each operon are semi-colon separated RefSeq proteins that contain that InterPro domain. Email me with any questions.

select.txt: Lists the the 35 families selected in this study. Each of the 35 families are listed in turn; after each family InterPro designator and function (in parentheses), all organisms in which that family occurs are listed (indented by a single tab), and after each organism the operon structures are listed (indented by double tabs).

morgan(at)mbi.ucla.edu