0 9 Structure structure NN 10 13 and and CC 14 24 expression expression NN 25 27 of of IN 28 31 the the DT 32 37 human human JJ 38 43 GATA3 gata3 NN 44 48 gene gene NN 48 49 . . . 51 56 GATA3 GATA3 NNP 56 57 , , , 58 59 a a DT 60 66 member member NN 67 69 of of IN 70 73 the the DT 74 78 GATA GATA NNP 79 85 family family NN 86 90 that that WDT 91 93 is be VBZ 94 104 abundantly abundantly RB 105 114 expressed express VBN 115 117 in in IN 118 121 the the DT 122 134 T-lymphocyte T-lymphocyte NNP 135 142 lineage lineage NN 142 143 , , , 144 146 is be VBZ 147 154 thought think VBN 155 157 to to TO 158 169 participate participate VB 170 172 in in IN 173 179 T-cell t-cell NN 180 188 receptor receptor NN 189 193 gene gene NN 194 204 activation activation NN 205 212 through through IN 213 220 binding binding NN 221 223 to to TO 224 233 enhancers enhancer NNS 233 234 . . . 235 237 To to TO 238 248 understand understand VB 249 254 GATA3 gata3 NN 255 259 gene gene NN 260 270 regulation regulation NN 270 271 , , , 272 274 we we PRP 275 281 cloned clone VBD 282 285 the the DT 286 291 human human JJ 292 296 gene gene NN 297 300 and and CC 301 304 the the DT 305 306 5 5 CD 306 307 ’ ' SYM 308 311 end end NN 312 314 of of IN 315 318 the the DT 319 324 mouse mouse NN 325 330 GATA3 gata3 NN 331 335 gene gene NN 335 336 . . . 337 339 We we PRP 340 344 show show VBP 345 349 that that IN 350 353 the the DT 354 359 human human JJ 360 365 GATA3 gata3 NN 366 370 gene gene NN 371 379 contains contain VBZ 380 383 six six CD 384 389 exons exon NNS 390 401 distributed distribute VBN 402 406 over over IN 407 409 17 17 CD 410 412 kb kb NN 413 415 of of IN 416 419 DNA DNA NNP 419 420 . . . 421 424 The the DT 425 428 two two CD 429 434 human human JJ 435 440 GATA3 gata3 NN 441 445 zinc zinc NN 446 453 fingers finger NNS 454 457 are be VBP 458 465 encoded encode VBN 466 468 by by IN 469 472 two two CD 473 481 separate separate JJ 482 487 exons exon NNS 488 494 highly highly RB 495 504 conserved conserve VBN 505 509 with with IN 510 515 those those DT 516 518 of of IN 519 524 GATA1 GATA1 NNP 524 525 , , , 526 529 but but CC 530 532 no no DT 533 538 other other JJ 539 549 structural structural JJ 550 560 homologies homology NNS 561 568 between between IN 569 574 these these DT 575 578 two two CD 579 584 genes gene NNS 585 588 can can MD 589 591 be be VB 592 597 found find VBN 597 598 . . . 599 602 The the DT 603 608 human human JJ 609 612 and and CC 613 618 mouse mouse NN 619 624 GATA3 gata3 NN 625 638 transcription transcription NN 639 644 units unit NNS 645 650 start start VBP 651 653 at at IN 654 655 a a DT 656 661 major major JJ 662 672 initiation initiation NN 673 677 site site NN 677 678 . . . 679 682 The the DT 683 691 promoter promoter NN 692 700 sequence sequence NN 701 709 analysis analysis NN 710 712 of of IN 713 718 these these DT 719 722 two two CD 723 728 genes gene NNS 729 737 revealed reveal VBD 738 742 that that IN 743 747 they they PRP 748 751 are be VBP 752 760 embedded embed VBN 761 767 within within IN 768 769 a a DT 770 773 CpG CpG NNP 774 780 island island NN 781 784 and and CC 785 790 share share VBP 791 801 structural structural JJ 802 810 features feature NNS 811 816 often often RB 817 822 found find VBN 823 825 in in IN 826 829 the the DT 830 839 promoters promoter NNS 840 842 of of IN 843 855 housekeeping housekeeping NN 856 861 genes gene NNS 861 862 . . . 863 870 Finally finally RB 870 871 , , , 872 874 we we PRP 875 879 show show VBP 880 884 that that IN 885 886 a a DT 887 890 DNA DNA NNP 891 899 fragment fragment NN 900 910 containing contain VBG 911 914 the the DT 915 920 human human JJ 921 926 GATA3 gata3 NN 927 940 transcription transcription NN 941 945 unit unit NN 945 946 , , , 947 948 3 3 CD 949 951 kb kb NN 952 960 upstream upstream RB 961 965 from from IN 966 969 the the DT 970 980 initiation initiation NN 981 985 site site NN 986 989 and and CC 990 991 4 4 CD 992 994 kb kb NN 995 1005 downstream downstream RB 1006 1010 from from IN 1011 1014 the the DT 1015 1030 polyadenylation polyadenylation NN 1031 1035 site site NN 1035 1036 , , , 1037 1045 displays display VBZ 1046 1052 T-cell t-cell NN 1053 1064 specificity specificity NN 1064 1065 . . .