[PDF] Investigations of the Underlying Mechanisms of HIF-1α and CITED2 Binding to TAZ1

Abstract

The TAZ1 domain of CREB binding protein is crucial for transcriptional regulation and recognizes multiple targets. The interactions between TAZ1 and its specific targets are related to the cellular hypoxic negative feedback regulation. Previous experiments reported that one of the TAZ1 targets CITED2 is an efficient competitor of another target HIF-1{\alpha}. Here by developing the structure-based models of TAZ1 complexes we have uncovered the underlying mechanisms of the competitions between HIF-1{\alpha} and CITED2 binding to TAZ1. Our results are consistent with the experimental hypothesis on the competition mechanisms and the apparent affinity. In addition, the simulations prove the dominant position of forming TAZ1-CITED2 complex in both thermodynamics and kinetics. For thermodynamics, TAZ1-CITED2 is the lowest basin located on the free energy surface of binding in the ternary system. For kinetics, the results suggest that CITED2 binds to TAZ1 faster than HIF-1{\alpha}. Besides, the analysis of contact map and f values in this study will be helpful for further experiments on TAZ1 systems.

Full PDF

IInvestigations of the Underlying Mechanisms of HIF-1 α andCITED2 Binding to TAZ1 September 4, 2019

Wen-Ting Chu , Xiakun Chu , Jin Wang , ∗ State Key Laboratory of Electroanalytical Chemistry, Changchun Institute of Applied Chemistry, Chinese Academy ofSciences, Changchun, Jilin, 130022, China Department of Chemistry & Physics, State University of New York at Stony Brook, Stony Brook, NY, 11794, USA ∗ Corresponding Author: [email protected] a r X i v : . [ q - b i o . B M ] S e p bstract The TAZ1 domain of CREB binding protein is crucial for transcriptional regulation and recognizes mul-tiple targets. The interactions between TAZ1 and its speciﬁc targets are related to the cellular hypoxic nega-tive feedback regulation. Previous experiments reported that one of the TAZ1 targets CITED2 is an efﬁcientcompetitor of another target HIF-1 α . Here by developing the structure-based models of TAZ1 complexeswe have uncovered the underlying mechanisms of the competitions between HIF-1 α and CITED2 bindingto TAZ1. Our results are consistent with the experimental hypothesis on the competition mechanisms andthe apparent afﬁnity. In addition, the simulations prove the dominant position of forming TAZ1-CITED2complex in both thermodynamics and kinetics. For thermodynamics, TAZ1-CITED2 is the lowest basinlocated on the free energy surface of binding in the ternary system. For kinetics, the results suggest thatCITED2 binds to TAZ1 faster than HIF-1 α . Besides, the analysis of contact map and φ values in this studywill be helpful for further experiments on TAZ1 systems. Intrinsically disordered proteins (IDPs) behave as disordered/unstructured forms at physiological con-ditions in isolated states, but sometimes undergo conformational changes to folded form upon binding to thepartners . Such binding-coupled-folding scenario has signiﬁcantly refreshed our understanding on the proteinstructure-function paradigm. Generally, IDPs are widely involved in many critical physiological processes, suchas transcription and translation regulation, cellular signal transduction, protein phosphorylation, and molecularassembles .The transcriptional adaptor zinc-binding 1 (TAZ1) is a protein domain of CREB binding protein (CBP),which plays an important role in the transcriptional regulation . One of its binding partners, the α -subunit ofthe transcription factor (hypoxia inducible factor) HIF-1 (HIF-1 α ), interacts with TAZ1 through its intrinsicallydisordered C-terminal transactivation domain, which is related to the transcriptional regulation of genes that arecrucial for cell survival during low levels of oxygen . Another binding partner CITED2, which occupies adifferent but partially overlapped binding site from that of HIF-1 α , acts as a negative feedback regulator thatattenuates HIF-1 transcriptional activity by competing for TAZ1 binding through its own disordered transacti-vation domain . Intriguingly, both the two ligands of TAZ1 have a conserved LP(Q/E)L region (LPQL inHIF-1 α ; LPEL in CITED2) that is essential for negative feedback regulation . These LPQL and LPELdomains bind with the same place of TAZ1 surface at bound state (see Fig. 1).Though the structures of both TAZ1-HIF-1 α and TAZ1-CITED2 complexes have been deposited to ProteinData Bank , little is known about how binding afﬁnity or binding mechanism will be inﬂuenced if twoligands co-exist. Both HIF-1 α and CITED2 bind to TAZ1 with the same high afﬁnity ( K d = ± . Inaddition, HIF-1 α and CITED2 utilize partially overlapped binding sites to form complexes with TAZ1 ,suggesting that HIF-1 α and CITED2 are binding competitors to TAZ1. The NMR experiments of Wright etal. observed that TAZ1-CITED2 complex is dominant in the TAZ1:HIF-1 α :CITED2 solvation with 1:1:1molar ratio. The ﬂuorescence anisotropy competition experiments found that CITED2 exhibits an apparent K d of 0.2 ± α complex, while HIF-1 α displaces TAZ1-bound CITED2 with a muchhigher apparent K d (0.9 ± µ M) . These experimental results indicate that CITED2 is extremely effectivein displacing HIF-1 α from the TAZ1â ˘A ¸SHIF-1 α complex.The possible mechanism for displacement of HIF-1 α from its complex with TAZ1 by CITED2 wasproposed that CITED2 binds to TAZ1-HIF-1 α complex through its N-terminal region, displacing the dynamicaland weakly interacting α helix of HIF-1 α , then competing through an intramolecular process for binding to theLP(Q/E)L site. However, it is challenging to prove such replacing mechanism experimentally. In our previousworks, we have explored the complex association processes and uncovered the binding and folding mechanismas well as the key interactions with structure-based model and molecular dynamics (MD) simulations . Herein this study, we developed the binary models of both TAZ1-HIF-1 α binding and TAZ1-CITED2 binding aswell as the ternary model of TAZ1-HIF-1 α -CITED2 binding to unveil the mechanism of the replacing processes(including HIF-1 α replacing the TAZ1-bound CITED2 as well as CITED2 replacing the TAZ1-bound HIF-1 α ).Our studies quantiﬁed the free energy surface of the overall competition processes and further suggest that it iseasier to form TAZ1-CITED2 complex than TAZ1-HIF-1 α complex in thermodynamics or kinetics, in line with2 IF-1 α C-terHIF-1 α N-ter KIXN-terKIXC-ter KIXN-terKIXC-terCITED2N-ter CITED2C-ter α A α B α C α A α A α C LPQL LPEL

AB C D

SDLACRLLGQ SMDESGLPQL TSYDCEVNAP IQGSRNLLQG EELLRALDQV NTDFIDEEVLM SLVIEMGLDR IKELPELWLG QNEFDFMTDF VCKQQPSRVS

HIF-1 α CITED2 α A α B α C α AL1 L2 L3L1 L2

Fig. 1

The sequences (A) and binding postures (B-D) of HIF-1 α (51 a.a., corresponding to 776-826 in 1L8C)and CITED2 (50 a.a., corresponding to 220-269 in 1R8U). HIF-1 α includes 3 main alpha helices α A (783-788), α B (796-804), α C (815-824), and 3 main loop regions, L1 (776-782), L2 (789-795), L3 (805-814). LPQL motifis included in L2 region. CITED2 has only one helix, α A (225-235), and 2 main loop regions, L1 (220-224), L2(236-269). LPEL motif is included in L2 region. The complex structures (B, TAZ1-HIF-1 α ; C, TAZ1-CITED2)are extracted from the NMR structures 1L8C and 1R8U. Their superimposed structure is illustrated in the panel D. α A, LPQL of HIF-1 α and α , LPEL of CITED2 share the same binding surface of TAZ1. The alpha helices of HIF-1 α and CITED2 are labeled both in the sequences and in the complex ﬁgures. The essential conserved motifs areemphasized with pink boxes (in the sequences) and with magenta and dark blue cartoons (in the complex ﬁgures). the experimental ﬁndings. This study has potential implication in for the underlying details and mechanisms oftranscription regulations by TAZ1. We used a weighted structure-based model of TAZ1-HIF-1 α and TAZ1-CITED2 complexes according tothe NMR structures 1L8C and 1R8U , respectively. The contact map of the weighted structure-based modelwas collected based on the 20 conﬁgurations in PDB, taking into account the NMR structural ﬂexibility. Theparameters of the model were calibrated carefully in order to be consistent with the experimental measure-ments. In details, the strengths of intra-chain interactions of HIF-1 α and CITED2 were tuned according to theexperimental helical content at unbound state. It was reported that both HIF-1 α (776-826) and CITED2 (220-269) behaved as random coil at unbound state , thus the helical contents of isolated HIF-1 α and CITED2were set to be below 10% in our simulations. Then the strengths of inter-chain interactions between TAZ1and HIF-1 α as well as between TAZ1 and CITED2 were adapted according to the experimental dissociationconstant ( K d about 10 nM for both TAZ1-HIF-1 α and TAZ1-CITED2 ). In our model, both TAZ1-HIF-1 α and TAZ1-CITED2 complexes with inter-molecular interaction strengths of 1.10 and 0.95 have similar afﬁnitieswith the experiments (corresponding to about -6.06 kT binding energy). The experimental K d and the simulated K d of single ligand binding to TAZ1 (binary system) process are listed in Tab. 1.After performing replica exchange molecular dynamics (REMD) simulations for sufﬁcient sampling with28 replicas ranging from about 0.50 to 1.86 (simulation temperature, room/experimental temperature is about0.99) on both TAZ1-HIF-1 α and TAZ1-CITED2 complexes, The weighted histogram analysis method (WHAM)algorithm was applied on the REMD trajectories to collect statistics and to obtain the free energy distribu-tions as well as other characteristics at the experimental temperature. Firstly the binding free energy proﬁle wasquantiﬁed by projecting the free energy onto the fraction of inter-molecular native contacts ( Q inter ), which can3 ab. 1 The experimental K d (as well as apparent K d ), the simulated binding energy (the energy difference betweenthe ﬁnal state and the initial state, ∆ G) of different systems. The experimental K d values are extracted from theref . Note that the experimental apparent K d does not equal to the simulated K d because of the different calculatingmethod. The method of obtaining the simulated K d is described in the SI Appendix Method section. Binary systemprocess Exp. K d (nM) Simu. K d (nM) Simu. ∆ G (kT)UB to HB 10 2.515 -6.75UB to CB 10 2.465 -6.76Ternary systemprocess Exp. app. K d (nM) Simu. K d (nM) Simu. ∆ G (kT)UB to HB 900 62.8 -5.14UB to CB 0.2 1.38 -7.05be considered as the reaction coordinate of binding. As shown in Fig. S1, TAZ1-HIF-1 α and TAZ1-CITED2have similar binding afﬁnity ( ∆ G bind ), which agrees with the experimental measured K d . However, the bind-ing barrier ( ∆ G ‡ ) of TAZ1-HIF-1 α is signiﬁcantly higher than that of TAZ1-CITED2 ( ∆ G ‡ (TAZ1-HIF-1 α ) isabout two times of ∆ G ‡ (TAZ1-CITED2)). The thermodynamic results suggest that when binding to TAZ1, thekinetic binding rate of HIF-1 α should be lower than that of CITED2, though the similar binding stabilities areestablished.The ﬂexible binding or binding coupled folding behavior can be illustrated by the free energy surface along Q inter , Q intra and along Q inter , helical content (Fig. S2). Q intra is the fraction of intra-molecular native contacts,which acts as folding reaction coordinates. Upon binding, Q intra and the helical content of HIF-1 α change from0.30 to 0.75 and 0% to 35% (bottom of the basins of unbound and bound states), respectively. But the changesof the Q intra and helical content of CITED2 during binding are not as remarkably large as that of HIF-1 α , theyalter from 0.24 to 0.42 and from 0% to 17%, respectively. In addition, it is obvious that at the transition states,the Q intra and helical content of HIF-1 α or CITED2 are similar as that at the unbound state, but are differentfrom that at the bound state. This accords with the induce-ﬁt binding mechanism. α -CITED2 complex After obtained the tuned TAZ1-HIF-1 α and TAZ1-CITED2 models, we constructed the ternary TAZ1-HIF-1 α -CITED2 model by putting HIF-1 α and CITED2 together with TAZ1 into the same simulation sphere.REMD simulations with 28 replicas ranging from 0.50 to 1.86 temperature were performed on ternary TAZ1-HIF-1 α -CITED2 complex, with both HIF-1 α and CITED2 unbound as the initial state. Firstly, the free energyat experimental temperature was obtained and projected on both the binding reaction coordinates of TAZ1-HIF-1 α and TAZ1-CITED2 ( Q inter (TAZ1-HIF-1 α ) and Q inter (TAZ1-CITED2)). As shown in Fig. 2, three mainlower basins and other smaller basins can be recognized on the free energy surface at experimental temperature.The lowest basin corresponds to CITED2 bound state (CB state, 0.00 kT), which means that TAZ1-CITED2 isthe dominant state of all. This is consistent with the NMR results of ref . HIF-1 α bound basin (HB state) isabout 1.91 kT higher than CB state, which is another bound state. Because the LPQL region of HIF-1 α andLPEL region of CITED2 share the same binding site on the surface of TAZ1, HIF-1 α and CITED2 can notfully bind with TAZ1 at the same time. The IS state, with HIF-1 α partly bound and LPEL of CITED2 occupiedthe binding site, is the main intermediate state between HB and CB states with about 3.07 kT less stable thanCB state. The α C helix and C-terminus of HIF-1 α bind with one side of TAZ1; α A helix and LPEL motif ofCITED2 bind with the other side of TAZ1 at the IS state. The other states, including the both unbound state (UBstate, about 7.05 kT higher than CB state) and the HB1 state near HB state (with part of HIF-1 α bound, morethan 5 kT higher than CB state), have much higher free energy values than the CB state. The HB1 state containsa few conﬁgurations with similar free energy values and with α A of CITED2 bound ( Q inter (TAZ1-HIF-1 α )about 0.61 and Q inter (TAZ1-CITED2) ∼ α and4 .0 0.2 0.4 0.6 0.8 1.00.00.20.40.60.81.0 Q inter (TAZ1-HIF-1 α ) Q i n t e r ( T A Z - C I T E D ) LPQLLPEL

UB HBCB IS HB1

CITED2N-ter CITED2C-ter HIF-1 α N-terHIF-1 α C-terNC (0.00, 0.57)0.00 kT (0.28, 0.57)3.07 kT(0.00, 0.00)7.05 kT (0.61, 0.00)1.91 kT

HIF-1 α C-ter HIF-1 α N-terCITED2N-ter CITED2C-ter CITED2C-terCITED2N-terHIF-1 α N-terHIF-1 α C-ter α A α B α C α A Fig. 2

The free energy surface at experimental temperature as a function of Q inter (TAZ1-HIF-1 α ) and Q inter (TAZ1-CITED2), as well as the main states on the free energy surface (HB1 is not a basin on free energy surface butan area near HB state with part of CITED2 bound). The values of the reaction coordinates (two Q inter values) as wellas the free energy at these main states are labeled in this ﬁgure. TAZ1 is shown in yellow ribbons (the bound Zn + ions are shown with purple spheres), HIF-1 α is shown in green ribbons, and CITED2 is shown in magenta ribbons.The LPQL and LPEL motifs are colored in blue. The free energy unit is kT (k is Boltzmann constant). CITED2 are consistent with that proposed in the schematic mechanism for displacement of HIF-1 α from itscomplex with TAZ1 by CITED2 .In addition, it is worth noting that though TAZ1-HIF-1 α complex and TAZ1-CITED2 complex have similarbinding afﬁnity of the binary system in both experiment and theory, the TAZ1-HIF-1 α state (HB) and TAZ1-CITED2 state (CB) have different free energy value in the ternary system. The free energy results suggest thatCITED2 has more opportunity to bind to TAZ1 than HIF-1 α in the ternary system Tab. 1. The binding barrierdata discussed above (Fig. S1) indicates that the binding rate of CITED2 associated with TAZ1 is higher thanthat of HIF-1 α associated with TAZ1, suggesting that CITED2 may bind to TAZ1 before HIF-1 α in the ternarysystem. Moreover, in the ternary system, the binding free energy of HIF-1 α in the ternary system (-5.14 kT,see Tab. 1) is higher than that in the binary system (-6.75 kT); the binding free energy of CITED2 in the ternarysystem (-7.05 kT) is lower than that in the binary system (-6.76 kT). The results suggests that the bindingafﬁnity of CITED2 to TAZ1 is much higher than HIF-1 α to TAZ1 in the ternary system, which is consistentwith the tendency of experimental apparent K d (Tab. 1).We then calculated the distribution of the helical content and Q intra on both binding coordinates in orderto show the conformational changes of HIF-1 α and CITED2 during binding. As shown in Fig. S3, HB andHB1 states have similar Q intra (HIF-1 α ) and helical content of HIF-1 α , which is much higher than that of CBand IS states. Likewise, the Q intra (CITED2) and helical content of CITED2 decrease from CB and IS states tothe HB and HB1 states. Similar behavior can also be found in LPQL and LPEL motifs. As shown in Fig. 3,5 inter (TAZ1-HIF-1 α ) Q inter (TAZ1-HIF-1 α ) Q i n t e r ( T A Z - C I T E D ) Q i n t e r ( T A Z - C I T E D ) A B

Fig. 3

Mean Q inter between TAZ1 and LPQL of HIF-1 α (A) and Q inter between TAZ1 and LPEL of CITED2 (B)as a function of Q inter (TAZ1-HIF-1 α ) and Q inter (TAZ1-CITED2). The locations of UB, single-ligand-bound (HBor CB), and IS states are labeled as gray, blue, and pink circles. CBUB HBIS

CIH or CUH processHIC or HUC process

CBUB HB

UH processUC process direct binding replacing

A B

Fig. 4

Direct binding (starting from UB state) and replacing (starting from HB or CB state) processes. The proba-bility of each binding or replacing process is roughly illustrated with the width of the arrow. The binding time (mean

FPT on ) of each process is labeled on the arrow. LPQL/LPEL leaves from the binding site when the other ligand almost fully binds to TAZ1. Therefore, in thedirect binding process (from UB to HB/CB state), LPQL/LPEL occupies the binding site at the early part ofthe binding process (after transition state). However in the replacement process (from CB to HB or from HB toCB state via IS state), LPQL/LPEL partly binds to TAZ1 at the late binding process (after IS state or after HB1state). α -CITED2 complex As shown in the free energy proﬁle in Fig. 2, there are four main states (UB, HB, CB, and IS) in the ternarybinding system. From the UB state, TAZ1 can bind to HIF-1 α or CITED2 to reach HB or CB state via directbinding process (Fig. 4A). HIF-1 α can replace the TAZ1-bound CITED2 to reach HB state via IS state or UBstate (CIH or CUH replacing process), on the other hand CITED2 can replace the TAZ1-bound HIF-1 α toreach CB state via IS state or UB state (HIC or HUC replacing process, see Fig. 4B). Aiming to explore thedetails in the binding processes of the two ligands to TAZ1, we performed kinetic simulations with differentinitial states at the experimental temperature. Firstly, for direct binding (UB as the initial state), mean bindingtime (mean ﬁrst passage time, mean FPT on ) values for direct binding are 1.431 ns and 0.286 ns, respectively(Tab. 2). CITED2 binds to TAZ1 faster than HIF-1 α , which is consistent with the analyses above. Aimingto obtain the binding probability for the two different ligands, 200 individual kinetic simulations started withvarying both unbound (both HIF-1 α and CITED2 isolated in one sphere) conﬁgurations were performed. Thisternary system reaches CB state ﬁrst (UB to CB state, denoted as UC pathway) in 177 of the 200 runs, andreaches HB state ﬁrst (UB to HB state, denoted as UH pathway) in the other 23 of the 200 runs.6 ab. 2 Kinetic binding time (mean ﬁrst passage time of binding, mean

FPT on , ns) of different binding processes.200 kinetic runs were performed with each starting state (UB, HB, or CB). Direct bindingprocess mean

FPT on UB to HB 1.431UB to CB 0.286Replacingprocess mean

FPT on CB to HB 163.202HB to CB 60.758We then investigated the replacing process. 200 kinetic runs started with varying TAZ1-CITED2 conﬁg-urations (CB state) and 200 kinetic runs started with varying TAZ1-HIF-1 α conﬁgurations (HB state) wereperformed, respectively. The mean FPT on of the replacement of CITED2 by HIF-1 α is 163.202 ns, which ishigher than that of the replacement of HIF-1 α by CITED2 (60.758 ns as listed in Tab. 2). There are 2 differentpathways of the replacing process, one is via the IS state (denoted as CIH or HIC pathway), the other is viathe UB state (denoted as CUH or HUC pathway). The kinetic runs suggest that the probability of the pathwaywith IS state as processing state (CIH or HIC) is much higher (more than 10 folds) than that of the pathwaywith UB state as processing state (CUH or HUC), which means that the pathways via the IS state (CIH/HIC)are more favorable than those via the UB state (CUH/HUC). The mean FPT on of of each possible pathway hasbeen labeled in Fig. 4. Note that these data were collected for the successful binding pathways, as a result themean FPT on from CB to HB (or from HB to CB) in Tab. 2 is higher than that of the CIH/CUH (or HIC/HUC)pathway due to including the unsuccessful binding attempts. Moreover, it will need exceptionally more time(nearly 100 folds) for the process of replacement (CIH/CUH or HIC/HUC) than for the direct binding process(UH or UC), because the leaving process of the ﬁrst binding ligand is time-consuming. The barrier of theleaving process of the ﬁrst binding ligand in CIH or HIC process is about 4 to 5 kT, which is much higher thanthat of the direct binding process (UH or UC, about 1 to 2 kT). Intriguingly, we found the mean FPT on valuesof the replacing pathways via IS and UB states (successful bindings) are similar (see Fig. 4). The free energysurface proﬁle (see Fig. 2) suggests that the barrier of the transition from one ligand bound state (CB or HB) toIS state is lower than the barrier from one ligand bound state (CB or HB) to UB state. As a result, it is easierto reach the IS state than to reach the UB state from the one ligand bound state. However, the second step ofreplacing process vis IS (from IS to CB or HB state) has a bit higher barrier than that via UB (from UB to CBor HB state).The contact maps between TAZ1 and HIF-1 α and between TAZ1 and CITED2 at the transition state (TS)of the different pathways are illustrated in Fig. 5 to show which part is important for the initial binding process.Because CUH and HUC pathways include the UH and UC binding pathways as the second binding step, herewe analyze the inter-chain contact maps of UH, UC (direct), as well as CIH, HIC (replacing) pathways. Asshown in Fig. 5A and B, for the direct binding process, LPQL motif (residue 792-795 in 1L8C) and C-terminusof HIF-1 α have strong interactions with TAZ1; N-terminus and LPEL motif (residue 243-246 in 1R8U) ofCITED2 are crucial for the initial binding to TAZ1. Additionally, there are abundant non-native interactionsformed in transition states, implying that TS is highly non-speciﬁc. In contrast, for the replacement via Istate (Fig. 5C and D), only C-terminus of HIF-1 α and N-terminus of CITED2 are important for the transitionstate. The interactions between LPQL/LPEL motif and TAZ1 are relatively weak or vanish. And the non-nativeinteractions are highly oriented. Therefore, the LPQL/LPEL motif may take different roles in the direct bindingand replacing pathways. φ value analysis We then divided HIF-1 α and CITED2 into several parts for analysis (illustrated in Fig. 1) and calculatedthe evolution of the interactions between these parts and TAZ1 in different pathways. As shown in Fig. 6,7 B

780 790 800 810 820 350 360 370 380 390 400 410 420 430 440 R e s i due nu m be r H I F - α R e s i due nu m be r C I T E D C D

780 790 800 810 820 350 360 370 380 390 400 410 420 430 440 R e s i due nu m be r H I F - α Residue number TAZ1 0 0.1 0.2 0.3 0.4 0.5 220 230 240 250 260 350 360 370 380 390 400 410 420 430 440 R e s i due nu m be r C I T E D Residue number TAZ1 0 0.1 0.2 0.3 0.4 0.5

Fig. 5

The probability of the interactions between TAZ1 (residue 345 to 439, as well as 3 Zn + ) and HIF-1 α (residue 776 to 826) or between TAZ1 and CITED2 (residue 220 to 269) at the transition state of the binding process(0 . < Q inter < .

1) in the direct binding processes, UH (A) and UC (B) pathways, as well as in the replacingprocesses, CIH (C) and HIC (D) pathways. Native contacts are labeled in triangles and non-native contacts arelabeled as circles. For non-native contacts, a contact is considered formed if the distance between the two residues islower than 10.0 Å. different pathways have different interacting regions and orders. For HIF-1 α direct binding processes, L2region (including LPQL motif) and α C reach TAZ1 ﬁrst in the UH pathway (Fig. 6A), followed by the α Bhelix and L3 region. The α A helix and L1 bind to TAZ1 last. In addition, Fig. 6A indicates that Q inter betweenthe N-terminus (including α A and L1) is lower than 0.2 at the basin of bound state ( Q inter (TAZ1-HIF-1 α )about 0.61). This suggests that the N-terminus of HIF-1 α does not bind to TAZ1 closely at bound state. Wethen calculated the Q intra within the different parts of the ligand. It is clear in Fig. S4A that HIF-1 α folds asbinding to TAZ1, except for the part of α A helix. At unbound state, the folding level of α C is higher thanthat of α A and α B. The situation is a bit different in the CIH pathway. As illustrated in Fig. 6C, because ofthe existence of CITED2, α A, α B, L1, L2 (including the LPQL motif) of HIF-1 α begin to bind to TAZ1 untilCITED2 is away from the binding site. The C-terminus (including the α C and L3) binds to TAZ1 ﬁrst. Incontrast, the existence of CITED2 causes the folding of α B of HIF-1 α later and α C earlier in the CIH pathwaythan that in the UH pathway (see Fig. S4A). And this does not have an effect on the folding of α A of HIF-1 α .For the CITED2 direct binding processes, it seems that the L1 region of the N-terminus of CITED2 bindsto TAZ1 ﬁrst in the UC pathway (Fig. 6B), followed by the other parts. And the CITED2 folds gradually asbinding to TAZ1 (shown in Fig. S4B). However, in the HIC pathway, α A and L1 of CITED2 get in touch withTAZ1 ﬁrst, while the C-terminus, L2 region (including the LPEL motif), occupies the binding site until HIF-1 α leaves the TAZ1. The existence of HIF-1 α causes the folding of α A earlier in the HIC pathway than that in theUC pathway. Moreover, the CIH pathway, the C-terminus (including the α C and L3) of HIF-1 α is the last partaway from TAZ1, which is also the ﬁrst part that binds to TAZ1; for HIC pathway, the N-terminus (includingthe α A and L1) of CITED2 leaves TAZ1 last, which is the ﬁrst part that binds to TAZ1.Aiming to ﬁnd out the crucial residues in the binding process, we calculated the φ value of different bindingpathways. For direct binding with UH pathway (see Fig. S5A and B), most φ values of HIF-1 α are lower than0.2, except for the Gly791, Leu792, and Gln824. The residues with relatively high φ values locate on the α A,LPQL motif, and the C-terminus of HIF-1 α . While in the CIH pathway with the existence of CITED2 (see Fig.8 m ean Q α A α B α CL1L2L3LPQL 0 0.2 0.4 0.6 0.8

C DE F m ean Q Q inter (TAZ1-HIFA-1 α ) α AL1L2LPEL 0 0.2 0.4 0.6 0.8 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7Q inter (TAZ1-CITED2) 0 0.2 0.4 0.6 0.8 1 m ean Q α A α B α CL1L2L3LPQL

A B α AL1L2LPEL

Fig. 6

The mean Q inter between different parts of ligand (HIF-1 α or CITED2) and TAZ1 as a function of bindingin the UH (A), UC (B), CIH (C and E), and HIC (D and F) pathways. Panels A, C, D show the Q inter curves ofHIF-1 α ; Panels B, E, and F show the Q inter curves of CITED2. S6A and B), the φ values of α A and LPQL motif decrease and the φ values of the C-terminus increase whencompared with the UH pathway. These results agree with the analysis of native contact distribution above. Forboth UC and HIC pathways (see Fig. S5C and D, Fig. S6C and D), the N-terminus of CITED2 has higher φ values than other parts of CITED2, especially the Phe222 and Ile223.Some experimental φ values of HIF-1 α are labeled in Fig. S5B. Most simulated φ values at these residuelocations are lower than 0.2, in agreement with the experimental φ values. The experimental results demon-strate that native hydrophobic binding interactions have not been created yet at the rate-limiting transition statefor binding between TAZ1 and HIF-1 α , which is consistent with our ﬁndings that the ligand changes its con-formation and folds after the transition state. However, the mutation V825A (red dot in Fig. S5B) has a muchhigher experimental φ value (0.34) than the simulated one (0.15). We have noticed that both ∆∆ G Eq and ∆∆ G TS are negative , which means that this mutation will stabilize both the transition state and the bound state. But inthe theoretical method of φ value calculation, we assume that the mutation will break the interactions betweenthis residue and the others . And the theoretical φ value calculation will be sensitive for the residue siteswith high φ values and accurate for the φ value results driven by native contacts. Perhaps this can explain thedifference between these two φ values. 9 Conclusions

The TAZ1 domain of CREB binding protein is reported to be associated with two different targets that shareparts of the binding site. It has been found that in the solution of TAZ1 with two different targets HIF-1 α andCITED2, TAZ1 prefers to form complex with CITED2 rather than HIF-1 α . Aiming to determine the underlyingmechanism of the competitive binding between HIF-1 α and CITED2, we performed coarse-grained moleculardynamics simulations by developing the structure-based model of TAZ1, HIF-1 α and CITED2 systems. Severalmain states (UB, HB, CB, and IS) and a sub-state HB1 can be quantiﬁed on the free energy surface of ternarysystem (TAZ1, HIF-1 α , and CITED2). The simulated mechanism is consistent with the previous experimentalhypothesis about the replacing process and the apparent afﬁnity. The results suggest the dominant position offorming TAZ1-CITED2 complex in both thermodynamics and kinetics. In addition, the analysis about the inter-chain contacts between TAZ1 and target shows the different binding order of each domain in direct binding andreplacing processes. In the replacing process CIH, the crucial LPQL and α B domain of HIF-1 α access theirbinding site after the intermediate state, which is different from that in the direct binding process UH. Besides,the different binding processes (CIH and UH) will also change the distribution of the HIF-1 α φ values. For thereplacing process HIC, the intermediate state locates close to the CITED2 bound state. The different bindingprocesses (HIC and UC) will not change the distribution of the CITED2 φ values signiﬁcantly. NMR structures 1L8C and 1R8U were used for preparing the initial models of TAZ1-HIF-1 α andTAZ1-CITED2 complexes. Full-length HIF-1 α and CITED2 are 51 and 50 a.a. proteins (776-826 and 220-269), respectively. The initial coarse-grained C α structure-based model (SBM) of TAZ1-HIF-1 α and TAZ1-CITED2 complexes was generated using SMOG on-line toolkit . There are 3 Zn + ions linked with TAZ1with coordination bonds, modeled by one bead with 2 positive charge (+2e) for each ion. In the present work,the weighted contact map was built with all the 20 conﬁgurations in each NMR structure. Each native contactwas identiﬁed by the CSU algorithm . The weighted coefﬁcient (for intermolecular contacts and the contactswithin HIF-1 α /CITED2) is the frequency of occurrence in all the conﬁgurations, similar as the method inour previous studies . All simulations were performed with Gromacs 4.5.5 . The coarse grained moleculardynamics simulations (CGMD) used Langevin equation with constant friction coefﬁcient γ = .

References [1] P. E. Wright and H. J. Dyson,

J. Mol. Biol. , 1999, , 321–331.[2] A. K. Dunker, J. D. Lawson, C. J. Brown, R. M. Williams, P. Romero, J. S. Oh, C. J. Oldﬁeld, A. M.Campen, C. M. Ratliff, K. W. Hipps et al. , J. Mol. Graph. Model. , 2001, , 26–59.[3] V. N. Uversky, Protein Sci. , 2002, , 739–756.[4] H. J. Dyson and P. E. Wright, Curr. Opin. Struct. Biol. , 2002, , 54–60.[5] H. J. Dyson and P. E. Wright, Nat. Rev. Mol. Cell Biol. , 2005, , 197–208.106] A.-C. Gavin, M. Bösche, R. Krause, P. Grandi, M. Marzioch, A. Bauer, J. Schultz, J. M. Rick, A.-M.Michon, C.-M. Cruciat et al. , Nature , 2002, , 141.[7] Y. Ho, A. Gruhler, A. Heilbut, G. D. Bader, L. Moore, S.-L. Adams, A. Millar, P. Taylor, K. Bennett,K. Boutilier et al. , Nature , 2002, , 180.[8] P. E. Wright and H. J. Dyson,

Nat. Rev. Mol. Cell Biol. , 2015, , 18.[9] Z. Arany, L. E. Huang, R. Eckner, S. Bhattacharya, C. Jiang, M. A. Goldberg, H. F. Bunn and D. M.Livingston, Proc. Natl. Acad. Sci. U. S. A. , 1996, , 12969–12973.[10] S. A. Dames, M. Martinez-Yamout, R. N. De Guzman, H. J. Dyson and P. E. Wright, Proc. Natl. Acad.Sci. U. S. A. , 2002, , 5271–5276.[11] S. J. Freedman, Z.-Y. J. Sun, F. Poy, A. L. Kung, D. M. Livingston, G. Wagner and M. J. Eck, Proc. Natl.Acad. Sci. U. S. A. , 2002, , 5367–5372.[12] S. Bhattacharya, C. L. Michels, M.-K. Leung, Z. P. Arany, A. L. Kung and D. M. Livingston, Genes Dev. ,1999, , 64–75.[13] R. N. De Guzman, M. A. Martinez-Yamout, H. J. Dyson and P. E. Wright, J. Biol. Chem. , 2004, ,3042–3049.[14] S. J. Freedman, Z.-Y. J. Sun, A. L. Kung, D. S. France, G. Wagner and M. J. Eck,

Nat. Struct. Mol. Biol. ,2003, , 504.[15] R. B. Berlow, H. J. Dyson and P. E. Wright, Nature , 2017, , 447.[16] W.-T. Chu, X. Chu and J. Wang,

Proc. Natl. Acad. Sci. U. S. A. , 2017, , E7959–E7968.[17] X. Chu, Y. Wang, L. Gan, Y. Bai, W. Han, E. Wang, J. Wang et al. , PLoS Comput. Biol. , 2012, , e1002608.[18] X. Chu, F. Liu, B. A. Maxwell, Y. Wang, Z. Suo, H. Wang, W. Han and J. Wang, PLoS Comput. Biol. ,2014, , e1003804.[19] Y. Wang, C. Tang, E. Wang and J. Wang, PLoS Comput. Biol. , 2012, , e1002471.[20] W.-T. Chu, J. Clarke, S. L. Shammas and J. Wang, PLoS Comput. Biol. , 2017, , e1005468.[21] X. Chu, L. Gan, E. Wang and J. Wang, Proc. Natl. Acad. Sci. U. S. A. , 2013, , E2342–E2351.[22] W.-T. Chu and J. Wang,

ACS Cent. Sci. , 2018, , 1015–1022.[23] S. Kumar, J. M. Rosenberg, D. Bouzida, R. H. Swendsen and P. A. Kollman, J. Comput. Chem. , 1992, ,1011–1021.[24] S. Kumar, J. M. Rosenberg, D. Bouzida, R. H. Swendsen and P. A. Kollman, J. Comput. Chem. , 1995, ,1339–1350.[25] I. Lindström, E. Andersson and J. Dogan, Sci. Rep. , 2018, , 7872.[26] C. Clementi, H. Nymeyer and J. N. Onuchic, J. Mol. Biol. , 2000, , 937–953.[27] J. K. Noel, P. C. Whitford, K. Y. Sanbonmatsu and J. N. Onuchic,

Nucleic Acids Res. , 2010, , W657–W661.[28] J. K. Noel, P. C. Whitford and J. N. Onuchic, J. Phys. Chem. B , 2012, , 8692–8702.[29] H. Lammert, A. Schug and J. N. Onuchic,

Proteins: Struct., Funct., Bioinf. , 2009, , 881–891.1130] V. Sobolev, A. Sorokine, J. Prilusky, E. E. Abola and M. Edelman, Bioinformatics , 1999, , 327–332.[31] B. Hess, C. Kutzner, D. Van Der Spoel and E. Lindahl, J. Chem. Theory Comput. , 2008, , 435–447.[32] G. A. Tribello, M. Bonomi, D. Branduardi, C. Camilloni and G. Bussi, Comput. Phys. Commun. , 2014, , 604–613. 12

I Appendix:Investigations of the Underlying Mechanisms of HIF-1 α and CITED2 Binding to TAZ1 September 4, 2019

NMR structures 1L8C and 1R8U were used for preparing the initial models of TAZ1-HIF-1 α and TAZ1-CITED2 complexes. Full-length HIF-1 α and CITED2 are 51 and 50 a.a. proteins(776-826 and 220-269), respectively. The initial coarse-grained C α structure-based model (SBM) ofTAZ1-HIF-1 α and TAZ1-CITED2 complexes was generated using SMOG on-line toolkit . Thereare 3 Zn + ions linked with TAZ1 with coordination bonds, modeled by one bead with 2 positivecharge (+2e) for each ion. In the present work, the weighted contact map was built with all the 20conﬁgurations in each NMR structure. Each native contact was identiﬁed by the CSU algorithm .The weighted coefﬁcient (for intermolecular contacts and the contacts within HIF-1 α /CITED2) is thefrequency of occurrence in all the conﬁgurations, similar as the method in our previous studies . Thepotential energy function consists of both bonded and non-bonded terms. Additionally, we introducedthe charge characteristics into our SBM model to study the electrostatic interactions in this system.As a result, the potential energy form used in this study is given in the following equation: V = ∑ bonds ε r ( r − r ) + ∑ angles ε h ε θ ( θ − θ ) + ∑ dihedral ε h ε ( n ) φ ( − cos ( n × ( φ − φ )))+ ∑ contacts ε i j (cid:18) σ i j r i j (cid:19) − (cid:18) σ i j r i j (cid:19) ! + ∑ non − contacts ε NC (cid:18) σ NC r i j (cid:19) + ε DH V Debye − H ¨ uckel (S1)In Eq. S1, ε r = ε , ε θ = ε , ε ( ) φ = ε and ε ( ) φ = . ε . Electrostatic interactions

The electrostatic interactions are calculated by the

Debye − H ¨ uckel model, which can quantify thestrength of charge-charge attraction and repulsion at various salt concentrations: V Debye − H ¨ uckel = Γ DH × K coulomb B ( κ ) ∑ i , j q i q j exp (cid:0) − κ r i j (cid:1) ε r i j (S2)In Eq. S2, K coulomb = πε = . kJ · mol − · nm · e − is the electric conversion factor; B ( κ ) is thesalt-dependent coefﬁcient; κ − is the Debye screening length, which is directly inﬂuenced by thesolvent ion strength (IS)/salt concentration C salt ( κ ≈ . √ C salt ); ε is dielectric constant, which isset to 80 during the simulations. Γ DH is the energy scaled coefﬁcient which aims to make the totalenergy balanceable. In our model, Lys and Arg have a positive point charge (+e), Asp and Glu have anegative point charge (-e). All the charges are placed on the C α atoms. Under the physiological ionicstrengths ( C salt = . M ), κ is 1.24 nm − , so we set Γ DH = .

535 in our simulations, so that V DH for two opposite charged atoms located at a distance of 0.5 nm matches the native contact energy.When a native contact is an ionic pair (salt bridge), we rescaled its interaction strength by setting ε DH = . . More detailsof Debye − H ¨ uckel model can be found in these papers .2 .2 Parameter calibration For angle and dihedral terms, some hinge regions were deﬁned according to the structural ﬂex-ibility of the NMR data, aiming to collect the information of conformational change during ligandbinding. In this method, local interactions are weakened by decreasing the site-speciﬁc constantsfrom the previous studies . When variances of angle and dihedral are higher than 12.82 and 40.50degrees, the potential energies of them are higher than 1.0 kJ/mol. Because the structure of TAZ1is stable among the NMR structures of each complex, here we calculated the hinge regions of TAZ1between TAZ1-HIF-1 α and TAZ1-CITED2 to ensure the conformational ﬂexibility of TAZ1 in theternary complex. Then the hinge regions of HIF-1 α or CITED2 were measured within their structuresin 1L8C or 1R8U, respectively. In this model, if the angle or dihedral belong to the hinge regions, ε θ or ε ( n ) φ is rescaled by setting ε h = .

01, mimicking the ﬂexibility. Otherwise ε h = α , intra-CITED2 terms, as well as the inter-molecular terms be-tween TAZ1 and HIF-1 α , between TAZ1 and CITED2. These parts have different parameters ofLennard-Jones potential ( ε i j ): V nativenon − bond = α T V TAZ intra + α H V HIF − α intra + α C V CIT ED intra + β H V TAZ − HIF − α inter + β C V TAZ − CIT ED inter (S3)As shown in Eq. S3, the α parameters are set for intra-molecular terms and the β parameters are setfor inter-molecular terms. The strength α T was set to 1.0. Other intra-molecular parameters α H and α C were tuned according to the helical content of HIF-1 α and CITED2 at unbound state. The inter-molecular parameters β H and β C were tuned according to the dissociation constant K d in experiment.There are a little differences between the structures of TAZ1 in 1L8C and 1R8U. Therefore wekept the intra-TAZ1 native contacts with the ratio of distances R i j in 1L8C and 1R8U in the range of0.8 to 1.25, and discarded the other native contacts to make the TAZ1 more “ﬂexible”. For the MDsimulation with structure-based model was run with reduced units, the simulation temperature shouldbe calibrated ﬁrstly. However, there is no melting temperature of TAZ1 reported. Because there isa critical phenomenon that the three Zn + ions stabilize the structure of TAZ1, we built 4 differentmodels of TAZ1 (TAZ1 with 3 Zn + , 2 Zn + , 1 Zn + , as well as TAZ1 without Zn + ) to ﬁnd outthe effect of Zn + on the thermodynamic stability of TAZ1. As shown in Fig. S7, at temperatureabout 0.99, TAZ1 with 3 Zn + is stable but TAZ1 without Zn + unfolds. As a result, the simulationtemperature is set to 0.99, mimicking the room temperature.In the experiment, the isolate HIF-1 α or CITED2 was considered as ”random coil” . Herewe calibrated the model to make the helical content of HIF-1 α /CITED2 below 10%. The helicalcontent can not decline by just altering the strength of the native contact within HIF-1 α /CITED2. Wethen adjusted the dihedral potential of HIF-1 α and CITED2 empirically by adding a term V ( φ ) = k φ cos [ φ − δ ] , where δ = . ◦ . We tuned the value of k φ and found that when it equals to 1.0 or0.9, the helical content of HIF-1 α /CITED2 is about 10%.In order to achieve sufﬁcient sampling, TAZ1-HIF-1 α and TAZ1-CITED2 were placed in a spherewith a radius of 6 nm, leading to an effective concentration for the components of TAZ1 ( (cid:2) C (cid:3) Sim )about 1.83 mM ( (cid:2) C (cid:3) Sim = V , where V is the box volume in units of ˚A , 1660 is the unit transfer-ring constant from units of molecules per ˚A to units of mol/L, similar settings can be also found inprevious papers ). Therefore, K d = [ L ] [ R ][ LR ] = (cid:16) P ub (cid:2) C (cid:3) Sim (cid:17) (cid:16) P ub (cid:2) C (cid:3) Sim (cid:17) P b [ C ] Sim = P ub (cid:2) C (cid:3) Sim P b (S4)3here P ub and P b are the fractions of population of unbound states and bound states at equilibrium,respectively. From the experimental K d , we can obtain the ratio of P ub / P b at simulation condition(initial concentration (cid:2) C (cid:3) Sim ). Then we can obtain ∆ G by applying the Boltzamann distribution ∆ G = kT ln ( P ub / P b ) . At equilibrium of the simulations, P b is far larger than P ub and close to 1. Asa result, when the experimentally determined K d between TAZ1 and HIF-1 α /CITED2 is 10 nM ,the binding free energy between ligand and TAZ1 in Eq. S4 is about -6.06 kT in our model, if theeffective simulation concentrations were applied. The method of binding free energy calculation isthe same as that in the previous papers . The strengths of inter-molecular interactions ( β H and β C ) were tuned by performing a series of REMD simulations on TAZ1-HIF-1 α and TAZ1-CITED2complexes, respectively. As shown in Fig. S8, both β H and β C are set to be 1.1 and 0.95 in our model.In the ternary system, we can obtain the simulated K d by using the free energy difference betweenbound state (HB or CB) and unbound state (UB). All simulations were performed with Gromacs 4.5.5 . The coarse grained molecular dynamicssimulations (CGMD) used Langevin equation with constant friction coefﬁcient γ = .

0. The cutofflength for non-bonded interactions was set to 3.0 nm. The MD time step was set to 0.5 fs and thetrajectories were saved every 2 ps. To enhance the sampling of binding events, a strong harmonicpotential was added if the distance between the center of mass of TAZ1 and HIF-1 α , TAZ1 andCITED2 is greater than 6 nm .For thermodynamic simulations (binding and unbinding for multiple times), REMD simulationsand long-time MD simulations were performed to overcome the energy barriers between bound andunbound states. We deﬁne that a native contact is formed if the C α -C α distance between any givennative atom pair is within 1.2 times of its native distance. Then the proﬁles of free energy curve orsurface can be obtained by using WHAM algorithm . The REMD simulations have been tested tobe converged that the fraction of all the native contacts in the ternary system (Q) becomes equilibratedand stable after about 150 ns of each replica (see Fig. S9).For kinetic simulations, 200 individual MD runs started with varying conﬁgurations and velocitieswere performed on different processes respectively: direct binding (both unbound state to TAZ1-HIF-1 α or TAZ1-CITED2 state) and replacement (CITED2 binding to TAZ1 by replacing HIF-1 α andHIF-1 α binding to TAZ1 by replacing HIF-1 α ). φ value of binding The calculation of φ value is referred to the previous simulation papers . The φ i j for eachinter-molecular native contact pair between residue i and j was computed from the probability offormation, P i j : φ i j = ∆∆ F T S − U ∆∆ F B − U ≈ P T Si j − P Ui j P Bi j − P Ui j (S5)where ∆∆ F is the free energy difference between the wild-type and mutated protein, P i j is the prob-ability of formation of contact between i and j . Here, U, TS, B correspond to the unbound state,transition state ( Q inter ∼ . − . φ i value of residue i can becalculated from the average of φ i j that are involved with residue i .4 P o t en t i a l m ean f o r c e ( k T ) Q inter HIF-1 α CITED2

Fig. S1

The free energy curves as a function of the fraction of inter-molecular native contacts ( Q inter ) ofTAZ1-HIF-1 α (green line) and TAZ1-CITED2 (magenta line). The free energy unit is kT (k is Boltzmannconstant). References [1] S. A. Dames, M. Martinez-Yamout, R. N. De Guzman, H. J. Dyson and P. E. Wright,

Proc. Natl.Acad. Sci. U. S. A. , 2002, , 5271–5276.[2] R. N. De Guzman, M. A. Martinez-Yamout, H. J. Dyson and P. E. Wright, J. Biol. Chem. , 2004, , 3042–3049.[3] J. K. Noel, P. C. Whitford, K. Y. Sanbonmatsu and J. N. Onuchic,

Nucleic Acids Res. , 2010, ,W657–W661.[4] C. Clementi, H. Nymeyer and J. N. Onuchic, J. Mol. Biol. , 2000, , 937–953.[5] J. K. Noel, P. C. Whitford and J. N. Onuchic,

J. Phys. Chem. B , 2012, , 8692–8702.[6] H. Lammert, A. Schug and J. N. Onuchic,

Proteins: Struct., Funct., Bioinf. , 2009, , 881–891.[7] V. Sobolev, A. Sorokine, J. Prilusky, E. E. Abola and M. Edelman, Bioinformatics , 1999, ,327–332.[8] W.-T. Chu, X. Chu and J. Wang, Proc. Natl. Acad. Sci. U. S. A. , 2017, , E7959–E7968.[9] Y. Levy, J. N. Onuchic and P. G. Wolynes,

J. Am. Chem. Soc. , 2007, , 738–739.[10] A. Azia and Y. Levy,

J. Mol. Biol. , 2009, , 527–542.[11] O. Givaty and Y. Levy,

J. Mol. Biol. , 2009, , 1087–1097.5 .0 0.2 0.4 0.6 0.8 1.00.00.20.40.60.81.0 Q inter (TAZ1-HIF-1 α ) Q inter (TAZ1-HIF-1 α ) Q i n t r a ( H I F - α ) Q i n t r a ( C I T E D ) H e li c a l c on t en t o f H I F - α H e li c a l c on t en t o f C I T E D Q inter (TAZ1-CITED2) Q inter (TAZ1-CITED2) A BC D

Fig. S2

The free energy surfaces projected on (A) both the fraction of inter-molecular native contacts( Q inter ) of TAZ1-HIF-1 α and the fraction of intra-molecular native contacts ( Q intra ) of HIF-1 α ; (B) both Q inter of TAZ1-HIF-1 α and the helical content of HIF-1 α ; (C) both Q inter of TAZ1-CITED2 and Q intra ofCITED2; (D) both Q inter of TAZ1-CITED2 and the helical content of CITED2. The free energy unit is kT(k is Boltzmann constant). .0 0.2 0.4 0.6 0.8 1.00.00.20.40.60.81.0 Q inter (TAZ1-HIF-1 α ) Q inter (TAZ1-HIF-1 α ) Q i n t e r ( T A Z - C I T E D ) Q i n t e r ( T A Z - C I T E D ) Q inter (TAZ1-HIF-1 α ) Q inter (TAZ1-HIF-1 α ) Q i n t e r ( T A Z - C I T E D ) Q i n t e r ( T A Z - C I T E D ) A BC D

Fig. S3

Mean Q intra of HIF-1 α (A) and CITED2 (B) as well as mean helical content of HIF-1 α (C) andCITED2 (D) as a function of Q inter (TAZ1-HIF-1 α ) and Q inter (TAZ1-CITED2). DE F m ean Q α A α B α Call HIF-1 α m ean Q Q inter (TAZ1-HIF-1 α ) α Aall CITED2 0 0.2 0.4 0.6 0.8 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7Q inter (TAZ1-CITED2)

A B m ean Q α A α B α Call HIF-1 α α Aall CITED2

Fig. S4

The mean Q intra within different parts of ligand (HIF-1 α or CITED2) as a function of bindingin the UH (A), UC (B), CIH (C and E), and HIC (D and F) pathways. Panels A, C, and D show the Q intra curves of HIF-1 α ; Panels B, E, and F show the Q intra curves of CITED2. α α α α α α α α A α B α C LPQL α A LPEL φ v a l ue φ v a l ue Residue number TAZ1Residue number TAZ1 Residue number HIF-1 α Residue number CITED2

A BC D

Fig. S5

The φ values of TAZ1 (residue 345 to 439, as well as 3 Zn + ) in UH (A) and UC (C) pathwaysas well as the φ values of HIF-1 α (residue 776 to 826) in UH pathway (B) and CITED2 (residue 220 to269) in UC pathway (D). The experimental φ values (in ref ) are shown in dots. The secondary structuresas well as the LPQL/LPEL motif are labeled. α α α α α α α α A α B α C LPQL α A LPEL φ v a l ue φ v a l ue Residue number TAZ1Residue number TAZ1 Residue number HIF-1 α Residue number CITED2

A BC D

Fig. S6

The φ values of TAZ1 (residue 345 to 439, as well as 3 Zn + ) in CIH (A) and HIC (C) pathwaysas well as the φ values of HIF-1 α (residue 776 to 826) in CIH pathway (B) and CITED2 (residue 220 to269) in HIC pathway (D). The secondary structures as well as the LPQL/LPEL motif are labeled. H ea t c apa c i t y Temperature3 Zn Fig. S7

The heat capacity curves of TAZ1 with 3 Zn + (red), 2 Zn + (green), 1 Zn + (blue), as well asTAZ1 without Zn + (magenta). A B -14-12-10-8-6-4-2 0 2 4 6 8 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 F ( k T ) Q inter (TAZ1-HIF-1 α ) β H =1.0 β H =1.1 β H =1.2 -10-8-6-4-2 0 2 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 F ( k T ) Q inter (TAZ1-CITED2) β C =0.92 β C =0.95 β C =0.98 Fig. S8

Free energy as a function of HIF-1 α binding (A) and CITED2 binding (B) with different param-eters. ig. S9 The fraction of all the native contacts in the ternary system as a function of simulation time of theREMD run (total 1 µ s per replica). The native contacts include the intra-molecular (within TAZ1, HIF-1 α ,and CITED2) and the inter-molecular (between TAZ1 and HIF-1 α , between TAZ1 and CITED2) contacts.The right panel (B) is the ﬁrst 400 ns. [12] X. Chu, Y. Wang, L. Gan, Y. Bai, W. Han, E. Wang, J. Wang et al. , PLoS Comput. Biol. , 2012, , e1002608.[13] Y. Wang, L. Gan, E. Wang and J. Wang, J. Chem. Theory Comput. , 2012, , 84–95.[14] K.-i. Okazaki, N. Koga, S. Takada, J. N. Onuchic and P. G. Wolynes, Proc. Natl. Acad. Sci. U.S. A. , 2006, , 11844–11849.[15] D. De Sancho and R. B. Best,

Mol. Biosyst. , 2012, , 256–267.[16] S. M. Law, J. K. Gagnon, A. K. Mapp and C. L. Brooks, Proc. Natl. Acad. Sci. U. S. A. , 2014, , 12067–12072.[17] J. Wang, Y. Wang, X. Chu, S. J. Hagen, W. Han and E. Wang,

PLoS Comput. Biol. , 2011, ,e1001118.[18] D. Ganguly and J. Chen, Proteins: Structure, Function, Bioinformatics , 2011, , 1251–1266.[19] R. B. Berlow, H. J. Dyson and P. E. Wright, Nature , 2017, , 447.[20] W.-T. Chu and J. Wang,

ACS Cent. Sci. , 2018, , 1015–1022.[21] B. Hess, C. Kutzner, D. Van Der Spoel and E. Lindahl, J. Chem. Theory Comput. , 2008, ,435–447.[22] G. A. Tribello, M. Bonomi, D. Branduardi, C. Camilloni and G. Bussi, Comput. Phys. Commun. ,2014, , 604–613.[23] S. Kumar, J. M. Rosenberg, D. Bouzida, R. H. Swendsen and P. A. Kollman,

J. Comput. Chem. ,1992, , 1011–1021.[24] S. Kumar, J. M. Rosenberg, D. Bouzida, R. H. Swendsen and P. A. Kollman, J. Comput. Chem. ,1995, , 1339–1350. 1225] Y. Levy, S. S. Cho, J. N. Onuchic and P. G. Wolynes, J. Mol. Biol. , 2005, , 1121–1145.[26] I. Lindstr¨om, E. Andersson and J. Dogan,

Sci. Rep. , 2018,8