LIPS program

LIPS program#

Introduction to LIPid-facing Surface#

TMKit integrates the LIPS (LIPid-facing Surface) method[1] to generate helix surfaces, LIPS scores, and entropy scores, facilitating the analysis of transmembrane protein interactions.

You can also refer to our recent publication[2] for further illustration. Transmembrane (TM) proteins are anchored to the membrane via α-helices, allowing them to interact with lipids. Therefore, studying lipid-accessible protein properties, such as helix orientation relative to lipids, is crucial. Below, we detail a helix orientation prediction process used in the LIPS method, which highlights the relationship between specialized structural features and the biological functions of membrane proteins.

Notably, TM proteins are enriched with coiled coils in TM regions, which contain heptad repeats—periodically recurring seven-amino-acid sequences labeled ABCDEFG. A heptad repeat generates seven distinct helical faces, with each of the seven residues alternately considered as an anchoring residue. According to Adamian and Liang[1], each anchoring residue is complemented by two adjacent residues (two positions apart), forming one of the seven surfaces. Thus, starting from the first residue A, the seven helical faces are ADE, BEF, CFG, DGA, EAB, FBC, and GCD, as demonstrated below.

../../_images/lips.png — **Caption**: Schematic illustration of helical surfaces generated using the LIPS method. (a) shows the representations of the heptad repeat (seven residues **ABCDEFG**) in sequence and structural contexts. (b) shows the seven canonical surfaces of transmembrane α-helices, which are generated by taking each of the seven residues as an anchoring residue. Each surface consists of an anchoring residue and two residues complementing the anchoring residue. The seven surfaces are **ADE**, **BEF**, **CFG**, **DGA**, **EAB**, **FBC**, and **GCD**.#

In the LIPS pipeline, this helical partitioning is applied systematically:

Sliding through the TM protein sequence, each residue is assigned to one of the seven helical faces. Each helical face receives an entropy score and a lipophilicity score. These scores are integrated to compute the LIPS score, which estimates helix orientation. Since the majority of residues involved in TM helix interactions align with heptad repeats, both face-level LIPS scores and residue-level lipophilicity scores may aid in identifying interaction sites within TM proteins. In MBPred, the significance of helical face-related scores in predicting interaction sites is demonstrated through:

Mean Decrease in Impurity (Gini Importance), and Leave-One-Out Cross-Validation tests. These findings reinforce the functional relevance of heptad repeat-derived structural features in TM protein interactions.

Example usage#

LIPS replies on an input of a multiple sequence alignment (MSA). We use the MSA of protein 1xqf chain A to generate LIPS results. We have put it in ./data/msa/.

Please note that TMKit includes a method that serves as a wrapper for the external lips.pl library, which can be accessed via the provided link. The lips.pl version in TMKit has been slightly modified from the original implementation, making it more convenient to retrieve results. The results will be saved in ./data/lips/.

Attributes#

Arribute	Description
`prot_name`	name of a protein in the prefix of a PDB file name (e.g., 1xqf in 1xqfA.pdb)
`file_chain`	chain of a protein in the prefix of a PDB file name (e.g., A in 1xqfA.pdb)
`df_prot`	Pandas dataframe storing protein names and chain names
`msa_path`	path where a protein MSA file is placed
`sv_fp`	path to save files

Output#

We show below that for a single protein 1xqf chain A, what kind of output the program will give users. First, it has 7 files for 7 surfaces, each having their residues residing. Finally, it outputs a summary file with 7 surfaces with LIPOPHILICITY, ENTROPY, and LIPS scores.

save path is: tmkit/data/example/1xqfA/
     SURFACE 0:
A  0.018 1.125
D  0.740 5.710
K  0.804 6.798
...
P  0.615 4.539
     SURFACE 1:
V  0.026 1.158
K  0.804 6.798
A  0.573 4.896
 ...
P  0.615 4.539
     SURFACE 2:
A  0.552 4.238
A  0.573 4.896
D  0.865 2.885
 ...
R  1.174 1.749
     SURFACE 3:
D  0.740 5.710
D  0.865 2.885
N  0.679 5.217
 ...
V  0.751 3.144
     SURFACE 4:
K  0.804 6.798
N  0.679 5.217
A  0.697 4.852
...
P  0.615 4.539
A  0.018 1.125
V  0.026 1.158
     SURFACE 5:
A  0.573 4.896
A  0.697 4.852
F  1.621 2.735
 ...
R  1.174 1.749
V  0.026 1.158
A  0.552 4.238
     SURFACE 6:
D  0.865 2.885
F  1.621 2.735
M  0.975 4.820
 ...
V  0.751 3.144
A  0.552 4.238
D  0.740 5.710

SURFACE LIPOPHILICITY ENTROPY   LIPS
      1.834      4.846    8.889
      1.729      4.852    8.389
      1.770      4.912    8.694
      1.777      4.746    8.435
      1.791      4.749    8.507
      1.815      4.885    8.865
      1.767      4.948    8.741