Data Management for Combinatorial Chemistry
Maryjo Zaborowski
Chiron Corporation
4560 Horton Street
Emeryville, CA 94608 USA
Tel: +1-510-601-3542
Sheila H. DeWitt
Parke-Davis Pharmaceutical Research
Division of Warner-Lambert Company
2800 Plymouth Road
Ann Arbor, Michigan 48105 USA
http://www.netsci.org/Science/Combichem/feature13.html
Introduction
Combinatorial chemistry and high throughput synthesis techniques have done more than change the field of chemistry as applied to drug discovery. These new synthetic technologies have introduced dramatically different chemical and biological information-handling requirements. With the advent of combinatorial chemistry and high throughput synthesis, many companies are now reporting 10- to 50-fold increases in the number of compounds made and tested annually. At most companies, existing information technology and business operations, such as compound registration practices and assay result visualization, cannot accommodate this increase.
In order to gauge the current use of combinatorial techniques and the information systems which manage results from these efforts, a questionnaire was distributed to chemical information specialists from several companies. Replies were received from seventeen companies which ranged from small biotechnology groups to multi-national pharmaceutical companies. The questionnaire and results are provide and the results are summarized.
Experience and Direction - Status of Synthetic Effort
While there has been a significant amount writen about combinatorial approaches to chemistry, the area is still very immature. Of the companies surveyed, only three reported "long-standing" combinatorial synthesis and/or high throughput synthesis programs (> 2 years). Six companies reported medium-length (1-2 years) programs. Four companies reported young programs (6 months - 1 year). The remaining companies were initiating programs and/or purchasing libraries from outside sources. Of the three companies with >2 years experience, two companies practice combinatorial synthesis only. The third company emphasizes combinatorial mixtures but also pursues high throughput synthesis.
Combinatorial chemistry is not used exclusively for lead generation. Each company with established groups reported programs in both chemistry and high throughput synthesis, although larger resources were reported to be allocated to combinatorial chemistry. Companies with less than a year of experience reported emphasis on high throughput synthesis.
Synthesis Emphasis - Types of Chemistry(s) Employed
Seven companies reported the use of solid phase organic synthesis (SPOS) for novel, non-oligomeric compounds. Most of these companies reported a library inventory of 50-100 combinatorial libraries (non-deconvolution mixtures).
Nine companies reported exclusive use of solution phase chemistry. Seven (including some of the above category) indicated some creation of peptoid and/or peptide libraries.
Number of Compounds per Mixture
Six companies routinely prepare mixtures of >100 compounds/pool. Each of these companies also reported the use of SPOS chemistry. Three of these companies reported having combinatorial efforts exclusively. Five companies reported the routine preparation of mixtures of 10-100 compounds/pool. Two companies routinely prepare mixtures of 1-10 compounds pool.
Information Systems/Access to Data
All companies reported having traditional chemical registration systems for registering discrete compounds. However, only six companies reported having an information system for the registration and storage of combinatorial chemistry products. Of those six, three reported having Library Design software. An interesting point to come out of this survey was that only five companies claimed to be using combinatorial chemistry assay data effectively.
Ten companies reported using relational databases (e.g. Oracle) to store assay results and related library information. Six companies reported no central storage of assay data; information is saved in Excel. One company stated that they use their chemistry software to store assay data.
Tracking the synthesis, testing and heritage of deconvolution products was reported as the second most pressing information problem after registration. This answer was common to companies with >1 year experience with combinatorial chemistry and companies with combinatorial chemistry information systems.
Company Information
1. Company Classification
2: Biotech
14: Pharmaceutical
1: Chemical
2. Synthesis Direction
5: Combinatorial mixtures only
0: High throughput synthesis only
6: Emphasis on combinatorial mixtures, but high
throughput synthesis is pursued.
5: Emphasis on high throughput synthesis, but
combinatorial mixtures are pursued.
3. Purpose of Combinatorial Chemistry Effort
7: Lead generation
2: Lead optimization
8: Combinatorial mixtures for lead generation and high
throughput synthesis for lead optimization.
4. Emphasis of Synthetic Efforts
9: Use of known solution phase chemistry
7: Use of solid phase chemistry (e.g. peptides,
peptoids)
7: Development of solid phase organic synthesis (SPOS)
for novel, non-oligomeric compounds
5. Location of Efforts
10: Central laboratory
8: Decentralized throughout chemistry group
6. Status of Synthesis Effort
3: Not yet operational
4: 1 year
6: 1-2 years
3: >2 years
7. Total Number of Combinatorial Mixtures Generated to Date
3: 0
2: 1-10
3: 10-100
8: >100
8. Total Number of Compounds in EACH Combinatorial Mixture
2: 1-10
5: 10-100
6: >100
9. Total Number of Discrete Compounds (generated to date by high throughput or parallel synthesis methods)
2: 0
4: 1-100
4: 100-1000
6: >1000
10. Combinatorial Chemistry Project Composition
5: Stand-alone group, chemists only - one supervisor
1: Stand-along group, multi-disciplinary (includes, for example,
chemists, modelers, engineers) - one supervisor
0: Collaborative effort, chemists only - multiple supervisors
3: Collaborative effort, multi-disciplinary BUT dedicated to the
effort - multiple supervisors
7: Collaborative effort, multi-disciplinary BUT each pat is NOT
solely dedicated to the effort - multiple supervisors.
Combinatorial Chemistry System Development
11. Information System Presently in Place
4: None
6: Chemical planning and synthesis
11: Structure registration
10: Data registration
5: Library design
3: Deconvolution of mixtures
9: Assay result storage
12. Accessibility of Information
2: Local, not shared among labs
2: Local, shared via central system
11: Central, host-based
13. Vision for Compound Registration System
7: Separate systems for combinatorial chemistry and traditional
chemistry
9: Central system for both combinatorial and traditional
chemistries
14. Where are Assay Results Stored?
1: Within registration system
6: Within desktop programs (e.g. Excel)
10: Within relational databases on host
15. Primary Users of Current System
14: Chemists
2: Information scientists
5: Biologists
16. Development Philosophy
6: Utilize vendor system
3: Internal development - best solution for competitive advantage
6: Internal development - needs not met by vendor systems
17. Development of Information System Driven By:
1: Registration group
8: Chemists
0: Modeling group
2: Programming group
18. Preferred System Architecture
1: Single system - single interface
8: Multiple systems - single interface
3: Multiple systems - multiple interfaces
19. Individuals Involved in Systems Development for Combinatorial Chemistry
6: 0
8: 1-5
1: 5-10
1: >10
20. Have You Investigated Multiple Vendor Registration Systems?
5: Yes
10: No
21. Which ones?
MDL, Daylight, Tripos, Chemical Design
22. Have You Investigated Multiple Vendor Systems for Library Design, Assay Result Handling, etc?
1: Yes
11: No
23. Important Features of Vendor Systems
10: Flexibility
7: Customization potential
5: Most built-in features
4: No internal development or support needed
Information Handling Issues
24. Most Pressing Concerns
10: Registration of large numbers of compounds
8: Tracking and deconvolution of mixtures
8: Association of assay results with mixtures
5: Association of assay results with discrete compounds
1: Reduction of raw assay data
2: User compliance
25. Is Your Combinatorial Chemistry Data Used Effectively?
5: Yes
7: No
26. Which Tools Are/Would Be Most Useful?
Cluster/trend analysis
+/- hit discrimination
Electronic notebook
27. Storage of Negative Data
13: Useful
1: Not useful
28. Accuracy of Negative Data from Combinatorial Chemistry Testing
1: Assume full accuracy
11: Assume some "likelihood"
1: Assume no accuracy
Information Technology Staff
29. With the changing synthetic technology and improving information technology, is the skill requirement for IT people increasing dramatically?
4: Yes
7: Probably
4: No
30. What skills are necessary for IT personnel to support a combinatorial chemistry/high throughput synthesis effort?
Chemistry and computers, mathematics, statistics, C
programming, Oracle database management, knowledge of coming
information technology
31. Desired skills
client-server tools
NetSci, ISSN 1092-7360, is published by Network Science Corporation. Except where expressly stated, content at this site is copyright (© 1995 - 2010) by Network Science Corporation and is for your personal use only. No redistribution is allowed without written permission from Network Science Corporation. This web site is managed by:
- Network Science Corporation
- 4411 Connecticut Avenue NW, STE 514
- Washington, DC 20008
- Tel: (828) 817-9811
- E-mail: TheEditors@netsci.org
- Website Hosted by Total Choice