FAQs for ChemSpider
The ChemSpider service
has been online since March 24th 2007 but we allowed beta-testers access to the system in January 2007. The FAQs below reflect on
some of the most common questions they asked as well as some of the questions that we are expecting to be asked. If you have a question
for us please send us your comments to our general inquiries email address at ChemSpider Feedback.
ALL ABOUT CHEMSPIDER
What is ChemSpider?
ChemSpider is a chemistry
search engine. It has been built with the intention of aggregating and indexing chemical structures and their associated information
into a single searchable repository and make it available to everybody, at no charge. ChemSpider is a value-added offering of publicly available chemical
structures since many additional properties have been added to each of the chemical structures. We intend ChemSpider to offer the
fastest chemical structure searches available online and delivered with the flexibility and usability necessary to encourage repeat
usage.
Why did you name this service
ChemSpider?
Anybody setting up a
website with an intention of garnering traffic will tell you that a memorable moniker for the website does help. The ChemSpider domain
name contains labels connecting the Chemistry nature of the website with the future “spidering” aspects of the site –
connecting across the web to bring together disparate chemistry-related resources.
What is the ChemSpider logo based
on?
The ChemSpider
logo represents the chemical structure of (1Z,3Z,5Z,7Z)-1,2,3,4,5,6,7,8-octaethylcycloocta-1,3,5,7-tetraene on a web-based background. It represents our ability to perform chemical structure searches across the world wide
web.
Why are you providing this service
for free?
ChemSpider
is provided as a public service. The possibility to index and search all publicly available chemical structures via a website was
originally conceived by our team about 5 years ago. At that time there were many technology hurdles, little time and no commercial
reasons to pursue the project. Since that time technologies have matured and this project is presently undertaken as a hobby project
with the intention of pushing the technologies and having an impact on the chemistry community at large. This project is presently
self-funded.
Will this service stay free also
in the future?
At present there is
no intention to turn ChemSpider into a “for-charge” service. While we may seek sponsorship or advertising revenues in
the future our present focus is on improving our technologies and developing a following for ChemSpider.
Is ChemSpider the
first application of its kind?
ChemSpider is a software
technology utilized to perform structure and text-based queries of a very large dataset of millions of chemical structures and associated
data. ChemSpider is not the first application to allow this.
In terms of the gold
standard in curated data CAS remains the premier offering.
The CAS registry is the largest current database of chemical substance information in the world and contains
more than 30 million organic and inorganic substances and 58 million sequences. You are referred to the CAS website should you be
unaware of the CAS registry and services delivered by the organization. The need for this reference is, of course, unlikely.
The PubChem database provides information on the biological activities of small
molecules and is a component of NIH's
Molecular Libraries Roadmap Initiative. The Pubchem database presently contains close to 6 million compounds and is the result of combining data from a
group of contributors.
eMolecules provides an indexing and search
service to help find information about chemicals on web sites across the world wide web. eMolecules delivers chemical data
aggregated by searching the internet, and from submissions from data providers such as chemical suppliers and academic researchers.
While ChemSpider is
not the first application to allow free access to search a database it does offer a series of unique capabilities. The database is
the largest online database available at the time of writing (over 10 million compounds). The system offers the greatest flexibility
in structure based searching including exact structure, substructure and soon will include similarity of structure including matching stereochemistry
and isotopes. These capabilities are enabled utilizing a structure drawing applet. ChemSpider also offers access to a
series of calculation services utilizing a transaction based system.
Is ChemSpider selling a software
package ?
ChemSpider is a web-based
application delivering access to a database of publicly available chemical structures enhanced with related data generated by a series
of prediction engines. Neither ChemSpider nor the associated database are part of a software package and will not be made available
for sale.
BEHIND THE WEB OF CHEMSPIDER
A) Database Content
Where are you sourcing your content
databases?
Our database is built
up from an aggregate of data sourced from a series of publicly available data sources and from our collaborators. The
list of data providers is available online. This
list will continue to expand and will be updated on a regular basis. ChemSpider also has the ability to accept data submitted by our
users. In this way we are offering the opportunity to develop an Open Chemistry Community of users.
I have collected set of structures
of and want to make this available via ChemSpider. How do I do it?
The ChemSpider website
offers to any chemists the ability to add to the database content. In order to submit the data through the clearance process simply
use the outlined process and forms.
B) Online Services
Why are some of the properties
not calculated for some of my molecules?
The algorithms utilized
at the online services page are provided by third parties. All algorithms have specific limitations and predictions cannot be performed
in all cases. We apologize for this and welcome your feedback when you identify issues with the online services. We cannot guarantee
any resolution.
Please explain how the Simple Search Screen works for text searching?
The simple search screen allows searching based on Systematic Name, Synonym, Trade Name, SMILES string or InChI string. The search works by first searching on the synonym. if nothing is found then it converts the synonym to an InChI string if possible (using ACD/Labs Name to Structure conversion software). If nothing is found using the InChI as the basis of the search then a search is performed using the originally input text to search as a substring of a synonym.
How good are your logP value
calculations relative to others?
The logP calculations
performed online at ChemSpider are performed using a number of different logP predictors. ACD/Labs, one of the algorithm providers, have reported a number of
comparisons of performance themselves and
there have been a series of reviews
reporting similar comparisons.
Why do you only allow fairly
simple molecules as inputs to your nomenclature generation program?
The online transaction
server utilized to generate the standardized nomenclature is described at the ACD/Labs
website. The Free Naming is limited to molecules containing no more than 50 atoms, and no more than 3 rings, with
atoms from among only H, C, N, P, O, S, F, Cl, Br, I, Li, Na, and K.
Which other interactive services
do you plan to install at your site ?
At present we do not have any other services awaiting installation.
If you have services that you would like to suggest that we integrate with, or have your own services that you would like to connect
into ChemSpider please contact us at feedbackATchemspiderDOTcom.
C) The Computing Behind ChemSpider
Will the users have access to
the source code or ChemSpider database?
While ChemSpider is
provided as a public service it is not an Open Source development. Neither the source code nor the aggregate database will be made
available for download. Individual molecules or small hit lists can be downloaded from the database. Certain predicted properties
may be excluded from the download.
What programming language(s)
are you using?
ChemSpider is developed
using a series of components and programming languages. Its core is written as highly optimized C/C++. The database engine
is Microsoft SQL Server and the Web user interface is programmed using state of the art ASP.NET and C#. Various server-side scripting is done in Perl.
What is the software and hardware platforms?
ChemSpider is hosted on a x86 server-class PC running Windows 2003 Server
RIGHTS TO USE
Am I allowed to use the data
I find on ChemSpider in a publication ?
Yes, all data displayed
onscreen at ChemSpider can be used in a publication.
May I use your service in my
teaching class ?
Absolutely. We would
especially like the academic community to benefit from the information available on ChemSpider.
May I download the data and use it in my own database(s)?
You have limited rights in this regard. You can only assemble a database of 5000 structures or less, and their associated properties, from our database without our permission. You can download up to 1000 structures per day from the website. Please contact us at feedbackATchemspiderDOTcom to request an extension outside this constraint. We are willing to provide the ENTIRE database of ChemSpider structures at your request - the file will consist of InChI Strings, InChIKeys and ChemSpider IDs. These constraints are under regular review so please feel free to engage us in conversation.
CHEMICAL STRUCTURE REPRESENTATION
AND SEARCH ENGINE
Why does the structure drawing
applet load very quickly the second time I use the ChemSpider website?
The Structure Drawing
Applet (SDA) is cached the first time you use the website and applet. Provided that you do not clear the cache the SDA will load very
quickly for subsequent searches.
Why isn’t the ChemDraw
Plug-in supported yet?
This is simply a matter
of time, effort and requests. Should our user base determine that the SDA is insufficient to support their needs and request support
for other structure drawing editors we will consider adding support for other applications. Please send your requests to
feedbackATchemspiderDOTcom.
Can I perform chemical structure and substructure
searches directly from the SDA applet?
Yes, substructure searches
can be performed directly from the SDA applet. Please refer to the SDA manual
for how to draw the appropriate substructures for searching.
Our company policy does not allow
Java. How can I input molecules for searching into ChemSpider?
ChemSpider has been
integrated to ACD/ChemSketch PC-based software, both the ChemSketch Freeware
and the Commercial ChemSketch versions. ACD/ChemSketch can be
run on Macintosh and Linux systems under an appropriate emulator.
You allow submission of SMILES, InChI strings and InChIKeys for structure searching. What are SMILES and InChI?
SMILES
is a simple yet comprehensive chemical language in which molecules and reactions can be specified using ASCII characters representing
atom and bond symbols. A SMILES string is human readable,
is very compact, and if canonicalized produces a unique string that can be used as a universal identifier for a specific chemical
structure. For further details the reader is referenced to the excellent tutorial on the
Daylight website.
The IUPAC International
Chemical Identifier (InChI) is a non-proprietary identifier for chemical substances. It
was developed in a collaboration between IUPAC and NIST with the intention of enabling easier linking of diverse data compilations.
The InChI source code and application programming interface is made available at no charge to the public. The reader is referred to
the project pages for ongoing status reports.
PROVIDING FEEDBACK TO CHEMSPIDER
How do I report bugs to ChemSpider
developers?
Please send all bug
reports at bugs@chemspider.com. We will do our utmost to respond to you quickly but please note that
registered users do receive preferred treatment.
How do I submit new feature requests
for the ChemSpider service?
Please send all new
feature requests to feedback@chemspider.com. We will do our utmost to respond to you
quickly but please note that registered users do receive preferred treatment. Our receipt of new feature requests is no indication
of our commitment to include your request in our future developments.