Find Jobs
Hire Freelancers

Write a Python script that will parse PubChem to download all chemicals with given properties and run this script

$30-250 USD

Cerrado
Publicado hace casi 7 años

$30-250 USD

Pagado a la entrega
There is a public website with all chemical compunds call PubChem: [login to view URL] We need to download information about all molecules with less than 11 atom. It can be done in the following way: 1. Use advanced search available on the website: [login to view URL] and search for the following string: ((0:10[HeavyAtomCount]) AND 0:0[TotalFormalCharge]) AND 0:0[IsotopeAtomCount] It will return the list of all compunds with less than 10 heavy atoms, but some of them are ionic compunds not molecules and some contain more than 10 atoms. 2. We need to sort the results by complexity 3. Then we need to check all the results and use two filters: Filter A: remove compounds with more than 10 atoms in Molecular Formula Filter B: remove compunds that contain a dot sign (".") in Canonical SMILES 4. All the components that are not removed by those filters should be collected in CSV text file that contains the following columns: * PubChem CID * Molecular Formula * Canonical SMILES * Molecular Weight * Chemical Names * IUPAC Name * If 2D structure XML file is presented (yes/no) * If 3D structure XML file is presented (yes/no) 5. For each compound that match our filters we should also download it 2D and 3D structures as XML files and place them in two folders. File names should be like "[login to view URL]" and "[login to view URL]" where 101826982 is PubChem CID of this compound The results: The results of this project should be A. A ZIP archive with many xml files with 2D and 3D structures of the and one [login to view URL] file. B. Python script(s) that generates this CSV file and download XML files Deadline for this project: August 24th, 2017, 13:00 London time ========================== For your information: PubChem supports API that makes this project much easier: REST Tutorial: [login to view URL] REST Documentation: [login to view URL] Other API documentation: [login to view URL] List of properties: [login to view URL] Example how to download needed properties of several substances: [login to view URL],129251212,5460638,5460696/property/MolecularFormula,MolecularWeight,CanonicalSMILES,Complexity,Charge,HeavyAtomCount,IsotopeAtomCount/XML Python wrapper for PubChem: [login to view URL]
ID del proyecto: 14952675

Información sobre el proyecto

18 propuestas
Proyecto remoto
Activo hace 7 años

¿Buscas ganar dinero?

Beneficios de presentar ofertas en Freelancer

Fija tu plazo y presupuesto
Cobra por tu trabajo
Describe tu propuesta
Es gratis registrarse y presentar ofertas en los trabajos
18 freelancers están ofertando un promedio de $200 USD por este trabajo
Avatar del usuario
First of all thank you for excellent description! I can create Python scraper and collect all data you want (including 2D and 3D files) in less than 3 days. Thanks. Roman Relevant Skills and Experience I Python developer and one of the best web scraping experts here that's why I'm sure you'll be impressed with my work. I'll create Scrapy spider which will collect all information you want. Proposed Milestones $170 USD - Python script + data
$170 USD en 3 días
4,9 (404 comentarios)
7,7
7,7
Avatar del usuario
We are experts in software development, worked in companies like Adobe, Dell etc. Java, PHP, Python, HTML, CSS, Javascript, Selenium with Python and Java, Web Development and Web Design, Web Scraping Relevant Skills and Experience We are experts in software development, worked in companies like Adobe, Dell etc. Java, PHP, Python, HTML, CSS, Javascript, Selenium with Python and Java, Web Development and Web Design, Web Scraping Proposed Milestones $155 USD - lets chat
$155 USD en 3 días
4,9 (29 comentarios)
6,2
6,2
Avatar del usuario
Hello Client, Hope you are doing well ! I have great experience of extracting information from websites . I provide best solutions at fastest speed with the cheapest cost. Your satisfaction is my only priority. I would like to take a chance to show my experience.I have more than 8 years of experience in Scraping,Crawling and Scraping tools,: Python,Data Processing,Data Entry,Excel,,Web Search,Data Mining,Data Science,lead collection.I am a highly skilled provider seeking an opportunity to leverage my expertise and demonstrate my high level of technical an administrative skills,I have completed 10to20+ projects on my up work profile in Scraping,Crawling and Scraping tools,: Python,Data Processing,Data Entry,Excel,,Web Search,Data Mining,Data Science,lead collection. I can achieve the results that you are asking for. Clients willing to have long term project relationship are most welcome Thanks.
$30 USD en 0 día
4,9 (7 comentarios)
5,2
5,2
Avatar del usuario
I read your project brief. I can do your project by using PubChemPy wrapper of Python to search for chemicals on PubChem according to the criteria you specified and deliver a CSV file with molecular data. Relevant Skills and Experience I'm an experienced Python developer. I've developed several command-line tools in Python. I also have experience in other programming languages such as PHP, JavaScript etc. Feel free to contact me if you would like to ask any question or want some clarifications. My timezone is GMT +05:00. Proposed Milestones $90 USD - 50% Project Completion $90 USD - 100% Project Completion
$180 USD en 5 días
4,7 (17 comentarios)
4,6
4,6
Avatar del usuario
Search Pub chem for 10 atom compounds. Filter down the results based on the specified criteria. convert to csv. Relevant Skills and Experience Python Web Automation Web Services Chemistry Software Architecture Algorithm Proposed Milestones $155 USD - csv and py script
$155 USD en 3 días
4,3 (12 comentarios)
3,9
3,9
Avatar del usuario
Hi, I'm a professional software engineer with 4 years of experience in Python, Java, Scala. I can help you with the download of molecular data.
$110 USD en 3 días
4,7 (5 comentarios)
2,8
2,8
Avatar del usuario
Yes, I am new here, but we have been working on Python,Django,Web Crawling/Data Scraping for last 7 years. Relevant Skills and Experience We have used Flask and iFrame to achieve the desired results on Python 2 & 3. Proposed Milestones $977 USD - End I saw your project details and found interesting. Can very well execute this with your support.
$977 USD en 3 días
4,6 (5 comentarios)
2,2
2,2
Avatar del usuario
Hi, I have a web scraping history with python. I fully undestood your userstories and I also had a look API for it. I can provide you that you want.
$150 USD en 2 días
4,5 (2 comentarios)
1,6
1,6
Avatar del usuario
A proposal has not yet been provided
$110 USD en 1 día
5,0 (3 comentarios)
1,4
1,4
Avatar del usuario
Hello, I have over 4 years of professional python experience. Let me help you with the implementation of your python tool. Relevant Skills and Experience Over 4 years of professional python programming experience. Experience with the REST API and python data management packages (e.g., pandas which can be used to nicely filter the acquired data). Proposed Milestones $100 USD - Delivery of the python script.
$88 USD en 5 días
5,0 (1 comentario)
0,6
0,6
Avatar del usuario
Hello. We were carefully reviewing the requirements of the job description, so our developers can work on your project without delay. We have years of working on projects related on any available CMS, from "scratch" with core php and php-frameworks(Yii/Yii2, Laravel, CodeIgniter), JavaScript, jQuery, AJAX, HTML5, CSS3, Bootstrap, javascript-frameworks, 3d desidg, graphic design etc. However, I shall discuss about the requirements and functionalities in details to have a better understanding about time frame and price. We are glad to chat with You and discuss all in details. Contact us and we will reply immediately. Waiting for Your reply! Best regards.
$257 USD en 10 días
0,0 (0 comentarios)
0,0
0,0
Avatar del usuario
Hi, I have gone check your projects descriptions so I can do it perfectly. I have 5 years+ experience in Software development, Software architecture, Python, Java. I have completed lot of projects in Python which is related to your projects descriptions. My Skills: Software development, Software architecture, Asterisk PBX, Java, Python, C programming, .Net, OCR, HTML, Meta traders, Engineering. Further details will be discuss over chat Thanks.
$30 USD en 3 días
0,0 (0 comentarios)
0,0
0,0
Avatar del usuario
Hi, I hope you have not granted this project to someone else :) I have a script ready that does the followings: 1. get list of cids that match your search criteria 2. pull the required properties for all cids 3. stores all the info in local sqlite db so that future runs will be a lot faster as we will only need to pull properties of new molecules. 4. extracts from local db all molecules that match given filters and saves them in csv file. 5. what is left is pulling 2D/3D xml which should be doable in 1 day. Hope to hear from you soon. Best Regards, -Ray-
$165 USD en 2 días
0,0 (0 comentarios)
0,0
0,0
Avatar del usuario
Parse compounds from PubChem website, filter and scrape the results to extract desired information, to be delivered in .zip and .csv files, with specific naming scheme. PubChem's APIs are available. Relevant Skills and Experience I'm a Python expert, with a lot of experience in parsing and scrapping data from webpages, and a fast learner, to interact with PubChem's APIs. Proposed Milestones $111 USD - Sample files for validation $111 USD - Complete dataset
$222 USD en 3 días
0,0 (0 comentarios)
0,0
0,0

Sobre este cliente

Bandera de UNITED KINGDOM
Chalfnt St Giles, United Kingdom
5,0
139
Forma de pago verificada
Miembro desde ene 13, 2017

Verificación del cliente

¡Gracias! Te hemos enviado un enlace para reclamar tu crédito gratuito.
Algo salió mal al enviar tu correo electrónico. Por favor, intenta de nuevo.
Usuarios registrados Total de empleos publicados
Freelancer ® is a registered Trademark of Freelancer Technology Pty Limited (ACN 142 189 759)
Copyright © 2024 Freelancer Technology Pty Limited (ACN 142 189 759)
Cargando visualización previa
Permiso concedido para Geolocalización.
Tu sesión de acceso ha expirado y has sido desconectado. Por favor, inica sesión nuevamente.