Job Description
Dice is the leading career destination for tech experts at every stage of their careers. Our client, AOA Software Solutions, LLC, is seeking the following. Apply via Dice today!
Job Role: DataDog Systems Engineer
Job Location: Reston VA or Washington DC (Remote)
Immediate long-term contract opportunity! We are seeking a DataDog Systems Engineer to support our customer's systems monitoring initiatives as a backend administrator/engineer of the tool, along with several upcoming migration efforts to make DataDog the primary monitoring tool enterprise-wide (on-prem and off). The selected candidate will be responsible for administering the software tools used for systems and applications monitoring. Expertise with DataDog as a backend administrator/engineer is required (this is a backend support role; we are not seeking frontend users of the tool). We are seeking candidates who have experience implementing DataDog from scratch, along with a basic understanding of Java. Responsibilities include:
* Administers DataDog on the Linux platform to instrument Java-based applications running on the Tomcat application server. Manages, configures, and maintains the DataDog tool on the Linux platform.
* Responsible for network monitoring, infrastructure/server monitoring (Linux, Windows, AIX), application monitoring, SNMP monitoring, and log monitoring using DataDog. Requires configuration experience in infrastructure monitoring, network monitoring, and centralized logging, or similar administration experience with the ELK Stack – Elasticsearch (search and analytics engine), Logstash (ingest pipeline), and Kibana (visualization and creating dashboards).
* Configures centralized logging of all logs from different sources, such as WebSphere/Tomcat and IHS web servers on AIX servers, to DataDog on Linux. Requires knowledge of load balancers such as F5 to route logs to the log server. Handles different types of log formats.
* Creates required dashboards with data visualization in DataDog.
* Manages, configures and maintains the DataDog APM tool on Linux platform.
* Responsible for instrumenting Java applications with DataDog, setting up health rules, and fine-tuning monitoring in DataDog.
* Sets up End User Monitoring / Browser Real User Monitoring in DataDog for applications, using JavaScript injection.
* Provides support for all significant production issues.
Although this position is remote, our client does have onsite meetings (Reston VA or Washington DC) 1-2 times per month; the selected candidate must be able to attend these meetings.
Required Skills
* Bachelor of Science in Computer Science or a related field (e.g., Engineering, Applied Science, Math) or equivalent experience.
* Seeking 8+ years of hands-on IT engineering experience, with a minimum of 3 years of hands-on experience installing, integrating, managing, and maintaining the DataDog monitoring tool (backend administration support role, not frontend user of the tool).
* Strong knowledge of Java is required. Tomcat web servers highly preferred.
* Strong Linux platform (Red Hat) background. Understanding of SSL setup on Linux servers (installing CA certificates, etc.) is highly preferred.
* Log Management experience with ELK Stack – Elasticsearch (search and analytics engine), Logstash (ingest pipeline), and Kibana (visualization and creating dashboards)
* Experience with network monitoring and knowledge of network components such as switches, routers, Palo Alto network utilization via SNMP, F5 load balancers, WebSEAL, Infoblox, Gigamon, and network mapping is a plus.
* Automation experience with scripting (Python, Shell, Ansible) preferred.
* Any AWS cloud knowledge is a big plus.
Additional Comments:
* one submission slot
* H1B is acceptable
* Mostly remote, with some visits to the Washington DC and Reston VA offices as needed. There is no set schedule for onsite visits; they depend on situations that arise, so the candidate just needs to be open to them. DMV is preferred, but the client is open to candidates in surrounding states who can be onsite with 1-2 days' notice.
* They are looking to consolidate all monitoring and log management tools into DataDog (Nagios, AppDynamics, and SolarWinds to DataDog). The goal is to migrate everything into DataDog.
Must-Have:
* DataDog (backend administration, not frontend user). Log management experience specifically would be ideal, or else application monitoring.
* Monitoring of Java applications via DataDog
* Linux OS (DataDog runs on Linux)
* For DataDog, candidates with a background in network monitoring, cloud management, log management, and application monitoring are preferred (the infrastructure module is already covered).
* SAFe/Agile methodology background