Responsibilities
TikTok is the leading destination for short-form mobile video. Our mission is to inspire creativity and bring joy. TikTok has global offices including Los Angeles, New York, London, Paris, Berlin, Dubai, Singapore, Jakarta, Seoul and Tokyo.
Why Join Us
Creation is the core of TikTok's purpose. Our platform is built to help imaginations thrive. Together, we inspire creativity and bring joy - a mission we all believe in and aim towards achieving every day. To us, every challenge, no matter how difficult, is an opportunity; to learn, to innovate, and to grow as one team. At TikTok, we create together and grow together. That's how we drive impact - for ourselves, our company, and the communities we serve.
What You’ll Be Doing
- Build a unified data storage format and query engine in different scenarios (high availability/high throughput, large volume/sequential or random access).
- Build an efficient system for model parameter management, sharding, and deduplication for LLMs.
- Develop multi-level/hierarchical storage architecture, not limited to HBM/DDR/disk.
- Optimize the training system for availability and fault tolerance; improve the data consistency, and capacity of the system.
- Research and implement state of the art indexing/storage structures for machine learning on latest hardware.
Qualifications
- Proficient in the use of C++/Python in the Linux environment.
- Proficient in the design, development, maintenance and continuous optimization of large-scale distributed systems, and be able to identify potential problems in complex systems.
- Have participated in optimizations for parameter-server-like systems, or indexing structure of query engines; or have experience in using/optimizing large-scale distributed storage systems such as HDFS and PFS.
- Strong communication skills and develop new solutions based on issues that arise.
Bonus
- Understand open source storage/engine projects such as Redis, RocksDB, Presto, etc.; understand common Machine learning file storage formats such as parquet, TFRecord, IndexRecordIO, etc.
- Familiar with one of the machine learning frameworks (TensorFlow/PyTorch/Jax).
- Have experience in one of the following fields: database systems, distributed storage, AI infrastructure, HW/SW co-design, High performance computing, ML hardware architecture (GPU, accelerators, networking), machine learning frameworks, operating systems.
- ACM/OI competitive programming experiences.
Job Information:
Compensation Description (annually)
The base salary range for this position in the selected city is $137750 - $237500 annually. Compensation may vary outside of this range depending on a number of factors, including a candidate’s qualifications, skills, competencies and experience, and location. Base pay is one part of the Total Package that is provided to compensate and recognize employees for their work, and this role may be eligible for additional discretionary bonuses/incentives, and restricted stock units.
Our company benefits are designed to convey company culture and values, to create an efficient and inspiring work environment, and to support our employees to give their best in both work and life. We offer the following benefits to eligible employees:
We cover 100% premium coverage for employee medical insurance, approximately 75% premium coverage for dependents and offer a Health Savings Account(HSA) with a company match. As well as Dental, Vision, Short/Long term Disability, Basic Life, Voluntary Life and AD&D insurance plans. In addition to Flexible Spending Account(FSA) Options like Health Care, Limited Purpose and Dependent Care.
Our time off and leave plans are: 10 paid holidays per year plus 17 days of Paid Personal Time Off (PPTO) (prorated upon hire and increased by tenure) and 10 paid sick days per year as well as 12 weeks of paid Parental leave and 8 weeks of paid Supplemental Disability.
We also provide generous benefits like mental and emotional health benefits through our EAP and Lyra. A 401K company match, gym and cellphone service reimbursements. The Company reserves the right to modify or change these benefits programs at any time, with or without notice.
#J-18808-LjbffrSimilar Jobs
- View Job
Software Engineer, Storage
Seattle - View Job
Senior Software Engineer, Storage
Seattle - View Job
Software Engineer, Distributed Storage
Seattle - View Job
Software Dev Engineer - Embedded, Runtime, Storage, System & Performance , Annapurna Labs
Seattle - View Job
Software Dev Engineer - Embedded, Runtime, Storage, System & Performance , Annapurna Labs
Seattle