About
InMail requests: please be succinct.
Team builder, problem solver, coffee to code…
Articles by Jerene
Activity
-
No other database can do this today. Run your graph workloads, perform text and vector search without managing data silos. Leverage Spanner for your…
No other database can do this today. Run your graph workloads, perform text and vector search without managing data silos. Leverage Spanner for your…
Liked by Jerene Yang
-
One of my teammates asked me a pretty interesting question (you know who you are 😉): Is the bar for hiring different at OpenAI? Quantitatively…
One of my teammates asked me a pretty interesting question (you know who you are 😉): Is the bar for hiring different at OpenAI? Quantitatively…
Shared by Jerene Yang
Experience
Education
-
Carnegie Mellon University
Operating Systems, Parallel Systems and Architecture, Algorithms, Artificial Intelligence, Principles of Programming Languages
- Full Merit Based Scholarship from Infocomm Development Authority of Singapore (now GovTech)
- Graduated in 3 years with 2 degrees
- Graduated with both college and school honors
- Dean's Lists
- Senior Thesis with Turing Award Winner Professor Manuel Blum (Research won 1st place for Alcoa Undergraduate Research at the annual Meeting of the Minds)
-
Group Theory, Graph Theory, Number Theory, Real Analysis I & II, Numerical Analysis, Combinatorics
-
Bridge Club - 1st in Annual Interschools Championships (Team Captain), 4th in Pacific Asia Bridge Federation U21 category representing Singapore
Computer Science Club Chairman
GCE A Levels
Patents
-
DATA DRIVEN RELATIONAL ALGORITHM FORMATION FOR EXECUTION AGAINST BIG DATA
Issued US WO 2016029026 A1
Techniques are described herein for creating an algorithm for batch mode processing against big data. The techniques involve receiving one or more user commands from a set number of commands that correspond one-to-one with a set number of low-level database operations. In a preferred embodiment, the set of database operations includes only FILTERS, SORTS, AGREGGATES, and JOINS. In the algorithm formation process, database operations are performed on a sample population of records. The user…
Techniques are described herein for creating an algorithm for batch mode processing against big data. The techniques involve receiving one or more user commands from a set number of commands that correspond one-to-one with a set number of low-level database operations. In a preferred embodiment, the set of database operations includes only FILTERS, SORTS, AGREGGATES, and JOINS. In the algorithm formation process, database operations are performed on a sample population of records. The user drills down to a set of useful records by performing database operations against the results of the previous database operations. While the database cluster is receiving operations, the system is tracking the operations in a dependency graph. The chains selected within the dependency graph indicate which operations are used to create the algorithm. To generate the algorithm, the database cluster reverse engineers the logic for performing those operations against big data.
Other inventorsSee patent -
EXECUTING CONSTANT TIME RELATIONAL QUERIES AGAINST STRUCTURED AND SEMI-STRUCTURED DATA
Issued US WO 2016029018 A3
Techniques are described herein for performing database operations against location and access transparent metadata units called fat pointers organized into globally distributed data structures. The fat pointers are created by extracting values corresponding to a particular key and paring each value with a reference to the local location and server that has the native format record containing the value. The fat pointers may be transferred to any server in the cluster, even if the server is…
Techniques are described herein for performing database operations against location and access transparent metadata units called fat pointers organized into globally distributed data structures. The fat pointers are created by extracting values corresponding to a particular key and paring each value with a reference to the local location and server that has the native format record containing the value. The fat pointers may be transferred to any server in the cluster, even if the server is different from the server that has the native format record. In general, most operations are performed against fat pointers rather than the native format records. This allows the cluster to perform work against arbitrary types of data efficiently and in a constant amount of time despite the variable sizes and structures of records.
Other inventorsSee patent -
SYSTEMS, METHODS, AND INTERFACES FOR ADAPTIVE PERSISTENCE
Issued US 20140237147
A storage module may be configured to service I/O requests according to different persistence levels. The persistence level of an I/O request may relate to the storage resource(s) used to service the I/O request, the configuration of the storage resource(s), the storage mode of the resources, and so on. In some embodiments, a persistence level may relate to a cache mode of an I/O request. I/O requests pertaining to temporary or disposable data may be serviced using an ephemeral cache mode. An…
A storage module may be configured to service I/O requests according to different persistence levels. The persistence level of an I/O request may relate to the storage resource(s) used to service the I/O request, the configuration of the storage resource(s), the storage mode of the resources, and so on. In some embodiments, a persistence level may relate to a cache mode of an I/O request. I/O requests pertaining to temporary or disposable data may be serviced using an ephemeral cache mode. An ephemeral cache mode may comprise storing I/O request data in cache storage without writing the data through (or back) to primary storage. Ephemeral cache data may be transferred between hosts in response to virtual machine migration.
Other inventorsSee patent -
SYSTEMS AND METHODS FOR STORAGE VIRTUALIZATION
Issued US US 20140223096 A1
An I/O manager may be configured to service I/O requests pertaining to ephemeral data of a virtual machine using a storage device that is separate from and/or independent of a primary storage resource to which the I/O request is directed. Ephemeral data may be removed from ephemeral storage in response to a removal condition and/or trigger, such as a virtual machine reboot. The I/O manager may manage transfers of ephemeral virtual machine data in response to virtual machines migrating between…
An I/O manager may be configured to service I/O requests pertaining to ephemeral data of a virtual machine using a storage device that is separate from and/or independent of a primary storage resource to which the I/O request is directed. Ephemeral data may be removed from ephemeral storage in response to a removal condition and/or trigger, such as a virtual machine reboot. The I/O manager may manage transfers of ephemeral virtual machine data in response to virtual machines migrating between host computing devices. The I/O manager may be further configured to cache virtual machine data, and/or manage shared file data that is common to two or more virtual machines operating on a host computing device.
Other inventors
Courses
-
AI
15-381
-
Algorithms
15-451
-
Combinatorics
21-301
-
Number Theory
21-441
-
Numerical Methods
21-369
-
OS
15-410
-
Parallel Computer Architecture and Programming
15-418
-
Principals of Real Analysis II
21-356
-
Set Theory
21-329
Projects
-
Fusion io - ioVDI
Lead Engineer
- Developed Write Vectoring - Key feature in ioVDI
- Heavy involvement in all stages of the product: Idea conception, prototype design, product scoping, competitive analysis, key feature development, packaging, POC, UI and documentation
- 2 Approved Patents, 1 Pending
- 98% read offload, 80% write offload, 3X increase in VM density
- 300 fully functional Windows 7 ultimate VMs per 2U server
- GUI prototype
- CLI design -
Fusion io - Write Vectoring
Two of us wrote the patent pending write vectoring feature that enabled 500% increase in the number of server VMs running per physical machine.
-
CAPTCHA Breaker
As a side hobby, I used different techniques and parallel computation algorithms to break CAPTCHAs. Here are some of CAPTCHAs that I've broken:
1. Captcha.net
2. NuCaptcha
3. Phpcaptcha.org -
Graphical Numerical Inference: a.k.a. Brain Surgery for Excel
-
Awards: Won the ALCOA Undergraduate Research Award First Place at CMU's annual Meeting Of The Minds.
Mentor: Professor Manuel Blum
Excel's drag and auto-fill feature works for most simple numerical cases like addition. However, it fails when someone gives it a checkerboard pattern with 1s and 0s and tries to extend the pattern. Excel is unable to expand this obvious pattern because its entire inference is based on a static snapshot of the final data. Graphical Extrapolating…Awards: Won the ALCOA Undergraduate Research Award First Place at CMU's annual Meeting Of The Minds.
Mentor: Professor Manuel Blum
Excel's drag and auto-fill feature works for most simple numerical cases like addition. However, it fails when someone gives it a checkerboard pattern with 1s and 0s and tries to extend the pattern. Excel is unable to expand this obvious pattern because its entire inference is based on a static snapshot of the final data. Graphical Extrapolating Numerical Inferencer for Excel (GENIE), on the other hand, takes a dynamic approach by monitoring how the sequence is being filled. It will then try to figure out how the user is filling up the entries. After that, it picks up where the user has left off and fills in the rest of the entries.Other creators
Honors & Awards
-
Phi Beta Kappa Honor Society
CMU
Member
-
Undergraduate Research Award
ALCOA - CMU
Won first place during the CMU's 2012 Meeting of the Minds.
-
Dean's List
CMU
5 out of 6 Semesters.
-
MENSA
MENSA USA
Member
Languages
-
English
Native or bilingual proficiency
-
Chinese
Native or bilingual proficiency
-
Japanese
Limited working proficiency
-
French
Limited working proficiency
Recommendations received
2 people have recommended Jerene
Join now to viewMore activity by Jerene
-
Distributed counting is a challenging problem in computer science. Achieving convergence across millions of counters on multiple globally deployed…
Distributed counting is a challenging problem in computer science. Achieving convergence across millions of counters on multiple globally deployed…
Liked by Jerene Yang
-
I have a confession to make. . . . Sometimes, I like my own posts. 🥸 Because if I see 99 likes, you can bet your a$s I’m tapping that button to…
I have a confession to make. . . . Sometimes, I like my own posts. 🥸 Because if I see 99 likes, you can bet your a$s I’m tapping that button to…
Liked by Jerene Yang
-
Wrote some very rough thoughts on the topic: https://lnkd.in/gkHHRQGg
Wrote some very rough thoughts on the topic: https://lnkd.in/gkHHRQGg
Liked by Jerene Yang
Other similar profiles
Explore collaborative articles
We’re unlocking community knowledge in a new way. Experts add insights directly into each article, started with the help of AI.
Explore More