- What types of data structures are routinely used for in-memory real time transaction scoring? I've used doubly circular linked lists to store (say) 20 most recen transactions with time stamp and other attributes, per merchant / per customer.
- What kind of metrics work well in this context? Amang many metrics, I've used last transaction or time to 5-th previous transaction.
- Do you use a lot of rather small lookup tables that you can upload in memory, to store historical data, such as merchant summary statistics broken down per day, for the last 3 months (one entry per merchant per day)?
- How do you optimize server performance? For instance, at 2am, when the volume of transactions is 5 times lower than at peak time, do you use the analytic servers for other tasks, such as end-of-day re-scoring?
- At peak time (severe peaks), do you use a simplified model that requires less memory, if you lack bandwidth?
- Have anybody used the Hadoop environment to feed into a true real time processing system (that is, with no latency), such as credit card processing?
- For data science ROI to be positive, should advanced analytics / data science costs (in terms of people, extra hardware and software) represent less than 10% of the cost of general computer architecture (servers, engineers, basic data processing and reporting)? Is there a magic number, and if it is not 10%, what would it be?

Tags:

© 2019 Data Science Central ® Powered by

Badges | Report an Issue | Privacy Policy | Terms of Service

**Most Popular Content on DSC**

To not miss this type of content in the future, subscribe to our newsletter.

- Book: Statistics -- New Foundations, Toolbox, and Machine Learning Recipes
- Book: Classification and Regression In a Weekend - With Python
- Book: Applied Stochastic Processes
- Long-range Correlations in Time Series: Modeling, Testing, Case Study
- How to Automatically Determine the Number of Clusters in your Data
- New Machine Learning Cheat Sheet | Old one
- Confidence Intervals Without Pain - With Resampling
- Advanced Machine Learning with Basic Excel
- New Perspectives on Statistical Distributions and Deep Learning
- Fascinating New Results in the Theory of Randomness
- Fast Combinatorial Feature Selection

**Other popular resources**

- Comprehensive Repository of Data Science and ML Resources
- Statistical Concepts Explained in Simple English
- Machine Learning Concepts Explained in One Picture
- 100 Data Science Interview Questions and Answers
- Cheat Sheets | Curated Articles | Search | Jobs | Courses
- Post a Blog | Forum Questions | Books | Salaries | News

**Archives:** 2008-2014 |
2015-2016 |
2017-2019 |
Book 1 |
Book 2 |
More

**Most popular articles**

- Free Book and Resources for DSC Members
- New Perspectives on Statistical Distributions and Deep Learning
- Time series, Growth Modeling and Data Science Wizardy
- Statistical Concepts Explained in Simple English
- Machine Learning Concepts Explained in One Picture
- Comprehensive Repository of Data Science and ML Resources
- Advanced Machine Learning with Basic Excel
- Difference between ML, Data Science, AI, Deep Learning, and Statistics
- Selected Business Analytics, Data Science and ML articles
- How to Automatically Determine the Number of Clusters in your Data
- Fascinating New Results in the Theory of Randomness
- Hire a Data Scientist | Search DSC | Find a Job
- Post a Blog | Forum Questions