[Problem record] Generating PDFs from HTML with Google Chrome

chaney1992 2021-08-08 20:39:57


Background:

The project needs to print web pages silently, and this cannot be done with the browser's built-in print function.

When the browser prints, it always pops up a print preview dialog, so silent printing is not possible.

Solution:

Google Chrome can print an HTML page directly to a PDF file. The PDF can then be printed silently.

Before invoking Chrome, we need to find where it is installed:

using System;
using System.Collections.Generic;
using System.IO;
using System.Runtime.InteropServices;
using Microsoft.Win32;

public static class ChromeFinder
{
    #region Get application directories
    private static void GetApplicationDirectories(ICollection<string> directories)
    {
        if (RuntimeInformation.IsOSPlatform(OSPlatform.Windows))
        {
            const string subDirectory = "Google\\Chrome\\Application";
            directories.Add(Path.Combine(Environment.GetFolderPath(Environment.SpecialFolder.ProgramFiles), subDirectory));
            directories.Add(Path.Combine(Environment.GetFolderPath(Environment.SpecialFolder.ProgramFilesX86), subDirectory));
        }
        else if (RuntimeInformation.IsOSPlatform(OSPlatform.Linux))
        {
            directories.Add("/usr/local/sbin");
            directories.Add("/usr/local/bin");
            directories.Add("/usr/sbin");
            directories.Add("/usr/bin");
            directories.Add("/sbin");
            directories.Add("/bin");
            directories.Add("/opt/google/chrome");
        }
        else if (RuntimeInformation.IsOSPlatform(OSPlatform.OSX))
            throw new Exception("Finding Chrome on MacOS is currently not supported, please contact the programmer.");
    }
    #endregion

    #region Get the current program directory
    private static string GetAppPath()
    {
        var appPath = AppDomain.CurrentDomain.BaseDirectory;
        if (appPath.EndsWith(Path.DirectorySeparatorChar.ToString()))
            return appPath;
        return appPath + Path.DirectorySeparatorChar;
    }
    #endregion

    #region Find
    /// <summary>
    /// Try to find the Chrome executable
    /// </summary>
    /// <returns>The full path to the executable, or null if it was not found</returns>
    public static string Find()
    {
        // On Windows, check the registry first. This is the most reliable way and it also
        // covers non-default install locations. Note that Chrome x64 is currently
        // (February 2019) also installed under Program Files (x86) and uses the same registry key.
        if (RuntimeInformation.IsOSPlatform(OSPlatform.Windows))
        {
            var key = Registry.GetValue(@"HKEY_LOCAL_MACHINE\SOFTWARE\WOW6432Node\Microsoft\Windows\CurrentVersion\Uninstall\Google Chrome", "InstallLocation", string.Empty);
            // GetValue returns null when the key is missing and the default (empty) when the value is missing
            var installLocation = key?.ToString();
            if (!string.IsNullOrEmpty(installLocation))
            {
                var path = Path.Combine(installLocation, "chrome.exe");
                if (File.Exists(path)) return path;
            }
        }

        // Collect the common executable names
        var exeNames = new List<string>();
        if (RuntimeInformation.IsOSPlatform(OSPlatform.Windows))
            exeNames.Add("chrome.exe");
        else if (RuntimeInformation.IsOSPlatform(OSPlatform.Linux))
        {
            exeNames.Add("google-chrome");
            exeNames.Add("chrome");
            exeNames.Add("chromium");
            exeNames.Add("chromium-browser");
        }
        else if (RuntimeInformation.IsOSPlatform(OSPlatform.OSX))
        {
            exeNames.Add("Google Chrome.app/Contents/MacOS/Google Chrome");
            exeNames.Add("Chromium.app/Contents/MacOS/Chromium");
        }

        // Check the directory the application is running from
        var currentPath = GetAppPath();
        foreach (var exeName in exeNames)
        {
            var path = Path.Combine(currentPath, exeName);
            if (File.Exists(path)) return path;
        }

        // Look for Chrome in the usual software installation directories
        var directories = new List<string>();
        GetApplicationDirectories(directories);
        foreach (var exeName in exeNames)
        {
            foreach (var directory in directories)
            {
                var path = Path.Combine(directory, exeName);
                if (File.Exists(path)) return path;
            }
        }
        return null;
    }
    #endregion
}

1. Command-line approach:

Start a Chrome process from the command line, passing in the page URL, the PDF output path and other options, to convert the HTML page to a PDF:

/// <summary>
/// Run a command through cmd.exe
/// </summary>
/// <param name="command">The command line to execute</param>
private void RunCMD(string command)
{
    Process p = new Process();
    p.StartInfo.FileName = "cmd.exe";
    p.StartInfo.UseShellExecute = false; // do not start through the operating system shell
    p.StartInfo.RedirectStandardInput = true; // accept input from the calling program
    p.StartInfo.RedirectStandardOutput = true; // capture the output of the process
    p.StartInfo.RedirectStandardError = true; // redirect standard error
    p.StartInfo.CreateNoWindow = true; // do not show a window
    p.Start(); // start the process
    // Drain standard error asynchronously so that a full stderr buffer cannot block the child process
    p.BeginErrorReadLine();
    // Send the command to the cmd window.
    // "&" is the batch command separator: exit runs whether or not the previous command succeeded.
    // Without the exit command, the ReadToEnd() call below would hang.
    // The related operators "&&" and "||" run the next command only if the previous one
    // succeeded or failed, respectively.
    p.StandardInput.AutoFlush = true;
    p.StandardInput.WriteLine(command + "&exit");
    // Read the output of the cmd window
    p.StandardOutput.ReadToEnd();
    p.WaitForExit(); // wait for the process to finish and exit
    p.Close();
}
public void GetPdf(string url, List<string> args = null)
{
    var chromeExePath = ChromeFinder.Find();
    if (string.IsNullOrEmpty(chromeExePath))
    {
        MessageBox.Show("Failed to locate the Chrome executable");
        return;
    }
    var outpath = Path.Combine(AppDomain.CurrentDomain.BaseDirectory, "tmppdf");
    if (!Directory.Exists(outpath))
    {
        Directory.CreateDirectory(outpath);
    }
    outpath = Path.Combine(outpath, DateTime.Now.Ticks + ".pdf");
    if (args == null)
    {
        args = new List<string>();
        args.Add("--start-in-incognito"); // incognito mode
        args.Add("--headless"); // no user interface
        args.Add("--disable-gpu"); // disable GPU acceleration
        args.Add("--print-to-pdf-no-header"); // generate the PDF without header and footer
        args.Add($"--print-to-pdf=\"{outpath}\" \"{url}\""); // write the PDF to the given path
    }
    string command = $"\"{chromeExePath}\"";
    if (args != null && args.Count > 0)
    {
        foreach (var item in args)
        {
            command += $" {item} ";
        }
    }
    // Measure how long the conversion takes
    Stopwatch sw = new Stopwatch();
    sw.Start();
    RunCMD(command);
    sw.Stop();
    MessageBox.Show(sw.ElapsedMilliseconds + "ms");
}

The main command-line switches are:

a) --headless: run without a user interface

b) --print-to-pdf-no-header: generate the PDF without header and footer

c) --print-to-pdf: print the page as a PDF; the value is the output path
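
As a minimal sketch, the switches above can be assembled into an argument string and handed to Chrome directly through ProcessStartInfo, without going through cmd.exe. The names HeadlessPrint, BuildArguments and CreateStartInfo are hypothetical and not part of the original code; the switch names are the ones used in this article and may differ between Chrome versions.

```csharp
using System;
using System.Diagnostics;

public static class HeadlessPrint
{
    // Builds the argument string for a one-shot headless print.
    // The output path and the URL are wrapped in double quotes so that spaces survive.
    public static string BuildArguments(string outputPath, string url)
    {
        if (string.IsNullOrEmpty(outputPath))
            throw new ArgumentException("Output path is required.", nameof(outputPath));
        if (string.IsNullOrEmpty(url))
            throw new ArgumentException("URL is required.", nameof(url));

        return string.Join(" ",
            "--headless",
            "--disable-gpu",
            "--print-to-pdf-no-header",
            $"--print-to-pdf=\"{outputPath}\"",
            $"\"{url}\"");
    }

    // Describes a Chrome process that is started directly, without a shell and without a window.
    public static ProcessStartInfo CreateStartInfo(string chromePath, string outputPath, string url)
    {
        return new ProcessStartInfo
        {
            FileName = chromePath,
            Arguments = BuildArguments(outputPath, url),
            UseShellExecute = false,
            CreateNoWindow = true
        };
    }
}
```

A caller could then run Process.Start(HeadlessPrint.CreateStartInfo(ChromeFinder.Find(), outpath, url)) and wait for the returned process to exit. Starting the executable directly makes the "&exit" trick unnecessary, because no cmd.exe session has to be closed.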

Known problems:

    • Each conversion spawns several Chrome processes (as many as 5). Creating processes this frequently is slow when the machine is under load, which makes PDF generation slower.
    • In some cases a process created by Chrome fails to exit completely, so subsequent PDF generation is never executed.

The command line of the leftover process looks like this: --type=crashpad-handler "--user-data-dir=xxx" /prefetch:7 --monitor-self-annotation=ptype=crashpad-handler "--database=xx" "--metrics-dir=xx" --url=https://clients2.google.com/cr/report --annotation=channel= --annotation=plat=Win64 --annotation=prod=Chrome

So is there a way to reuse a Chrome process and still generate PDFs? That is what the second approach does.

2. Chrome DevTools Protocol approach

The main steps of this approach are:

  • Start a headless Chrome process
#region Start the Chrome process
/// <summary>
/// Start the Chrome process; does nothing if it is already running
/// </summary>
/// <exception cref="ChromeException"></exception>
private void StartChromeHeadless()
{
    if (IsChromeRunning)
    {
        return;
    }
    _chromeProcess = new Process();
    // DefaultChromeArguments is not shown in this article. Chrome only prints the
    // "DevTools listening on" line when remote debugging is enabled, so the list
    // presumably contains --headless and a --remote-debugging-port switch.
    var processStartInfo = new ProcessStartInfo
    {
        FileName = _chromeExeFileName,
        Arguments = string.Join(" ", DefaultChromeArguments),
        CreateNoWindow = true,
        UseShellExecute = false,
        RedirectStandardError = true,
    };
    _chromeProcess.StartInfo = processStartInfo;
    _chromeProcess.EnableRaisingEvents = true;
    _chromeProcess.ErrorDataReceived += _chromeProcess_ErrorDataReceived;
    _chromeProcess.Exited += _chromeProcess_Exited;
    // Create the wait handle before starting, so the event handlers can always signal it
    _chromeEventException = null;
    _chromeWaitEvent = new ManualResetEvent(false);
    _chromeProcess.Start();
    _chromeProcess.BeginErrorReadLine();
    if (_conversionTimeout.HasValue)
    {
        if (!_chromeWaitEvent.WaitOne(_conversionTimeout.Value))
            throw new Exception($"Could not connect to Chrome DevTools within {_conversionTimeout.Value} ms");
    }
    else
    {
        _chromeWaitEvent.WaitOne();
    }
    _chromeProcess.ErrorDataReceived -= _chromeProcess_ErrorDataReceived;
    _chromeProcess.Exited -= _chromeProcess_Exited;
    // Surface any failure recorded by the event handlers
    if (_chromeEventException != null)
        throw _chromeEventException;
}

/// <summary>
/// Raised when the Chrome process exits
/// </summary>
/// <param name="sender"></param>
/// <param name="e"></param>
private void _chromeProcess_Exited(object sender, EventArgs e)
{
    try
    {
        if (_chromeProcess == null) return;
        var exception = Marshal.GetExceptionForHR(_chromeProcess.ExitCode);
        throw new Exception($"Chrome exited unexpectedly, {exception}");
    }
    catch (Exception exception)
    {
        _chromeEventException = exception;
        _chromeWaitEvent.Set();
    }
}

/// <summary>
/// Raised when Chrome writes data to its error output
/// </summary>
/// <param name="sender"></param>
/// <param name="args"></param>
private void _chromeProcess_ErrorDataReceived(object sender, DataReceivedEventArgs args)
{
    try
    {
        // Skip empty lines and log lines, which start with "["
        if (string.IsNullOrEmpty(args.Data) || args.Data.StartsWith("[")) return;
        if (!args.Data.StartsWith("DevTools listening on")) return;
        // DevTools listening on ws://127.0.0.1:50160/devtools/browser/53add595-f351-4622-ab0a-5a4a100b3eae
        var uri = new Uri(args.Data.Replace("DevTools listening on ", string.Empty));
        ConnectToDevProtocol(uri);
        _chromeProcess.ErrorDataReceived -= _chromeProcess_ErrorDataReceived;
        _chromeWaitEvent.Set();
    }
    catch (Exception exception)
    {
        _chromeEventException = exception;
        _chromeWaitEvent.Set();
    }
}
#endregion
  • Read the browser's WebSocket address from the process output and connect to it, then send Chrome a WebSocket message to open a new tab
WebSocket4Net.WebSocket _browserSocket = null;
Uri _browserUrl = null;
/// <summary>
/// Connect to the browser endpoint and ask Chrome to open a blank tab
/// </summary>
/// <param name="uri"></param>
private void ConnectToDevProtocol(Uri uri)
{
    // Create the socket connection
    // Browser endpoint: ws://127.0.0.1:50160/devtools/browser/53add595-f351-4622-ab0a-5a4a100b3eae
    _browserUrl = uri;
    _browserSocket = new WebSocket4Net.WebSocket(uri.ToString());
    _browserSocket.MessageReceived += WebSocket_MessageReceived;
    // Messages can only be sent once the socket is open
    _browserSocket.Opened += (sender, e) =>
    {
        JObject jObject = new JObject();
        jObject["id"] = 1;
        jObject["method"] = "Target.createTarget";
        jObject["params"] = new JObject();
        jObject["params"]["url"] = "about:blank";
        _browserSocket.Send(jObject.ToString());
    };
    _browserSocket.Open();
    // The socket for the tab is created once the reply with the tab id arrives.
    // Tab endpoint: {uri.Scheme}://{uri.Host}:{uri.Port}/devtools/page/<tab id>
}
  • Create a WebSocket connection to the new tab, as defined by the DevTools protocol
WebSocket4Net.WebSocket _pageSocket = null;
private void WebSocket_MessageReceived(object sender, WebSocket4Net.MessageReceivedEventArgs e)
{
    string msg = e.Message;
    var pars = JObject.Parse(msg);
    // Event notifications carry no "id"; only replies to commands do
    string id = pars["id"]?.ToString();
    switch (id)
    {
        case "1":
            // Reply to Target.createTarget: connect to the new tab
            var pageUrl = $"{_browserUrl.Scheme}://{_browserUrl.Host}:{_browserUrl.Port}/devtools/page/{pars["result"]["targetId"].ToString()}";
            _pageSocket = new WebSocket4Net.WebSocket(pageUrl);
            _pageSocket.MessageReceived += _pageSocket_MessageReceived;
            _pageSocket.Open();
            break;
    }
}
  • Send a command to the tab to navigate to the page that should be converted to PDF
// Send the navigate command
JObject jObject = new JObject();
jObject["method"] = "Page.navigate"; // method
jObject["id"] = 2; // id (the protocol expects an integer)
jObject["params"] = new JObject(); // parameters
jObject["params"]["url"] = "http://www.baidu.com";
_pageSocket.Send(jObject.ToString());
  • Finally, send the tab a command to generate the PDF
// Send the print command
jObject = new JObject();
jObject["method"] = "Page.printToPDF"; // method
jObject["id"] = 3; // id (the protocol expects an integer)
jObject["params"] = new JObject(); // print settings
jObject["params"]["landscape"] = false;
jObject["params"]["displayHeaderFooter"] = false;
jObject["params"]["printBackground"] = false;
_pageSocket.Send(jObject.ToString());
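
The steps above stop at sending Page.printToPDF and do not show how the PDF comes back. In the DevTools protocol, the reply to that command carries the document as a base64 string in result.data (with the default transfer mode). The following is a minimal sketch of building a request and decoding such a reply. It uses System.Text.Json instead of JObject only to stay free of external packages, and the names CdpMessages, Build and TryExtractPdf are hypothetical.

```csharp
using System;
using System.Text.Json;

public static class CdpMessages
{
    // Serializes a DevTools request. The protocol expects an integer "id".
    public static string Build(int id, string method, object parameters = null)
    {
        return JsonSerializer.Serialize(new { id, method, @params = parameters ?? new { } });
    }

    // Returns true and the decoded PDF bytes when the message is the successful
    // reply to the request with the given id; returns false for anything else.
    public static bool TryExtractPdf(string message, int requestId, out byte[] pdf)
    {
        pdf = null;
        using (var doc = JsonDocument.Parse(message))
        {
            var root = doc.RootElement;
            // Event notifications carry no "id"; error replies carry no "result".
            if (!root.TryGetProperty("id", out var idElement)) return false;
            if (idElement.ValueKind != JsonValueKind.Number) return false;
            if (idElement.GetInt32() != requestId) return false;
            if (!root.TryGetProperty("result", out var result)) return false;
            if (!result.TryGetProperty("data", out var data)) return false;
            pdf = Convert.FromBase64String(data.GetString());
            return true;
        }
    }
}
```

In the _pageSocket_MessageReceived handler, a call such as CdpMessages.TryExtractPdf(e.Message, 3, out var pdf) followed by File.WriteAllBytes would store the result. Note that Page.navigate replies before the page has finished loading, so the print command is usually sent only after a load event such as Page.loadEventFired, which requires sending Page.enable first.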

For the full list of supported commands, see the DevTools protocol documentation.

References:

DevTools protocol: Chrome DevTools Protocol - Page domain

Chrome command-line switches: List of Chromium Command Line Switches « Peter Beverloo
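
The two address manipulations used in the steps above, extracting the browser endpoint from Chrome's error output and deriving the tab endpoint from a target id, can also be isolated as pure functions, which makes them easy to check without a running browser. This is only a sketch; the names DevToolsEndpoint, TryParseBrowserUri and PageUrl are hypothetical.

```csharp
using System;

public static class DevToolsEndpoint
{
    private const string Prefix = "DevTools listening on ";

    // Extracts the browser WebSocket address from a line of Chrome's error output.
    // Returns false for any line that is not the "DevTools listening on" line.
    public static bool TryParseBrowserUri(string stderrLine, out Uri browserUri)
    {
        browserUri = null;
        if (string.IsNullOrEmpty(stderrLine)) return false;
        if (!stderrLine.StartsWith(Prefix, StringComparison.Ordinal)) return false;
        var address = stderrLine.Substring(Prefix.Length).Trim();
        return Uri.TryCreate(address, UriKind.Absolute, out browserUri);
    }

    // Derives the WebSocket address of a tab from the browser address and the target id.
    public static string PageUrl(Uri browserUri, string targetId)
    {
        return $"{browserUri.Scheme}://{browserUri.Host}:{browserUri.Port}/devtools/page/{targetId}";
    }
}
```

Using Uri.TryCreate instead of the Uri constructor means a malformed line yields false rather than an exception inside the event handler.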

Copyright notice
This article was written by [chaney1992]. Please include a link to the original when reposting. Thank you.
https://qdmana.com/2021/08/20210808203932292w.html
